Comments (6)
It is very likely that the tracking would be faster with fewer points being tracked, but it might become a bit less accurate.
Another way to speed up the model is to perform fewer scales when tracking and looking at smaller window sizes. Try playing around with window_sizes_small
and window_sizes_init
in the LandmarkDetectorParameters
, changing window_sizes_small
will allow you to control how many scales are run during the tracking and how big is the region of interest.
Thanks,
Tadas
from openface.
Thank you for your helpful comment!
I appreciate it.
from openface.
Could you please describe the meaning of the parameters, window_sizes_small
and window_sizes_init
, in more detail? How do they relate to the number of scales and the size of the ROI?
Per default the parameters are
window_sizes_init.at(0) = 11;
window_sizes_init.at(1) = 9;
window_sizes_init.at(2) = 7;
window_sizes_init.at(3) = 5;/
window_sizes_small[0] = 0;
window_sizes_small[1] = 9;
window_sizes_small[2] = 7;
window_sizes_small[3] = 0;
Is there a special reason to use this ordering, [0,9,7,9]
, for the window_sizes_small
parameters?
Many Thanks!
from openface.
Each of the numbers in window_sizes_small
and window_sizes_init
describes the search area (region of interest in pixels, e.g. 11x11
).
The vector describes the regions of interest for each scale, so for example of the current windows are set to window_sizes_init
, then the algorithm will search an area of 11x11
for scale 1, area of 9x9
for scale 2, area of 7x7
for scale 3, and finally area of 5x5
at scale 4.
In case the window size is set to 0 (as is the case for window_sizes_small
in scales 1 and 4), that scale will be skipped. That means the algorithm will only look at a 9x9
area for scale 2 and a 7x7
for scale 3.
Hope this helps.
Thanks,
Tadas
from openface.
Many thanks for the explanation! This helps a lot, but what is meant with "scale 1" or "scale 2"? Where are the scale pyramids defined? Or is it just sampling every pixel (scale = 1), every second (scale = 2) ... ?
from openface.
OpenFace performs landmark detection through 4 pyramid scales, each patch expert is trained to deal with face images of a particular size/scale. The area of interest for each patch expert (centered around current best landmark eztimate) is actually resized and in-plane rotation corrected using bilinear interpolation instead of just pixel resampling. This is done in the following call in PatchExperts.cpp
:
cv::warpAffine(grayscale_image, area_of_interest, sim, area_of_interest.size(), cv::WARP_INVERSE_MAP + CV_INTER_LINEAR);
from openface.
Related Issues (20)
- bboxdir text files does not work with `x_min y_min x_max y_max` format
- BLAS : Program is Terminated. Because you tried to allocate too many memory regions HOT 6
- .mp4 does not support memory mapping
- make failed HOT 1
- Could not find the HAAR face detector location HOT 18
- Problems with GUI (Images shown too short) HOT 2
- macOS make issue HOT 3
- How to turn off the display before building OpenFace? HOT 3
- Memory allocate error HOT 5
- How to get the camera-to-world matrix?
- Issues making OpenFace work in Visual Studio 2017 HOT 22
- Can I call the api in python to read the face box? HOT 4
- No documentation for option to choose face detector HOT 12
- The AU intensities generated by OpenFace is inconsistent with the AU intensities manually marked
- GPU support for feature extraction HOT 1
- How to output csv file from input mp4 using openfaceoffline? HOT 6
- I encountered the following problem when compiling the project, I would like to ask how to solve it ah HOT 1
- FaceAnalyzer DetectLandmarks completely crashes PI 5
- Confidence precision levels in CSV file HOT 7
- Requesting help with "cmake -D WITH_OPENMP=ON CMAKE_BUILD_TYPE=RELEASE .." HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from openface.