luoyetx / face-alignment-at-3000fps Goto Github PK

View Code? Open in Web Editor NEW

197.0 197.0 120.0 911 KB

C++ implementation of Face Alignment at 3000 FPS via Regressing Local Binary Features

License: BSD 3-Clause "New" or "Revised" License

C++ 97.91% CMake 2.09%

face-alignment-at-3000fps's Issues

Dataset site is offline

The dataset required by this algorithm is offline.
Another mirror is required.
I suggest putting it on the repo.

代码速度问题

我把3000fps运行程序在i5的电脑上运行速度是4ms左右，然而将代码移植到arm-v7 的开发板上运行，运行速度是90ms 左右，正常情况下，电脑和板子的速度比是1:6；请问是什么原因呢？

variance question

I understood this code.
'double m1 = cv::mean(vec_)[0];'
'double m2 = cv::mean(vec_.mul(vec_))[0];'
'double variance = m2 - m1*m1;'

But I don't understand formula below.
'double variance_all = (calcVariance(delta_shapes.col(0)) + calcVariance(delta_shapes.col(1)))*N;'

Does 2-D variance addition each 1-dimensional variance?
What means of N?
N is count of training sample. But I don't understand why multiply N.

Could you please provide some reference material?

Assertion fail on FaceAlignment prepare

line 49 from prepare.cpp: assert(inFile && outFile);

./FaceAlignment prepare
FaceAlignment: /home/bh/git/alignment/face1/src/prepare.cpp:49: void genTxt(const string&, const string&): Assertion `inFile && outFile' failed.
[1]    8831 abort (core dumped)  ./FaceAlignment prepare

why current_shapes[idx] = bboxes_[i].ReProject(bboxes_[k].Project(gt_shapes_[k])) ?

Dear luoyetx,
i am reading your code, and i find that in the trian.cpp, line 180, i can not understand why we do like this? one face's initial shape is initialize by another face?

int N = imgs_.size();
int L = N*config.initShape_n;
vector<Mat> imgs(L), gt_shapes(L), current_shapes(L);
vector<BBox> bboxes(L);
RNG rng(getTickCount());
for (int i = 0; i < N; i++) {
    for (int j = 0; j < config.initShape_n; j++) {
        int idx = i*config.initShape_n + j;
        int k = 0;
        do {
            //if (i < (N / 2)) k = rng.uniform(0, N / 2);
            //else k = rng.uniform(N / 2, N);
            k = rng.uniform(0, N);
        } while (k == i);
        imgs[idx] = imgs_[i];                               
        gt_shapes[idx] = gt_shapes_[i];
        bboxes[idx] = bboxes_[i];
        current_shapes[idx] = bboxes_[i].ReProject(bboxes_[k].Project(gt_shapes_[k])); <----line 180
    }
}

for (int i = 0; i < N; i++) {
fscanf(fd, "%s", img_path);
for (int j = 0; j < 4; j++) {
fscanf(fd, "%lf", &bbox[j]);
}
for (int j = 0; j < landmark_n; j++) {
fscanf(fd, "%lf%lf", &x[j], &y[j]);
}
Mat img = imread(img_path);
// crop img
double x_min, y_min, x_max, y_max;
x_min = *min_element(x.begin(), x.end());
x_max = *max_element(x.begin(), x.end());
y_min = *min_element(y.begin(), y.end());
y_max = *max_element(y.begin(), y.end());
x_min = max(0., x_min - bbox[2] / 2);
x_max = min(img.cols - 1., x_max + bbox[2] / 2);
y_min = max(0., y_min - bbox[3] / 2);
y_max = min(img.rows - 1., y_max + bbox[3] / 2);
double x_, y_, w_, h_;
x_ = x_min; y_ = y_min;
w_ = x_max - x_min; h_ = y_max - y_min;
BBox bbox_(bbox[0] - x_, bbox[1] - y_, bbox[2], bbox[3]);
Rect roi(x_, y_, w_, h_);
img = img(roi).clone();

    Mat gray;
    cvtColor(img, gray, CV_BGR2GRAY);
    LOG("Run %s", img_path);
    Mat shape = lbf_cascador.Predict(gray, bbox_);
    img = drawShapeInImage(img, shape, bbox_);
    imshow("landmark", img);
    waitKey(0);
}

test的时候用到了ground truth的数据，是不是在做另外其他图像没有.pts文件就不能检测了?

Assertion error on void genTxt()

When I try to ./FaceAlignment prepare:

FaceAlignment: /home/bh/git/face-alignment-at-3000fps/src/prepare.cpp:49: void genTxt(const string&, const string&): Assertion `inFile && outFile' failed.

Lib versions problem

$ ./FaceAlignment prepare    
$ ./FaceAlignment: /home/bh/anaconda3/lib/libgomp.so.1: version `GOMP_4.0' not found (required by ./FaceAlignment)
$ ./FaceAlignment: /home/bh/anaconda3/lib/libstdc++.so.6: version `GLIBCXX_3.4.21' not found (required by ./FaceAlignment)

Trying to use ./FaceAlignment prepare causes this error, related to wrong library versions.
Any tip to solve this?

webcam model test?

I've tried jwyang's implementation of the algorithm. The result is quite poor. I tired it on webcam, and the points detected are always shaking. and the side face detection is also unacceptable. So what is the differences between your code and jwyang's. Any improvement on that?

怎样写数据库？

数据库是不是只要往Path_Images_train.txt和Path_Images_test.txt中写好.jpg和.pts的路径就好了？训练集大概需要多少张图片呢

SWAP of landmarks

I see in the data_augmentation function of train.cpp, there is macro SWAP. I am worried that it destroys the data integrity. May I know why this is being done ?

Thank you

SimilarityTransform returning identity

Hi,

I see that the calcSimilarityTransform() function always returns identity matrix for rotation and only the scale is computed which is also close to 1 in almost all the cases. Is this a bug in the function or is it how it should be ? (Mean face landmarks are always vertical because of the flipped shapes.)

Because, if you use lfpw/afw/helen datasets, there are many cases where the face is not vertical but tilted up to some degrees. It returns identity rotation in these cases as well.

Could you help me with this problem ?
Thanks

能分享一下训练好的模型吗？

我想测试一下该算法的精度，能够分享一份已经训练好的模型吗？谢谢~！！我的邮箱[email protected]

Dataset site is offline (http://ibug.doc.ic.ac.uk/resources/facial-point-annotations)

The dataset required by this algorithm is offline (http://ibug.doc.ic.ac.uk/resources/facial-point-annotations)
Another mirror is required.
I suggest putting it on the repo.

out of Memory

Hi， i successed in training the model using 300w which contain about 4k images.

but when i do data-argument(including rotation and offset faceRect)，the images from 4K to 100k.

and when training again,it occur a error ,i find in function parseTxt(), the cpu available from 28G to 1G

after loading about 18k images.

how can i solve the problem about cpu memory.

Thanks

lbp.cpp
#define FREE_MODEL(model)

That wasn't happend on running under X86 mode.

Did that happened before?
@luoyetx

Regards!

Rui

is pupils landmark wrong in common.cpp?

in common.cpp line 210.
int pupils[][6] = { { 36, 37, 38, 39, 40, 41 }, { 42, 43, 44, 45, 46, 47 } };

but in the ibug dataset landmarks, pupils should be 37 to 42, and 43 to 48.

luoyetx / face-alignment-at-3000fps Goto Github PK

face-alignment-at-3000fps's Issues

Recommend Projects

Recommend Topics

Recommend Org