leon2milan / imagerecognition Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 0.0 14.11 MB

triplet loss, focal loss, circle softmax, cos softmax, arc softmax

License: MIT License

Python 1.65% Jupyter Notebook 98.35%

imagerecognition's Introduction

Image Recognition

本项目用于构建图像类识别以及验证。

环境

conda activate darwinml_ve_2.3

数据收集

人脸数据

1. 下载格林深瞳开源数据集 (共计9.4万ID，280万张图片)。
更多数据可以查看
此外格林深瞳数据使用mxnet存储，可以使用python data/preprocess.py 进行处理。（其中age30， cfp-fp， lfw等为人脸的测试数据）
若使用格林深瞳数据需要 pip install -r requirments.txt
2. 更新环境变量 export PYTHONPATH=$model/facenet/src:$PYTHONPATH （$model 代表本代码所在目录）
3. 运行如下命令crop出人脸所在区域

python facenet/src/align/align_dataset_mtcnn.py ../datasets/asian_face/raw/ ../datasets/asian_face/imgs/ --image_size 112

如果识别对别其他类型数据，需要得到大小为[112, 112]的图像， align后的图像保存在$data/imgs下

也可以使用其他align model对齐人脸

公章数据

1. 使用fake 公章 + 真实公章数据。抽取20% fake 公章 + 真实公章数据作为test 数据。

需要大量各类型数据
fake 公章最好支持各种真实场景，如不同底纹，不同来源（拍照，影印等），不同角度等

2. 需要公章目标检测模型检测公章
3. 对其公章模型
4. 生成测试数据的pair文件

更新环境变量 export PYTHONPATH=$model:$PYTHONPATH
在data目录下运行python generate_pairs.py。需要修改代码中的测试数据集的路径。
test 数据集最好人工review，且固定不变。因为需要根据test 数据集生成best thresho

数据统一格式

数据统一格式如下（$data 代表数据所在目录），



$data/raw/
       ---> id1
            ---> id1_1.jpg
       ---> id2
            ---> id2_1.jpg
       ---> id3
            ---> id3_1.jpg
            ---> id3_2.jpg