deepbaksuvision / yolo9000 Goto Github PK

View Code? Open in Web Editor NEW

1.0 4.0 3.0 70 KB

yolo9000's Introduction

YOLO9000

yolo9000's People

Contributors

Stargazers

Watchers

Forkers

visionnoob ssaru cris-j-dev

yolo9000's Issues

Paper Review : YOLO9000: Better, Faster, Stronger

https://arxiv.org/abs/1612.08242

Survey for Python coding convention Cheat-sheet

ImageNet pretrained darknet-19 weight for pytorch

https://s3.ap-northeast-2.amazonaws.com/deepbaksuvision/darknet19-deepbaksu.pth

Repository에 기본적인 test 및 travis 관리

아래 내용 알아보고, 공부하기
그리고 적용하기.

python 버전은 3.6으로 고정하기
mypy 사용하기.
http://mypy-lang.org/
https://item4.github.io/2017-09-14/Python-Typing-with-mypy/
python makefile 알아보기
https://github.com/kkweon/icarebot/blob/master/Makefile
https://krzysztofzuraw.com/blog/2016/makefiles-in-python-projects.html
pytest 알아보기
https://twpower.github.io/15-install-pytest-and-basic-usage
doctest 알아보기
https://python-guide-kr.readthedocs.io/ko/latest/writing/tests.html
pytest --cov 옵션 알아보기
특정 폴더의 코드를 coverage 대상으로 지정
isort 알아보기
import 순서 정리
https://pypi.org/project/isort/
black 알아보기
코드 정리
https://github.com/ambv/black
https://black.readthedocs.io/en/stable/
pylint 알아보기
https://github.com/kkweon/icarebot/blob/master/.pylintrc
개발환경 고정
virtualenv
pipreqs 알아보기
requirements.txt 자동 생성
https://github.com/bndr/pipreqs

NMS 함수 만들기

Test data로 확인
Test data merge가 원하는 방식으로 되는지 확인

K-means cluster measure distance IOU 바꾸기

Measure distance를 교체할 수 있도록 짜기

Test data
- PASCAL VOC 데이터 labeling loader 짜기
Test
- PASCAL VOC 데이터 clustering 후, darknetv2 cfg 비교

Darknet-19 pretrained weight 에서 input normalization 문제.

pretrained weight를 가져와서 테스트해보니
input을 zero-mean으로 하면 안되고
그냥 그대로(?) to_tensor로 만들어야 정답이 제대로 나옵니다.

가령 아래와 같이 짜면 값이 안 나오고

dataset = dset.ImageFolder(root="samples/",
                           transform=transforms.Compose([
                               transforms.Resize((256,256)),
                               transforms.ToTensor(),       # Tensor로 바꾸고 (0~1로 자동으로 normalize)
                               transforms.Normalize(mean=[0.485, 0.456, 0.406],
                                                     std=[0.229, 0.224, 0.225]),
                           ]))

이렇게 해야합니다..

dataset = dset.ImageFolder(root="samples/",
                           transform=transforms.Compose([
                               transforms.Resize((256,256)),
                               transforms.ToTensor()      # Tensor로 바꾸고 (0~1로 자동으로 normalize)
                           ]))

#13 Darknet19 의 마지막 linear activation의 정체

https://github.com/pjreddie/darknet/blob/master/cfg/darknet19.cfg

darknet19.cfg 를 보면 마지막 conv_layer의 activation이 linear_layer 입니다.
이때 linear_layer의 정체는 뭘까요?
우선은 Torch.nn.modules.linear 으로 구현해놨습니다.

대충은 알겠는데 bias가 들어가는지가 제일 궁금하네요..
조만간 코드를 한번 뜯어보겠습니다.

We also shrink the network to operate on 416 input images instead of 448×448. We do this because we want an odd number of locations in our feature map so there is a single center cell. Objects, especially large objects, tend to occupy the center of the image so it’s good to have a single location right at the center to predict these objects instead of four locations that are all nearby. YOLO’s convolutional layers downsample the image by a factor of 32 so by using an input image of 416 we get an output feature map of 13 × 13

여기에서