Giter Club home page Giter Club logo

fashionculturedatabase_dloader's Introduction

Fashion Culture DataBase (FCDB)

Issues

  • Mar. 4, 2020: YFCC100M, the source dataset of Fashion Culture DataBase currently may have an issue on downloading. Please check updates of this page.

Updates

  • Mar. 26, 2020: Pre-train weights are published
  • Mar. 4, 2020: Repository is published
  • Nov. 8, 2019: Repository creation

Summary

FCDB has been constructed based on the following papers.

Kaori Abe, Teppei Suzuki, Shunya Ueta, Akio Nakamura, Yutaka Satoh, Hirokatsu Kataoka
"Changing Fashion Cultures," arXiv pre-print:1703.07920, 2017.

Hirokatsu Kataoka, Kaori Abe, Munetaka Minoguchi, Akio Nakamura and Yutaka Satoh
"Ten-million-order Human Database for World-wide Fashion Culture Analysis,"
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW), 2019.

The repository provides codes and bounding boxes (bboxes) in order to construct FCDB which is based on YFCC100M dataset. Please note that we are NOT serving original images and meta information including YFCC100M dataset. Therefore, please download YFCC100M images yourself by following the Yahoo's instruction. We are sharing only person bboxes which are corresponding to YFCC100M images. The detailed sharing files are shown below.

  • Image identification number (Image ID) and bboxes on FCDB
  • 3 types of dataset representation
    • Images divided into 16 directories
    • Pascal VOC format (for person detection)
    • MS COCO format (for person detection)

Our FCDB is also applied as a large-scale pre-training dataset for person detection. Please see also the document.

Munetaka Minoguchi, Ken Okayama, Yutaka Satoh, Hirokatsu Kataoka
“Weakly Supervised Dataset Collection for Robust Person Detection”
arXiv pre-print:2003.12263, 2020.

Citation

If you use the dataset or codes, please cite the following:

@inproceedings{KataokaCVPRW2019_FCDB,
  author={Hirokatsu Kataoka, Kaori Abe, Munetaka Minoguchi, Akio Nakamura and Yutaka Satoh},
  title={Ten-million-order Human Database for World-wide Fashion Culture Analysis},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshop (CVPRW)},
  year={2019},
}

@inproceedings{Minoguchi_WSPD,
  author={Munetaka Minoguchi, Ken Okayama, Yutaka Satoh, Hirokatsu Kataoka},
  title={Weakly Supervised Dataset Collection for Robust Person Detection},
  booktitle={arXiv pre-print:2003.12263},
  year={2020},
}

Requirements

  • python 3
  • numpy, xml, json, argparse
  • 400 GB vacant space in your computer

Preparation

A user must download in advance, due to FCDB has constructed based on YFCC100M. The rights including copyright and license are belonged to YFCC100M. Please refer to the description of YFCC100M YFCC100M. The required data can be available on yfcc100m_dataset on Amazon s3.

Download

  • Image ID and bboxes
    Please fill out the form to obtain a file which contains image ID and bboxes. After our confirmation, we will send an email to get the file.

  • Pre-train weights
    It shares the trained weights of M2Det and SSD which are trained FCDB. The configuration of each detector follows the default settings of each original repository.
    Download link is here.

Running the code

We provide three types of dataset representation. Please see the following instruction what you want. Please properly set a directory path in your environment.

16 cities

FCDB is divided into 16 directories. The directory is corresponding at each city.

python ImageFolder.py --yfcc='./yfcc100m_dataset' \
                        --id_json='./image_id_list.json' \
                        --save_dir='./FCDBv2'

Pascal VOC (for Person Detection)

FCDB is transformed by Pascal VOC form which is used in object detection. The image ID and bbox are paired.

python VocFomat.py --yfcc='./yfcc100m_dataset' \
                        --id_json='./image_id_list.json' \
                        --save_dir='./FCDBv2'

MS COCO (for Person Detection)

FCDB is transformed by MS COCO form which is used in object detection. The image ID and bbox are paired.

python CocoFomat.py --yfcc='./yfcc100m_dataset' \
                        --id_json='./image_id_list.json' \
                        --save_dir='./FCDBv2'

fashionculturedatabase_dloader's People

Contributors

mametch avatar hirokatsukataoka16 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.