Giter Club home page Giter Club logo

polyvore-dataset's Introduction

Polyvore Dataset

Dataset used in ACM MM'17 paper "Learning Fashion Compatibility with Bidirectional LSTMs" [paper] [code]

This dataset is also available on Google Drive.

Contact

Author: Xintong Han

Contact: [email protected]

Polyvore.com

Polyvore.com is a popular fashion website, where user can create and upload outfit data. Here is an exmaple.

Dataset

Download and decompress polyvore.tar.gz.

Polyvore outfits

This dataset contains 21,889 outfits from polyvore.com, in which 17,316 are for training, 1,497 for validation and 3,076 for testing. The train, validation and test outfits are in {train,valid,test}_no_dup.json, respectively.

Each JSON item has the following information:

{
    "name": Name of the outfit, 
    "views": Number of views of the outfit,
    "items": [
        Fashion items in the outfit.
        {
            "index": Index of the fashion item in this outfit on Polyvore,
            "name": Description of the fashion item,
            "price": Price of the fashion item (usually in US dollars),
            "likes": Number of likes of the item,
            "image": Image url of the item,
            "categoryid": Category ID of the item,
        }, 
        {
            ...
        }, 
        ...
    ], 
    "image": Outfit image url,
    "likes": Number of likes of the outfit,
    "date": Upload date of the outfit,
    "set_url": Outfit url,
    "set_id": Outfit ID,
    "desc": Outfit description.
}

If you do not want to download all these images using their urls, they are in polyvore-images.tar.gz on Google Drive. This file contains the images of 33,375 outfits, which include all 21,889 outfits in polyvore dataset. The other ~11k outfits are uploaded more than 3 years ago. We are afraid that they are out-of-fashion so we do not use them).

category_id.txt contains the maping between category ID and category name. Thanks Zhenyu for providing it!

Fill-in-the-blank Fashion Recommendation

fill_in_the_blank_test.json contains the questions used to evaluate in the fill-in-the-blank fashion recommendation task. It follows the following format:

{
    "question": Fashion item sequence to form the question,
    "answers": Multiple choice set to choose from,
    "blank_position": The blank position to be filled in.
},

The name of a fashion item is SetID_ItemIndex, e.g., 119704139_1 is the fashion item with "index" 1 in the outfit with "set_id" 119704139. The first answer in "answers" is the correct one (i.e., original fashion item in the outfit).

Fashion Compatibility Prediction

fashion-compatibility-prediction.txt contains ~7,000 outfits, where 4,000 are incompatible and 3,000 are compatible.

In each line the first number indicating the compatibility (1 is compatible, 0 is not) followed by a sequence of fashion items consisting the outfit.

Some Notes

  1. These outfits are crawled around 02/19/2017, so you can estimate the extact upload date of an outfit by looking at the "date" filed.

  2. For outfits that contain too many fashion items, we only keep their first 8 items.

  3. We delete the fashion items with non-fashion "categoryid" such as background, texts, decorations. As a result, the indices of items in an outfit may not be consecutive.

Citation

If this dataset helps your research, please cite our paper:

@inproceedings{han2017learning,
  author = {Han, Xintong and Wu, Zuxuan and Jiang, Yu-Gang and Davis, Larry S},
  title = {Learning Fashion Compatibility with Bidirectional LSTMs},
  booktitle = {ACM Multimedia},
  year  = {2017},
}

Other Datasets from Polyvore.com

There are several datasets crawled from Polyvore.com:

  1. The Elements of Fashion Style. [paper] [dataset]

  2. Mining Fashion Outfit Composition Using An End-to-End Deep Learning Approach on Set Data. [paper] [dataset]

  3. NeuroStylist: Neural Compatibility Modeling for Clothing Matching. [paper] [dataset]

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.