There are two types of categories in the MetaData, i.e. category_id and semantic_category. A possible explaination for the performance of the existing Compatibility Learning models is that these models are learning categories. We discuss that if there is similar categories in the negetive outfit samples (low quality negative samples), the model can perform well in compatibility learning task when it only learns categories. To evaluate this assumption, we train some predictive models on the dataset when only the category of the images are present in the dataset.
We use the compatibility list provided in the polyvore dataset. This list includes 16,995 positive and 16,995 negetive samples of outfits. For each instance, we only include the category_id in the training dataset. category_id is an integer number for each image. The integer is between 1-224. Similarly we treat the test dataset which includes 15,145 positve instances and 15,145 negative instances. You can use creat_dataset_from_metadata_handler.py to generate these lists.
We train a Decision Tree, a Logistic Regression, and a MLP network on this training set (cat_id_importance_eval.py). The results show no better than random prediction for all three models.
Decision Tree:
Logistic Regression:
MLP:
semantic_category
Compatibility Learning