Giter Club home page Giter Club logo

mnre's Introduction

MNRE

Resource and Code for ICME 2021 paper MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social Media Posts

Recent Updates

  • 2021.6.22 We provide MNRE-2, a refined version which merges several ambigious categories with much more support samples. The original version has been moved to Version-1

Overview

This project aims to present a new task -- multimodal neural relation extraction and a dataset (MNRE) for model evaluation. The MNRE task requires a understanding of both vision and language. We envisage a range of well-designed methods and resources for such a challenge that would boost the development of multimodal alignment towards a higher semantic level.

This is an example of multimodal relation extraction in Twitter. There are three entities in this sentence: "JFK", "Obama" and "Harvard". The main task of relation extraction is to identify the relations of each entity pair. Previous works incorrectly classify the relations of "JFK" and "Harvard" as "Residence" and the "JFK" and "Obama" as "Spouse" due to the missing of contexts. However, we can know that "JFK" and "Harvard" are in the relation of "Graduated at" with the visual concepts "Bachelor cap" and "Gown". Still, the relations of "JFK" and "Obama" can be identified as "Alumni" with the guidance of all the visual objects about "Campus".

Data Statistics

Data Statistics Compared to Previous NRE Dataset

Dataset # Image # Word # Sentence # Entity # Relation # Instance
SemEval-2010 Task 8 - 205k 10,717 21,434 9 8,853
ACE 2003-2004 - 297k 12,783 46,108 24 16,771
TACRED - 1,823k 53,791 152,527 41 21,773
FewRel - 1,397k 56,109 72,124 100 70,000
MNRE 9,201 258k 9,201 30,970 23 15,485

This is the version 2 of our MNRE dataset. We refine and merge some categories for better understandings. The dataset contains 15,484 samples and 9,201 images with 23 relation categories. We split the dataset into training, development and testing set with 12247, 1624 and 1614 samples, respectively.

Category Distribution

We start tagging relation types depending on the entity types. For example, the relations between one person and another person can be classified into ''alumni'', ''couple'' and ''relative'' et al.

Data Collection

We build the original corpus from three sources: two available multimodal named entity recognition datasets - Twitter15 and Twitter17, and crawling data from Twitter.

We utilize a pretrained NER tagging tool elmo for extracting both entities and their corresponding types.

Data Usage

Our processed textual relations are in ./mnre_txt/, the image data can be downloaded here.

Each sentence is split into several instances (depending on the number of relations). Each line contains

'token': Texts preprocessed by a tokenizer
'h': Head entities and their positions in a sentence
't': Tail entities and their positions in a sentence
'image_id': You can find the corresponding images using the link above
'relation': The relations and entity categories

Case Study

Four examples for illustrating the effectiveness of visual information in extracting relations. The first line shows that the visual objects and their attributes can help in identifying relations. Further more, we show that the interactions of person-to-person or person-to-object can also provide clues for classifying relations.

Citation

If you find this repo helpful, please cite the following:

@inproceedings{zheng2021mnre,
  title={MNRE: A Challenge Multimodal Dataset for Neural Relation Extraction with Visual Evidence in Social Media Posts},
  author={Zheng, Changmeng and Wu, Zhiwei and Feng, Junhao and Fu, Ze and Cai, Yi},
  booktitle={2021 IEEE International Conference on Multimedia and Expo (ICME)},
  pages={1--6},
  year={2021},
  organization={IEEE}
}

mnre's People

Contributors

thecharm avatar flash0303 avatar

Stargazers

lizou avatar  avatar JIMMY ZHAO avatar  avatar  avatar Rui avatar  avatar 林昕奎 avatar Divine avatar  avatar  avatar Augus Yan avatar  avatar lisx avatar Pengfei Luo avatar Faith avatar Tianhao Gao avatar  avatar ray avatar  avatar  avatar  avatar Zhenbin,Chan avatar BOBOTANG avatar Xinyu Wang avatar wurentidai avatar  avatar Chen haojie avatar Shiping avatar  avatar Essen avatar Yongliang Shen avatar ASTONE avatar WangZhen avatar BenfengXu avatar  avatar  avatar  avatar MrChen18 avatar  avatar Yuqi Bu avatar Tapas Nayak avatar  avatar  avatar  avatar

Watchers

longlongman avatar  avatar

mnre's Issues

The images in mnre_obj_org.zip seem to be damaged

After I downloaded the mnre_obj_org.zip, I found that the images in it seem to be damaged. I wondered what is the difference between mnre_obj_org.zip and mnre_img_org.zip? And if I want to use the data for my work, which zip file should I use?

image

Missing entity labels in MNRE dataset

In the mnre_txt, there are only the entity spans in the files without entity labels. From the paper, it seems that the entity labels are utilized. Could you share the dataset with entity labels?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.