Giter Club home page Giter Club logo

annotator-embeddings's Introduction

Annotator-Embeddings

Cleaned repo for our paper You Are What You Annotate: Towards Better Models through Annotator Representations at Findings of EMNLP 2023.


Citation

@misc{deng2023annotate,
      title={You Are What You Annotate: Towards Better Models through Annotator Representations}, 
      author={Naihao Deng and Xinliang Frederick Zhang and Siyang Liu and Winston Wu and Lu Wang and Rada Mihalcea},
      year={2023},
      eprint={2305.14663},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

What our paper is about

  • Rather than aggregating labels, we propose a setting of training models to directly learn from data that contains inherent disagreements.

  • We propose TID-8, The Iherent Disagreement - 8 dataset, a benchmark that consists of eight existing language understanding datasets that have inherent annotator disagreements.

  • We propose weighted annotator and annotation embeddings, which are model-agnostic and improve model performances on six out of the eight datasets in TID-8.

  • We conduct a detailed analysis on the performance variations of our methods and how our methods can be potentially grounded to realworld demographic features.


Structure of this repo

├── ablation_studies: scripts for ablations
│   ├── annotation_tendencies
│   ├── annotator_accs
│   ├── disagreement_examples
│   ├── heatmaps
│   ├── performance_ablation
│   ├── person_annotation_bars
│   ├── spider_plots
│   └── tsne_plots
├── experiment-results: raw experimental results and the processing script
└── src
    ├── example-data: data for each dataset
    └── src: modeling scripts
        ├── baseline_models
        ├── dataset
        ├── metrics
        ├── tokenization
        ├── training_paradigm
        ├── transformer_models
        └── utils

You may create the python environment by using the environment.yml file.


Other links to resources for our paper

annotator-embeddings's People

Contributors

dnaihao avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.