Giter Club home page Giter Club logo

gender-discrimination-nlp's Introduction

♂️ Gender discrimination in Natural Language Processing ♀️

This repository contains a project realized as part of the Ethics in Artificial Intelligence course of the Master's degree in Artificial Intelligence, University of Bologna.

Description

The aim of this project is to develop a proof of concept about how to address the gender discrimination in NLP. Two approaches have been investigated:

  • Hard-Debiasing on pre-trained Italian Word Embeddings
  • GN-GloVe which reduce the bias during the training of word embedidngs

In order to have a deeper understanding of the problem, take a look at the presentation of the project.

Repository structure

.
├── data                             # Contains the files of words used for the experiments
├── debiaswe                         # Contains debiasing functions 
│   ├── co_occurrence.py             # Functions to compute the co-occurence matrix for GN-Glove
│   ├── data.py                      # Functions to load data files
│   ├── debias_glove.py              # Actual implementation of GN-Glove debiasing
│   ├── metrics.py                   # Functions to compute metrics for the experiments 
│   └── we.py                        # Auxiliar functions to load and manage word embeddings
├── embeddings                       # Contains the word embeddings file for the hard-debiasing approach
├── scripts                          # Contains the scripts to convert the original twitter word embeddings to a tsv file and fileter 
├── gn-glove_we_visualization.ipynb  # Visualization of the word embeddings generated by GN-Glove
├── hard_debias_italian_we.ipynb     # Visualization of the word embeddings generated by Hard-Debiasing                        
├── presentation.pdf                 # Slides about the project
├── LICENSE
└── README.md

Results

The results of both approaches are presented below:

  • Hard-Debiasing:

  • GN-GloVe:

Versioning

We use Git for versioning.

Group members

Name Surname Email Username
Davide Angelani [email protected] qnozo
Eric Rossetto [email protected] Erhtric
Giuseppe Murro [email protected] gmurro
Salvatore Pisciotta [email protected] SalvoPisciotta
Xiaowei Wen [email protected] WenXiaowei

License

This project is licensed under the MIT License - see the LICENSE file for details

gender-discrimination-nlp's People

Contributors

erhtric avatar gmurro avatar salvopisciotta avatar wenxiaowei avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.