dccuchile / wefe

WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Word Embeddings models. Please feel welcome to open an issue in case you have any questions or a pull request if you want to contribute to the project!

Home Page: https://wefe.readthedocs.io/

License: MIT License

Python 100.00%
word-embeddings word-embedding-evaluation word-embedding-fairness fairness-ml fairness-ai nlp nlp-library library python3 bias-detection

wefe's Introduction


WEFE: The Word Embedding Fairness Evaluation Framework

WEFE Logo

Word Embedding Fairness Evaluation (WEFE) is an open source library for measuring and mitigating bias in word embedding models. It generalizes many existing fairness metrics into a unified framework and provides a standard interface for:

  • Encapsulating existing fairness metrics from previous work and designing new ones.
  • Encapsulating the test words used by fairness metrics into standard objects called queries.
  • Computing a fairness metric on a given pre-trained word embedding model using user-given queries.
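For example, measuring bias typically takes only a handful of lines. The following is a minimal sketch (the word lists are illustrative) that wraps a gensim model, builds a query and runs the WEAT metric on it:

import gensim.downloader as api

from wefe import Query, WEAT, WordEmbeddingModel

# Wrap any gensim KeyedVectors object in a WordEmbeddingModel.
model = WordEmbeddingModel(api.load("glove-wiki-gigaword-100"), "glove-100")

# A query pairs target word sets (e.g., social groups) with attribute word sets.
query = Query(
    target_sets=[["he", "his", "son", "father"], ["she", "her", "daughter", "mother"]],
    attribute_sets=[["science", "physics", "chemistry"], ["poetry", "art", "literature"]],
    target_sets_names=["Male terms", "Female terms"],
    attribute_sets_names=["Science", "Arts"],
)

# Run a fairness metric (here, WEAT) on the model using the query.
result = WEAT().run_query(query, model)
print(result)  # dict with 'query_name', 'result', 'weat', ...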

WEFE also standardizes the process of mitigating bias through an interface similar to the scikit-learn fit-transform. This standardization separates the mitigation process into two stages:

  • The logic of calculating the transformation to be performed on the model (fit).
  • The execution of the mitigation transformation on the model (transform).
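A minimal sketch of this mitigation interface is shown below, using the Hard Debias method shipped with WEFE. The definitional pairs and keyword-argument names are illustrative assumptions; see the debias API documentation for the exact signatures.

import gensim.downloader as api

from wefe import WordEmbeddingModel
from wefe.debias.hard_debias import HardDebias

model = WordEmbeddingModel(api.load("glove-wiki-gigaword-100"), "glove-100")

# fit: compute the transformation (here, a gender direction) from word pairs.
hard_debias = HardDebias()
hard_debias.fit(
    model,
    definitional_pairs=[["she", "he"], ["woman", "man"], ["her", "his"]],  # illustrative pairs
)

# transform: apply the mitigation to the model's embeddings.
debiased_model = hard_debias.transform(model, copy=True)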

The official documentation can be found at https://wefe.readthedocs.io/.

Installation

There are two different ways to install WEFE:

To install the package with pip:

pip install wefe

To install the package with conda:

conda install -c pbadilla wefe

Requirements

The following packages will be installed along with WEFE if they are not already present:

  1. numpy
  2. scipy
  3. scikit-learn
  4. pandas
  5. gensim
  6. plotly
  7. requests
  8. tqdm
  9. semantic_version

Contributing

You can download the code by executing:

git clone https://github.com/dccuchile/wefe

To contribute, visit the Contributing section in the documentation.

Development Requirements

To install the dependencies needed for developing, testing and building the WEFE documentation, run:

pip install -r requirements-dev.txt

Testing

All unit tests are in the wefe/tests folder. WEFE uses pytest as the framework to run them.

To run the tests, execute:

pytest tests

To check the coverage, run:

pytest tests --cov-report xml:cov.xml --cov wefe

And then:

coverage report -m

Build the documentation

The documentation is created using sphinx. It can be found in the docs folder at the root of the project. To compile the documentation, run:

cd docs
make html

Then, you can visit the documentation at docs/_build/html/index.html

Changelog

Version 0.4.1

  • Fixed a bug where the last pair of target words in RIPA was not included.
  • Added a benchmark to the documentation that compares WEFE with other bias measurement and mitigation libraries.
  • Added a documentation page describing the library changes since the original paper release.

Version 0.4.0

  • 3 new bias mitigation methods (debias) implemented: Double Hard Debias, Half Sibling Regression and Repulsion Attraction Neutralization.
  • The library documentation has been restructured. It is now divided into a user guide and conceptual guides: the user guide contains no theoretical information, which is instead covered in the conceptual guides.
  • Improved the API documentation and the metric and debias examples. Added multilingual examples contributed by the community.
  • The user guides are now notebooks and are therefore fully executable.
  • Improved library testing mechanisms for metrics and debias methods.
  • Fixed the wrong repr of Query; the sets now appear in the correct order.
  • Implemented repr for WordEmbeddingModel.
  • Testing CI moved from CircleCI to GithubActions.
  • License changed to MIT.

Version 0.3.2

  • Fixed an RNSB bug in which the classification labels were swapped, which could produce erroneous results when the attribute sets have different sizes.
  • Fixed the RNSB replication notebook.
  • Updated the WEFE case study scores.
  • Improved documentation examples for WEAT, RNSB and RIPA.
  • Added a holdout parameter to RNSB that indicates whether a holdout set is used when training the classifier.
  • Improved the printing of the RNSB evaluation.

Version 0.3.1

  • Updated the WEFE original case study.
  • Hotfix: several bug fixes for executing the WEFE original case study.
  • fetch_eds top_n_race_occupations argument set to 10.
  • Preprocessing: get_embeddings_from_set now returns a list with the lost preprocessed words instead of the original ones.

Version 0.3.0

  • Implemented Bolukbasi et al. 2016 Hard Debias.
  • Implemented Thomas Manzini et al. 2019 Multiclass Hard Debias.
  • Implemented a fetch function to retrieve gn-glove female-male word sets.
  • Moved the logic for transforming words, sets and queries into embeddings to its own module: preprocessing.
  • Replaced the `preprocessor_args` and `secondary_preprocessor_args` metric preprocessing parameters with a list of preprocessors (`preprocessors`), together with a `strategy` parameter indicating whether to consider all the transformed words ('all') or only the first one found ('first').
  • Renamed the WordEmbeddingModel attributes `model` and `model_name` to `wv` and `name`, respectively.
  • Renamed the `run_query` argument `word_embedding` to `model` in every metric.

Version 0.2.2

  • Added RIPA metrics (thanks @stolenpyjak for your contribution!).
  • Fixed Literal typing bug to make WEFE compatible with python 3.7.

Version 0.2.1

  • Compatibility fixes.

Version 0.2.0

  • Renamed the optional `run_query` parameter `warn_filtered_words` to `warn_not_found_words`.
  • Added a `word_preprocessor_args` parameter to `run_query` that allows specifying transformations prior to searching for words in the word embeddings.
  • Added a `secondary_preprocessor_args` parameter to `run_query` that allows specifying a second preprocessing transformation applied to words before searching for them in the word embeddings. The first preprocessor does not need to be specified in order to use this one.
  • Implemented the `__getitem__` method in `WordEmbeddingModel`, which allows obtaining the embedding of a word from the stored model using the indexing operator (see the short example after this list).
  • Removed underscore from class and instance variable names.
  • Improved type and verification exception messages when creating objects and executing methods.
  • Fixed an error that appeared when calculating rankings with two aggregation columns with the same name.
  • Ranking correlations are now calculated using pandas `corr` method.
  • Changed metric template, name and short_names to class variables.
  • Implemented `random_state` in RNSB to allow replication of the experiments.
  • `run_query` now returns the default metric requested in the parameters as its result, along with any other calculated values that may be useful in the remaining keys of the dictionary.
  • Fixed a problem with the API documentation: it now shows the methods of the classes.
  • Implemented the p-value for WEAT.
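For instance, the `__getitem__` change above means that a word's embedding can be retrieved by indexing the model directly. A small illustrative sketch:

import gensim.downloader as api

from wefe import WordEmbeddingModel

model = WordEmbeddingModel(api.load("glove-wiki-gigaword-100"), "glove-100")

# __getitem__: retrieve the embedding of a word by indexing the wrapped model.
embedding = model["science"]
print(embedding.shape)  # e.g., (100,) for 100-dimensional GloVe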

Citation

Please cite the following paper if using this package in an academic publication:

P. Badilla, F. Bravo-Marquez, and J. Pérez. WEFE: The Word Embeddings Fairness Evaluation Framework. In Proceedings of the 29th International Joint Conference on Artificial Intelligence and the 17th Pacific Rim International Conference on Artificial Intelligence (IJCAI-PRICAI 2020), Yokohama, Japan.

Bibtex:

@InProceedings{wefe2020,
    title     = {WEFE: The Word Embeddings Fairness Evaluation Framework},
    author    = {Badilla, Pablo and Bravo-Marquez, Felipe and Pérez, Jorge},
    booktitle = {Proceedings of the Twenty-Ninth International Joint Conference on
               Artificial Intelligence, {IJCAI-20}},
    publisher = {International Joint Conferences on Artificial Intelligence Organization},
    pages     = {430--436},
    year      = {2020},
    month     = {7},
    doi       = {10.24963/ijcai.2020/60},
    url       = {https://doi.org/10.24963/ijcai.2020/60},
    }

Team

Contributors

We thank all our contributors who have allowed WEFE to grow, especially stolenpyjak and mspl13 for implementing new metrics.

We also thank alan-cueva for initiating the development of metrics for contextualized embedding models and harshvr15 for the examples of multi-language bias measurement.

Thank you very much 😊!

wefe's People

Contributors

dependabot[bot], felipebravom, h4c5, harshvr15, kr-ramesh, mspl13, mzambrano1, pbadillatorrealba, szabgab, yupei-du


wefe's Issues

Missing WEAT words

There are two versions of unpleasant_5 words in the original WEAT paper. One of them is used in WEAT 1, and the other is used in WEAT 3. (For example, the former includes "prison," while the latter does not.) However, wefe/datasets/data/WEAT.json has only the former.

I fixed the problem by renaming unpleasant_5 to unpleasant_5a and adding unpleasant_5b (This is the fix branch. https://github.com/kato8966/wefe/tree/fix-weat-words), but this breaks backward compatibility.

Should I proceed and make a PR, or should I try a different approach?

Import error: cannot import name 'BaseKeyedVectors' from 'gensim.models.keyedvectors'

Thanks for making this very useful python package.

I tried to use it. But get an error with the following line:
from wefe.query import Query


---------------------------------------------------------------------------
ImportError                               Traceback (most recent call last)
<ipython-input-3-02319299b065> in <module>
----> 1 from wefe.query import Query
      2 from wefe.word_embedding_model import WordEmbeddingModel
      3 from wefe.metrics.WEAT import WEAT
      4 #import gensim.downloader as api

/anaconda/envs/py37_default/lib/python3.7/site-packages/wefe/__init__.py in <module>
----> 1 from .word_embedding_model import WordEmbeddingModel
      2 from .query import Query
      3 from .metrics.base_metric import BaseMetric
      4 from .metrics.WEAT import WEAT
      5 from .metrics.RND import RND

/anaconda/envs/py37_default/lib/python3.7/site-packages/wefe/word_embedding_model.py in <module>
----> 1 from gensim.models.keyedvectors import BaseKeyedVectors
      2 
      3 
      4 class WordEmbeddingModel:
      5     """A container for Word Embedding pre-trained models.

ImportError: cannot import name 'BaseKeyedVectors' from 'gensim.models.keyedvectors' (/anaconda/envs/py37_default/lib/python3.7/site-packages/gensim/models/keyedvectors.py)


Metrics not discussed in the paper

Hello,

The WEFE paper presented at IJCAI discusses WEAT, RND, and RNSB, but it does not discuss MAC and ECT.

I wondered whether there was an updated version that discusses those?

Without opening a new issue, the reference for MAC should actually be

Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings

Instead in the documentation is written as:

Black is to criminals caucasian is to police: Detecting and removing multiclass bias in word embeddings

(criminals is plural and "as" is missing: it's a minor thing, but the reference manager doesn't find it when written this way :))

RNSB Deprecation Warning

/home/raffaele/.local/lib/python3.9/site-packages/wefe/metrics/RNSB.py:150: VisibleDeprecationWarning: Creating an ndarray from ragged nested sequences (which is a list-or-tuple of lists-or-tuples-or ndarrays with different lengths or shapes) is deprecated. If you meant to do this, you must specify 'dtype=object' when creating the ndarray.
  probabilities = np.array([
/home/raffaele/.local/lib/python3.9/site-packages/wefe/metrics/RNSB.py:156: VisibleDeprecationWarning: Creating an ndarray from ragged nested sequences (which is a list-or-tuple of lists-or-tuples-or ndarrays with different lengths or shapes) is deprecated. If you meant to do this, you must specify 'dtype=object' when creating the ndarray.
  negative_probabilities = np.array(

Information about the pre-loaded wordsets (Dataloaders)

Hi Pablo,
Greetings!

1). I was trying to get results on multi-lingual fastText embeddings by running queries using WEAT_wordsets() and RND_wordsets(). I am not sure whether I can use the same set of queries for multi-lingual embeddings. However, I am getting some results for this.

WEAT_wordsets = load_weat()
RND_wordsets = fetch_eds()

glove_embedding = WordEmbeddings('de')
glove_keyed_vectors = glove_embedding.precomputed_word_embeddings
glove_100 = WordEmbeddingModel(glove_keyed_vectors, 'de')

result = weat.run_query(gender_3, glove_100, lost_vocabulary_threshold=0.3, warn_not_found_words=True, calculate_p_value=True, p_value_iterations=10000)
print(result)

RESULT:
{'query_name': 'Male terms and Female terms wrt Science and Arts', 'result': 0.50751936, 'weat': 0.50751936, 'effect_size': 0.9096829, 'p_value': 0.0034996500349965005}

But I am not sure whether it is correct. Can you please advise whether these word set queries can be used for multi-lingual embeddings and, if not, how I can proceed?

2). Can you please share any links where I can find information about the pre-loaded word sets (load_weat() and fetch_eds())?

For now, I just know:
fetch_eds(): Fetches the word sets used in the experiments of the work Word Embeddings *Quantify 100 Years Of Gender And Ethnic Stereotypes.
load_weat(): Loads the word sets used in the paper Semantics Derived Automatically From Language Corpora Contain Human-Like Biases.

I couldn't find any links about what data they are extracted from, what sets they contain, or any further information.

Furthermore,

For example, in this query:

gender_1 = Query(
    [RND_wordsets["male_terms"], RND_wordsets["female_terms"]],
    [WEAT_wordsets["career"], WEAT_wordsets["family"]],
    ["Male terms", "Female terms"],
    ["Career", "Family"],
)

(Correct me if I'm wrong) You are taking male_terms and female_terms as target sets, and career and family as attribute sets, from these dataloaders. What other sets (like male_terms, adjectives_intelligence, female_terms, etc.) are there in these dataloaders? You have described a few in your queries; are those the only ones, or are there others that can be considered?

Hope to hear from you soon.

Thanks,
Harsh

How to reproduce table 1 as in the paper?

Hello WEFE developers,

I am trying to reproduce your paper results. It looks to me the best reference is your WEFE-ranking notebook example. Obviously, the notebook can't run directly, with runtime errors as in this issue. I can fix the syntax problem, but detailed experiment settings are unknown.

Other errors can show up like:

Exception: After dividing the datset using stratified train_test_split, the attribute 0 has 0 training examples.
...
Exception: Error during executing the query: Christianity terms and Islam terms wrt Conservative and Terrorism on the model: word2vec-google-news-300

Can you provide a complete notebook simply for reproducing your paper table 1?

Thanks a lot!

Issue with RIPA metric

Hi,

I tried to compute RIPA with a single base pair, but I only get nan:

emb_model = WordEmbeddingModel(model)

query = Query(
    target_sets=[["he"], ["she"]],
    attribute_sets=[["doctor", "nurse"]],
    target_sets_names=["male", "female"],
    attribute_sets_names=["profession"],
)

metric = metrics.RIPA()
metric.run_query(query, emb_model, lost_vocabulary_threshold=1.0)

#{'query_name': 'male and female wrt profession',
# 'result': nan,
# 'ripa': nan,
# 'word_values': {'doctor': {'mean': nan, 'std': nan},
#  'nurse': {'mean': nan, 'std': nan}}}

I think this comes from wefe/metrics/RIPA.py line 115 :

# calculating the ripa score for each attribute word with each target pair
for word in range(len(attribute_embeddings_0)):
    ripa_scores[attributes[word]] = []
    for index in range(target_length - 1): # <- Should it be range(target_length) instead ?
        bvec = self._b_vec(
            target_embeddings_0[index], target_embeddings_1[index]
        )
        score = self._ripa_calc(attribute_embeddings_0[word], bvec)
        ripa_scores[attributes[word]].append(score)

Is there a reason why the last element is not used?

Questions RNSB metric

Hi,
I would ask you some questions about WEFE in order to calculate bias with RNSB.

I am using two target sets and two attribute sets and running queries in this form: Query([target1_array, target2_array],[positive_attribute_array, negative_attribute_array], [ntarget1, ntarget2], [nattribute1, nattribute2]). For attributes I have 26 or 50 items in total. I am working with 3 different word embedding models.
My questions are:

  1. Why am I obtaining the same results across different runs?
  2. Why am I obtaining very different results with the same words depending on the model used? For example, in most cases I get RNSB = 0.20 for one model and RNSB = 0.60 / 0.70 for the others. It seems strange.

I read the source code but I don't understand. Maybe I have a problem with my data.

Thank you very much.

Best regards,
Elisabetta Triolo.

Availability for embeddings created with transformer models

Hi, I'm using transformer models to create my embeddings but would like to calculate the bias. Is there a way to just feed in the embeddings instead of the embedding models or to introduce a transformer model as embedding model?
Would there be a difference in the calculation of the WEAT score?
Many thanks

ECT score

Hi,

I'm using the WEFE package to measure bias in different word embeddings. However, when I use ECT the results are not clear to me. As mentioned in the original paper "Attenuating Bias in Word Vectors", the higher the ECT score the better, meaning that the higher the score, the less biased a word embedding is. I'm wondering whether you take that into consideration in the ranking process?

Thanks,
Fatma

WEAT effect size: Different values

I get slightly different values when using WEFE's weat.run_query() than when doing the calculations manually as below:

def getCosine(a, b):
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

list_avg_diffs= []

avg_t1_a1= 0
avg_t1_a2= 0
avg_t1_diff= 0

for t in t1: #t1= ['brother', 'father', 'uncle', 'grandfather', 'son']
    x= w2v[t]
    for a in a1: #a1= ['science', 'technology', 'physics', 'chemistry', 'Einstein', 'NASA', 'experiment', 'astronomy']
        y= w2v[a]
        avg_t1_a1= avg_t1_a1 + getCosine(x,y)
    avg_t1_a1= avg_t1_a1/len(a1)
    
    for a in a2: #a2= ['poetry', 'art', 'Shakespeare', 'dance', 'literature', 'novel', 'symphony', 'drama']
        y= w2v[a]
        avg_t1_a2= avg_t1_a2 + getCosine(x,y)
    avg_t1_a2= avg_t1_a2/len(a2)

    avg_t1_diff= avg_t1_diff + (avg_t1_a1 - avg_t1_a2)
    list_avg_diffs.append(avg_t1_a1 - avg_t1_a2)

avg_t1_diff= avg_t1_diff/len(t1)

avg_t2_a1= 0
avg_t2_a2= 0
avg_t2_diff= 0

for t in t2: #t2= ['sister', 'mother', 'aunt', 'grandmother', 'daughter']
    x= w2v[t]
    for a in a1: #a1= ['science', 'technology', 'physics', 'chemistry', 'Einstein', 'NASA', 'experiment', 'astronomy']
        y= w2v[a]
        avg_t2_a1= avg_t2_a1 + getCosine(x,y)
    avg_t2_a1= avg_t2_a1/len(a1)
    
    for a in a2: #a2= ['poetry', 'art', 'Shakespeare', 'dance', 'literature', 'novel', 'symphony', 'drama']
        y= w2v[a]
        avg_t2_a2= avg_t2_a2 + getCosine(x,y)
    avg_t2_a2= avg_t2_a2/len(a2)

    avg_t2_diff= avg_t2_diff + (avg_t2_a1 - avg_t2_a2)
    list_avg_diffs.append(avg_t2_a1 - avg_t2_a2)

avg_t2_diff= avg_t2_diff/len(t2)

diff_of_diff= avg_t1_diff - avg_t2_diff
sd= np.std(list_avg_diffs) #use ddof=1 in case you need same sd as in R
weat_effect_size= diff_of_diff/sd

The weat_effect_size above is: 1.6132258039368363. However, that obtained using weat.run_query() is: 1.674.

Not sure why this difference of 0.06 in effect size. Request help.

WEFE documentation is inconsistent with the literature

The WEFE documentation says that a target set:

A target word set (denoted by ) corresponds to a set of words intended to denote a particular social group
e.g., gender, social class, age, and ethnicity

while and attribute set:

An attribute word set (denoted by ) is a set of words representing some attitude, characteristic, trait, occupational field, etc.

But in this paper the two concepts are reversed, with target meaning mathematics and arts (occupational fields) and attribute meaning male and female (a particular social group).

Is it a problem of the paper alone?

Are those terms used consistently in the literature?

What happens to the WEAT if I exchange attribute and target sets?

(Is the WEFE documentation right?)

WEAT p-value is nan

Hello,

the WEAT p-value is always nan.

This happened for two different models I have tried.

Is it normal?

WEAT returns nothing

I use this code to compute a WEAT:

wefemodel = WordEmbeddingModel(wv, model_name)
query = Query(target_sets, attribute_sets, target_sets_names, attribute_sets_names)
result_weat = weat.run_query(query, wefemodel, 
                                     calculate_p_value = True,
                                     return_effect_size = True)

But sometimes the returned result_weat does not include a p_value key:

KeyError: 'p_value'

I think it depends on the model.

For some of my models the returned dictionary does not include this key.

Is it possible it depends on the model?

Error on import in Google Colab (v0.3.2) with PL

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-1-ad60f770c107> in <module>()
----> 1 from wefe import Query, WEAT
      2 import gensim.downloader as api
      3 
      4 word2vec_model = model

13 frames
/usr/local/lib/python3.7/dist-packages/distributed/config.py in <module>()
     18 
     19 with open(fn) as f:
---> 20     defaults = yaml.load(f)
     21 
     22 dask.config.update_defaults(defaults)

TypeError: load() missing 1 required positional argument: 'Loader'

RNSB Error

I'm getting the following error.

What may be the cause?

---------------------------------------------------------------------------
Exception                                 Traceback (most recent call last)
<ipython-input-22-1b733a31758e> in <module>
     37 
     38     # RNSB
---> 39     result_rnsb = rnsb.run_query(query, wefemodel)
     40     print(result_rnsb)

~/.local/lib/python3.9/site-packages/wefe/metrics/RNSB.py in run_query(self, query, word_embedding, estimator, estimator_params, num_iterations, random_state, print_model_evaluation, lost_vocabulary_threshold, preprocessor_args, secondary_preprocessor_args, warn_not_found_words, *args, **kwargs)
    335 
    336             # train the logit with the train data.
--> 337             trained_classifier, score = self._train_classifier(
    338                 attribute_embeddings_dict=attribute_embeddings,
    339                 random_state=random_state,

~/.local/lib/python3.9/site-packages/wefe/metrics/RNSB.py in _train_classifier(self, attribute_embeddings_dict, estimator, estimator_params, random_state, print_model_evaluation)
     96         # Check the number of train and test examples.
     97         if num_train_positive_examples == 1:
---> 98             raise Exception(
     99                 'After dividing the datset using stratified train_test_split, '
    100                 'the attribute 0 has 0 training examples.')

Exception: After dividing the datset using stratified train_test_split, the attribute 0 has 0 training examples.

Problem with the library typing

I tried to test the wefe library through Colaboratory and it does not work due to compatibility issues with the typing library (since Colab runs Python 3.7). I got this problem:

ImportError: cannot import name 'Literal' from 'typing' (/usr/lib/python3.7/typing.py)

Perhaps a solution to this problem is to add the following lines in the code:

try:
    from typing import Literal
except ImportError:
    from typing_extensions import Literal

word_embedding not found under wefe

ImportError: cannot import name 'word_embedding' from 'wefe' (/Users/xx/PycharmProjects/pythonProject/venv/lib/python3.8/site-packages/wefe/__init__.py)

please help :)

ImportError: cannot import name 'Literal' from 'typing' in Python 3.7

Hello!

I have an error running WEFE in Python versions < 3.8 because metrics/WEAT.py imports Literal from typing:

"/home/appuser/venv/lib/python3.7/site-packages/wefe/metrics/WEAT.py", line 2, in
from typing import Any, Dict, List, Literal, Set, Tuple, Union
ImportError: cannot import name 'Literal' from 'typing'

This is a known issue for typing, see here.

I would suggest a quick adjustment to WEAT.py, line 2:

try:
    from typing import Literal
except ImportError:
    from typing_extensions import Literal

Please let me know if feasible.

Thanks,
Tia

[Question] One-sided p-value for WEAT?

The WEAT is a very useful metric. I was wondering if there is a way to use WEFE to obtain the one-sided p-value described on page 9 of the following paper on WEAT?

Caliskan, A., Bryson, J. J., & Narayanan, A. (2017). Semantics derived automatically from language corpora contain human-like biases. Science, 356(6334), 183-186.

Thanks,
sbs
