Giter Club home page Giter Club logo

mtaal's Introduction

MTAAL

[Title] MTAAL: Multi-Task Adversarial Active Learning for Medical Named Entity Recognition and Normalization

[Authors] Baohang Zhou, Xiangrui Cai, Ying Zhang, Wenya Guo, Xiaojie Yuan

AAAI 2021 paper [video]

Preparation

  1. Clone the repo to your local.
  2. Download Python version: 3.6.5.
  3. Download the word embeddings from the following websites. Put them into the "pretrain" folder. (Word2Vec and Glove)
  4. Open the shell or cmd in this repo folder. Run this command to install necessary packages.
pip install -r requirements.txt

Experiments

  1. Before running models, you should run this command to handle the dataset. You can choose the parameters to change datasets and word embeddings.
python preprocess.py --dataset=[ncbi, cdr] --wordembedding=[word2vec, glove]
  1. You can input the following command to run the different active learning models. There are different choices for some parameters shown in []. The meaning of these parameters are shown in the following tables.
Parameters Value Description
epoch int Query times for active learning
label float The split proportion for initial labeled set
unlabel float The split proportion for initial unlabeled set
test float The split proportion for test set
query_num int The number of query samples
ad_task str Choose whether to use Task Adversarial Learning
task str Choose the task to run model. "all" is multi-task scenario
al str Choose the active learning method.
python main.py params \
--epoch=70 \
--label=0.2 \
--unlabel=0.7 \
--test=0.1 \
--batch_size=32 \
--query_num=64 \
--ad_task=[True, False] \
--dataset=[ncbi, cdr] \
--rnn_units=64 \
--task=[all, ner, nen] \
--gpu=[True, False] \
--al=[diversity, random, lc, entropy, mnlp]
  1. After running the model, the test result is saved in the "results" folder.

PS: We use the evaluation metrics as described in this paper (Zhao et al.).

mtaal's People

Contributors

zhoubaohang avatar

Stargazers

 avatar pengpeng avatar Steven Zhao avatar  avatar Junchi Zhang avatar

Watchers

James Cloos avatar  avatar Jc Guo avatar

mtaal's Issues

Glove链接失效

可否更新下链接
或者明确下使用的是哪一个版本
谢谢

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.