Giter Club home page Giter Club logo

race_ar_baselines's Introduction

RACE Reading Comprehension Task

Code for the paper: RACE: Large-scale ReAding Comprehension Dataset From Examination. Guokun Lai*, Qizhe Xie*, Hanxiao Liu, Yiming Yang and Eduard Hovy. EMNLP 2017

Leaderboard of RACE

Dependencies

  • Python 2.7
  • Theano >= 0.7
  • Lasagne 0.2.dev1

Datasets

  • RACE: Please submit a data request here. The data will be automatically sent to you. Create a "data" directory alongside "src" directory and download the data.

  • Word embeddings:

Usage

Preprocessing

* python preprocess.py

Stanford AR

* test pre-trained model: bash test_SAR.sh
* train: bash train_SAR.sh (The pre-trained model will be replaced)

GA

* test pre-trained model: bash test_GA.sh
* train: bash train_GA.sh (The pre-trained model will be replaced)

Reference

@inproceedings{lai2017large,
  title={RACE: Large-scale ReAding Comprehension Dataset From Examinations},
  author={Lai, Guokun and Xie, Qizhe and Liu, Hanxiao and Yang, Yiming and Hovy, Eduard},
  booktitle={EMNLP},
  year={2017}
}

Acknowledgement

Contact

  • Please contact Qizhe Xie (qzxie AT cs DOT cmu DOT edu) if you find bugs or missing info

License

MIT

race_ar_baselines's People

Contributors

qizhex avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

race_ar_baselines's Issues

test pretrained model failed

 [root@localhost src]# bash test_GA.sh 
!!!test
WARNING (theano.configdefaults): g++ not detected ! Theano will be unable to execute optimized C-implementations (for both CPU and GPU) and will default to Python implementations. Performance will be severely degraded. To remove this warning, set Theano flags cxx to an empty string.
WARNING (theano.sandbox.cuda): CUDA is installed, but device gpu1 is not available  (error: cuda unavailable)
/root/.pyenv/versions/yan35/lib/python3.5/site-packages/theano/tensor/signal/downsample.py:6: UserWarning: downsample module has been moved to the theano.tensor.signal.pool module.
  "downsample module has been moved to the theano.tensor.signal.pool module.")
06-27 16:35 main.py -train_file ../data/data/train -dev_file ../data/data/test -embedding_size 100 -pre_trained ../obj/model_GA.pkl.gz -test_only True -model GA -num_GA_layers 1 -hidden_size 128
06-27 16:35 --------------------------------------------------
06-27 16:35 Load data files..
06-27 16:35 ********** Train
06-27 16:35 #Examples: 87866
06-27 16:35 ********** Dev
06-27 16:35 #Examples: 4934
06-27 16:35 --------------------------------------------------
06-27 16:35 Build dictionary..
06-27 16:35 --------------------------------------------------
06-27 16:35 Embeddings: 50002 x 100
06-27 16:35 Compile functions..
Traceback (most recent call last):
  File "main.py", line 309, in <module>
    main(args)
  File "main.py", line 216, in main
    train_fn, test_fn, params, all_params = build_fn(args, embeddings)
  File "main.py", line 82, in build_fn
    rnn_layer=args.rnn_layer)
  File "/root/yanqiang/12-RACE_AR_baselines/src/nn_layers.py", line 87, in stack_rnn
    network = _rnn(True, name)
  File "/root/yanqiang/12-RACE_AR_baselines/src/nn_layers.py", line 84, in _rnn
    name=name + '_layer' + str(layer + 1))
  File "/root/.pyenv/versions/yan35/lib/python3.5/site-packages/lasagne/layers/recurrent.py", line 1180, in __init__
    super(GRULayer, self).__init__(incomings, **kwargs)
TypeError: __init__() got an unexpected keyword argument 'only_return_final'

I tried to run test_GA.sh,but it failed,can you tell me what caused this?

Performance of the AR model

Hi,

I tried to use the Stanford Attentive Reader on the data, testing the pretrained model viabash test_SAR.sh. I downloaded both the glove embeddings, which I put under data/embedding, and the RACE data, under data/data. However, the resulting accuracy on both development and test data is only around .36-.39, and not > .4, as stated in the paper. There is no error message as far as I can tell, the output of the script is appended. Any idea about what could have gone wrong?

Looking forward to an answer!

log.txt

About the AR model

Hi
I re-build a AR model with tensorflow, but the loss not decrease. i can't find the difference between your model and mine. could you public the training log?
Thank you very much!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.