yuvalpinter / mimick Goto Github PK

View Code? Open in Web Editor NEW

151.0 9.0 34.0 20.33 MB

Code for Mimicking Word Embeddings using Subword RNNs (EMNLP 2017)

License: GNU General Public License v3.0

Python 100.00%

word-embeddings neural-network lstm convolutional-neural-networks part-of-speech-tagger multilingual

mimick's People

Contributors

Stargazers

Watchers

mimick's Issues

Code runs very slow on GPU

I ran a mimick algorithm on a small data set and it is taking 5 mins on CPU. But when I run the same on GPU it is taking 40 mins to finish one epoch. Is there a way to fix this?

Can we increase the batch size of this?

Transformer Models

Hi, is it possible to integrate it with transformer-based models, such as a variation of BERT?

params in MomentumSGDTrainer

Hi I was trying out your demo when I run into error at line 166, Mimick/mimick/model.py
trainer = dy.MomentumSGDTrainer(model.model, options.learning_rate, 0.9, 0.1)

The error message shows that MomentumSGDTrainer takes 3 parameters, as in
MomentumSGDTrainer(ParameterCollection &m, real learning_rate = 0.01, real mom = 0.9)

wondering is there a version conflict? But I installed the v2.0 dynet, following your README.

So what is this last parameter 0.1? Do I just simply delete it?

Thanks in advance!

Call `initial_state()` on all levels of BiLSTM

Currently, all of the dynet code for LSTMs in the tagging task code (model.py) is not making use of the initial_state() method. This entails:

Word-level LSTM states are not reset between sentences (although the computation graph is renewed so there's no trans-backprop).
Char-level LSTM in char2tag or both mode keeps its state across words along the entire dataset. Within sentences, this means there is also backprop across word boundaries since there's no call to renew_cg(). This effect may be insignificant due to the <PAD> characters, but I don't know for sure.

yuvalpinter / mimick Goto Github PK

mimick's People

Contributors

Stargazers

Watchers

Forkers

mimick's Issues

Code runs very slow on GPU

Transformer Models

params in MomentumSGDTrainer

Call `initial_state()` on all levels of BiLSTM

Add early stopping

Compatibility with Python 3

in_vocab count is set to zero

Remove `START_TAG` and `STOP_TAG` from `model.py` and `make_dataset.py`

Find best Trainer

Model arguments confusion!!

Char2Tag takes wrong representations from backward LSTM

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent