The russian-ulmfit from noise-field

Vocabulary size

Dear author, could you please clarify what is the vocabulary size in your pretrained LMs? From the notebook, it seems that you leave only 60k most frequent words. However, from the files of your pretrained model, it seems that vocabulary is about 250k.

FastAI version

What is the FastAI version you used?
Thanks!

Fine-tuning issue

Good day!

I want to fine-tune your language model on my own data, however, I got a problem:

RuntimeError: Error(s) in loading state_dict for SequentialRNN:
	size mismatch for 0.rnns.0.weight_hh_l0_raw: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
	size mismatch for 0.rnns.0.module.weight_ih_l0: copying a param with shape torch.Size([4600, 400]) from checkpoint, the shape in current model is torch.Size([4608, 400]).
	size mismatch for 0.rnns.0.module.weight_hh_l0: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
	size mismatch for 0.rnns.0.module.bias_ih_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
	size mismatch for 0.rnns.0.module.bias_hh_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
	size mismatch for 0.rnns.1.weight_hh_l0_raw: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
	size mismatch for 0.rnns.1.module.weight_ih_l0: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
	size mismatch for 0.rnns.1.module.weight_hh_l0: copying a param with shape torch.Size([4600, 1150]) from checkpoint, the shape in current model is torch.Size([4608, 1152]).
	size mismatch for 0.rnns.1.module.bias_ih_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
	size mismatch for 0.rnns.1.module.bias_hh_l0: copying a param with shape torch.Size([4600]) from checkpoint, the shape in current model is torch.Size([4608]).
	size mismatch for 0.rnns.2.module.weight_ih_l0: copying a param with shape torch.Size([1600, 1150]) from checkpoint, the shape in current model is torch.Size([1600, 1152]).

What version of fast.ai did you use?
My code is:

learn = language_model_learner(
    data_lm, 
    AWD_LSTM, 
    pretrained=False,
    drop_mult=0.3,
    pretrained_fnames=['lm_5_ep_lr2-3_5_stlr', 'itos']
)

Before that, I've changed config:

config = awd_lstm_lm_config.copy()
config['n_hid'] = 1150

My fast.ai version is 1.0.61
Thank you!

noise-field / russian-ulmfit Goto Github PK

russian-ulmfit's People

Contributors

Stargazers

Watchers

Forkers

russian-ulmfit's Issues

Vocabulary size

FastAI version

Fine-tuning issue

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent