
arnet's People

Contributors

chenxinpeng


arnet's Issues

AttributeError: 'Namespace' object has no attribute 'drop_prob_lm'

This is good work on image captioning, but I ran into a problem when trying to run the code.
I cannot find the variable 'drop_prob_lm' in opts.py or in the '.sh' files, so I do not know what value it should have.
My environment is PyTorch 0.3.1 with Python 2.7.
Please help. Thanks a lot!
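
The error means the option was never registered with the argument parser, so the parsed Namespace has no such attribute. Below is a minimal sketch of registering it, assuming opts.py builds an argparse parser. The function name build_parser and the default of 0.5 are placeholders for illustration, not values taken from the repository.

```python
import argparse


def build_parser():
    """Build a parser that registers the option the code expects."""
    parser = argparse.ArgumentParser()
    # Assumption: 0.5 is only a placeholder default, not the repository's value.
    parser.add_argument('--drop_prob_lm', type=float, default=0.5,
                        help='dropout probability used in the language model')
    return parser


# Parsing an empty argument list falls back to the default.
opt = build_parser().parse_args([])
print(opt.drop_prob_lm)

# A Namespace without the option raises AttributeError on access;
# getattr with a fallback is one way to read it defensively.
fallback = getattr(argparse.Namespace(), 'drop_prob_lm', None)
```

With the option registered, the value can also be set from a '.sh' launch script by passing --drop_prob_lm on the command line.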

Adaptive Attention Model with ARNet

Hi, I recently read your work. I am working on an adaptive attention model and would like to combine it with ARNet. The adaptive attention model combines the hidden state with the attention output to form a new hidden state. After the first stage of training, where should I attach ARNet: on the LSTM hidden state only, or on the hidden state combined with the attention output? Any suggestions would be appreciated. Thank you.

Why are there missing keys in state_dict?

When I run image_caption_ende_rcst_lstm.py, I get the following error:
KeyError: 'missing keys in state_dict: "{'rcst_lstm.h2h.bias', 'rcst_lstm.h2h.weight', 'h_2_pre_h.weight', 'rcst_lstm.i2h.bias', 'h_2_pre_h.bias', 'rcst_lstm.i2h.weight'}"'
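
All six missing keys belong to the 'rcst_lstm' and 'h_2_pre_h' layers, which suggests (as an assumption, not something the issue confirms) that the checkpoint being loaded was saved from a model that did not contain them. A minimal sketch for comparing the two key sets before loading is shown below. The helper diff_state_dict_keys and the key 'embed.weight' are hypothetical; in practice the inputs would come from model.state_dict().keys() and the keys of the dict returned by torch.load.

```python
def diff_state_dict_keys(model_keys, checkpoint_keys):
    """Return (missing, unexpected) key lists, both sorted.

    missing:    keys the model expects but the checkpoint lacks
    unexpected: keys the checkpoint has but the model does not
    """
    model_keys, checkpoint_keys = set(model_keys), set(checkpoint_keys)
    missing = sorted(model_keys - checkpoint_keys)
    unexpected = sorted(checkpoint_keys - model_keys)
    return missing, unexpected


# 'embed.weight' is a hypothetical shared key; the rest come from the error message.
model_keys = [
    'embed.weight',
    'rcst_lstm.i2h.weight', 'rcst_lstm.i2h.bias',
    'rcst_lstm.h2h.weight', 'rcst_lstm.h2h.bias',
    'h_2_pre_h.weight', 'h_2_pre_h.bias',
]
checkpoint_keys = ['embed.weight']

missing, unexpected = diff_state_dict_keys(model_keys, checkpoint_keys)
for key in missing:
    print('missing from checkpoint:', key)
```

If the missing layers are meant to be trained from scratch in this stage, one option is to copy only the overlapping keys into the model's own state dict before loading it.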

RuntimeError: cuda runtime error (59): device-side assert triggered

When I try to run this project, I hit this error after a few steps, at the line
output = torch.sum(output) / batch_size
in 'image_caption_ende_xe.py'. Here output is a Variable of type 'torch.cuda.FloatTensor' and batch_size is a 'float', but I do not know what the problem is.
My environment: PyTorch 0.3.1 with Python 2.7.
Please help! Thanks a lot!
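
A device-side assert is commonly caused by an index that falls outside the valid range, for example a target word id that is greater than or equal to the vocabulary size. Because CUDA reports errors asynchronously, the line named in the traceback may come after the operation that actually failed; running on the CPU or with the environment variable CUDA_LAUNCH_BLOCKING=1 usually gives a more accurate location. The sketch below checks a batch of target ids in plain Python. The helper name, the batch contents, and the vocabulary size of 10 are made up for illustration.

```python
def find_out_of_range(targets, vocab_size):
    """Return (row, col, value) for every id outside [0, vocab_size)."""
    bad = []
    for r, row in enumerate(targets):
        for c, idx in enumerate(row):
            if idx < 0 or idx >= vocab_size:
                bad.append((r, c, idx))
    return bad


# Hypothetical batch: with vocab_size=10 the valid ids are 0..9,
# so the id 10 in the second row is out of range.
targets = [[1, 5, 0], [2, 10, 0]]
bad_entries = find_out_of_range(targets, vocab_size=10)
print(bad_entries)
```

For a real batch, the nested list could be obtained by moving the target tensor to the CPU and converting it with .tolist() before calling the helper.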

Why does training get stuck in ipdb?

Hi! Thank you for sharing your code and a comprehensive tutorial on how to use it. However, after I executed the command "./bash_image_caption_soft_att_xe.sh", training stopped with the following output:
"
idx: 13024 epoch: 0 lr:0.00050000 loss: 43.086 time: 0.239
idx: 13040 epoch: 0 lr:0.00050000 loss: 41.745 time: 0.287

/home/xxx/extend/ARNet/image_captioning/utils_model.py(162)forward()
    161
--> 162         output = - input.gather(1, target) * mask
    163         output = torch.sum(output) / batch_size

ipdb> list
    157
    158         target_cpu = target.data.cpu().numpy()
    159         if 10516 in target_cpu:
    160             ipdb.set_trace()
    161
--> 162         output = - input.gather(1, target) * mask
    163         output = torch.sum(output) / batch_size
    164
    165         return output
    166
    167
"
What is wrong here? Thank you for your help.
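
The listing shows why execution stops: lines 159-160 call ipdb.set_trace() whenever the id 10516 appears in the target batch, so the pause is a breakpoint left in the code rather than a hang. Typing c at the ipdb> prompt continues training, and removing or commenting out those two lines prevents it from recurring. If the breakpoint is still wanted occasionally, one option is to gate it behind a switch, as in the sketch below. The helper should_break and the environment variable ARNET_DEBUG are hypothetical names, not part of the repository.

```python
import os


def should_break(target_ids, watched_id, enabled=None):
    """Return True only when debugging is switched on and the id is present."""
    if enabled is None:
        # Assumption: ARNET_DEBUG is an illustrative variable name.
        enabled = os.environ.get('ARNET_DEBUG') == '1'
    return bool(enabled) and watched_id in target_ids


# Inside the loss function the check would replace the unconditional one:
#     if should_break(target_cpu, 10516):
#         ipdb.set_trace()
print(should_break([3, 10516, 7], 10516, enabled=False))
print(should_break([3, 10516, 7], 10516, enabled=True))
```

With the switch off, training runs without interruption even when the watched id appears in a batch.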

Question about im2p evaluation

Sorry for asking an im2p question here; I could not find a way to open an issue in your im2p project.
I am unsure how to evaluate the paragraphs generated by im2p.
Should I treat the whole generated paragraph (multiple sentences) as one long sentence, treat the ground-truth paragraph the same way, and then pass both to BLEU, CIDEr, and so on?
Or should I change the code in bleu.py and cider.py so that each generated sentence is matched against one ground-truth sentence?
I hope you can help me with this. Thank you!
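
To make the first option concrete, the sketch below joins each paragraph into a single token sequence and computes a clipped n-gram precision, which is the core quantity inside BLEU. It is only an illustration of the "paragraph as one long sentence" idea: it is not full BLEU (no brevity penalty, no averaging over n-gram orders), it is not the im2p project's evaluation code, and the tokenization and example sentences are made up.

```python
from collections import Counter


def paragraph_tokens(sentences):
    """Join the sentences of a paragraph into one lowercase token list."""
    text = ' '.join(sentences).lower().replace('.', ' .')
    return text.split()


def ngrams(tokens, n):
    """Count the n-grams of a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def clipped_precision(candidate, reference, n):
    """Fraction of candidate n-grams found in the reference, with clipped counts."""
    cand, ref = ngrams(candidate, n), ngrams(reference, n)
    total = sum(cand.values())
    if total == 0:
        return 0.0
    matched = sum(min(count, ref[gram]) for gram, count in cand.items())
    return matched / float(total)


# Hypothetical generated and ground-truth paragraphs.
generated = ['A man rides a horse.', 'The horse is brown.']
reference = ['A man rides a brown horse.', 'The sky is blue.']

cand_tokens = paragraph_tokens(generated)
ref_tokens = paragraph_tokens(reference)
p1 = clipped_precision(cand_tokens, ref_tokens, 1)
p2 = clipped_precision(cand_tokens, ref_tokens, 2)
print(p1, p2)
```

The second option, matching sentence to sentence, would additionally need a rule for aligning generated sentences with ground-truth sentences, since the two paragraphs can differ in sentence count and order.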
