Giter Club home page Giter Club logo

joganic's Introduction

JoGANIC

A repo for JoGANIC - Journalistic Guidelines Aware News Image Captioning, accepted on EMNLP 2021. This is a non-official repo. Please refer to the official repo https://github.com/dataminr-ai/joganic

"Journalistic Guidelines Aware News Image Captioning" by Xuewen Yang, Svebor Karaman, Joel Tetreault, and Alex Jaimes.

@inproceedings{yang-etal-2021-journalistic,
    title = "Journalistic Guidelines Aware News Image Captioning",
    author = "Yang, Xuewen  and
      Karaman, Svebor  and
      Tetreault, Joel  and
      Jaimes, Alejandro",
    booktitle = "Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing",
    year = "2021",
    pages = "5162--5175",
}

joganic's People

Contributors

xuewyang avatar anonynous10992 avatar

Watchers

James Cloos avatar  avatar

joganic's Issues

the reproduce metrics

Hi, I've figured out the apex problem ! I can train the joganic model for whole 6.6 million sample in NYTimes now. I follow the config in tell model, and I check my data and label which are right. I got 5.57(BLEU), 20.00(Rouge), 46.33(CIDEr) at last. It seems a lot different from paper which is 6.39(BLEU), 22.38(Rouge), 56.54(CIDEr).
my main config is:

lr 0.0001
t_total 437600
num_epoch 100
eval_limit 5120

Sorry to bother you all the time, if you know how to reproduce the metrics better, please give me some advice which could help. It is really important to my research now, and I'll appreciated a lot if I can contain this model in my ablation experiment.
May I have your email and know more about the model ?

apex Gradient overflow.

I've trained joganic model without apex with small bsz. But it takes long time and only get 4.64 Belu. So I guess I have to train model with apex.
But every time I use apex, it always cause Gradient overflow after 50~55 epoch.

Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.5
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.25
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.125
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.0625
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.03125
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.015625
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.0078125
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.00390625
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.001953125
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.0009765625
Gradient overflow.  Skipping step, loss scaler 0 reducing loss scale to 0.00048828125

And then I got nan loss for caption, I don't change any code from github, and I use the file in scripts to obtain data, I use the same config with Transform-and-tell. I checked the data is clean. Have you met the same problem during training stage? Could you give me some advice to fix it?

about other files to train the model

It is a great model for news image captioning! I'm doing research on this task too. I want to explore your model more, could you please send me the config.yaml? What's more, I don't understand how to obtain the classes_test.npy and nerts_test.json, I'll be appreciated if you can tell me the way to obtain them and how to train the model.

could I get the model weight?

I tried the config the same as transform and tell, but with apex it get nan loss. could I have the origin config.yaml or the trained wight? I wish I could get this good performance model involved in my research. Thanks previously!

Json output of the JoGANIC model

Hello,

Great work on the paper! We are working on our paper to compare to yours. Thus, if you could provide us the json output file for your models, it would be great!

Thanks.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.