Giter Club home page Giter Club logo

x-lxmert's People

Contributors

j-min avatar jiasenlu avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar

x-lxmert's Issues

Scripts missing?

Hi!
In the README there are references to scripts for finetuning and testing the different downstream tasks. They should be located in the x-lxmert/scripts/ folder, but don't seem to be there. Is it possible to add them? It would also be nice if the fine-tuned models could be shared to be able to reproduce the results.

Thanks :)

API for generating images from captions

This is a cool tool, and I really enjoy the images I've gotten from the Demo.

I was hoping one of two things were possible, and I'm wondering if I'm just missing something basic. First, is there a web API? It'd be amazing to be able to do something similar to:

result = requests.get('https://vision-explorer.allenai.org/text_to_image_generation_api', data={'caption': 'Diamond rose horse'})
save_image(result.json()['image_str'])

Or, (and it looks like this is to something that I might actually be able to do) if there's no web API:

pip install -r requirements.txt
wget -O image_generator/snap/pretrained/G_60.pth https://ai2-vision-x-lxmert.s3-us-west-2.amazonaws.com/image_generator/G_60.pth
./image_generator/scripts/make_image.py --caption "Diamond rose horse" --outpath my_weird_pic.png

I guess my question is: if all I care about is having a caption and getting an image programmatically, what's the easiest way of doing that?

image_generator/src/trainer.py missing

Hi, thanks for this wonderful work! When I try to run the image generation training, it appears that trainer.py is missing from the src. Could you add this missing file to the repo? Thanks a lot for your kind help!

403 Forbidden when trying to download grid features

Thanks for sharing the code! Could you check the download links for grid features?

$ wget https://ai2-vision-x-lxmert.s3-us-west-2.amazonaws.com/butd_features/NLVR2/maskrcnn_train_grid8.h5
--2021-01-08 14:57:08--  https://ai2-vision-x-lxmert.s3-us-west-2.amazonaws.com/butd_features/NLVR2/maskrcnn_train_grid8.h5
Resolving ai2-vision-x-lxmert.s3-us-west-2.amazonaws.com (ai2-vision-x-lxmert.s3-us-west-2.amazonaws.com)... 52.218.153.17
Connecting to ai2-vision-x-lxmert.s3-us-west-2.amazonaws.com (ai2-vision-x-lxmert.s3-us-west-2.amazonaws.com)|52.218.153.17|:443... connected.
HTTP request sent, awaiting response... 403 Forbidden
2021-01-08 14:57:08 ERROR 403: Forbidden.

Is this the original code for the paper?

Seems like this might not be the original code used for the paper - as I see quite a few bugs here ranging from typos/syntax errors to different file structures than the instructions (specifically for image generation). Would be nice if the authors @j-min actually can verify if this is it and it works for them?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.