This is an implementation of QwenLM/Qwen-VL-Chat as a Cog model. Cog packages machine learning models as standard containers.
First, download the pre-trained weights:
cog run script/download-weights
Then, you can run predictions:
cog predict -i [email protected] -i prompt="What is the name of the movie in the poster?"
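If you want to script predictions instead of typing the command by hand, the following is a minimal sketch of assembling the same `cog predict` call from Python. It relies on Cog's `-i name=value` input syntax, where an `@` prefix marks a file path. The helper `build_predict_command`, the input name `image`, and the file `poster.jpg` are assumptions made for illustration; only the `prompt` input is taken from the command above, so check the model's predictor for the actual input names.

```python
import shlex


def build_predict_command(file_inputs, text_inputs):
    """Build the argument list for a `cog predict` call.

    Hypothetical helper, not part of Cog or this repository.
    file_inputs maps input names to local file paths (passed as name=@path);
    text_inputs maps input names to plain string values (passed as name=value).
    """
    argv = ["cog", "predict"]
    for name, path in file_inputs.items():
        argv += ["-i", f"{name}=@{path}"]
    for name, value in text_inputs.items():
        argv += ["-i", f"{name}={value}"]
    return argv


command = build_predict_command(
    {"image": "poster.jpg"},  # assumed input name and placeholder file
    {"prompt": "What is the name of the movie in the poster?"},
)

# Print a shell-safe version of the command for inspection.
# To actually run it: subprocess.run(command, check=True)
print(shlex.join(command))
```

Passing the arguments as a list avoids shell quoting problems when the prompt contains spaces or quotation marks.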