Comments (2)
Hi @Kratos-Wen,
Thanks for your interest in our project and for bringing up this question.
In our experiments, we found a linear layer architecture met our performance objectives quite effectively. Although the potential of more complex designs was noted, we didn't dive into that depth of experimentation and hence, we lack specific numbers related to these designs.
An important point to consider is that these complex models may not necessarily outperform our chosen architecture. Major reason being their inability to initialize from the pretrained LLaVA weights.
Hope this clarifies things a bit. Thank You.
from video-chatgpt.
Thanks, that pretty much answers my question! Congratulations again on your results!
from video-chatgpt.
Related Issues (20)
- Llava Weights not available for running demo offline HOT 2
- Can I deal with Chinese video? HOT 1
- Clarification Needed on the Role of **QA_GT_caption_based_noisy** in the Dataset HOT 1
- Why is OpenAI used for Activitynet-QA Evaluation? HOT 3
- About zero-shot test on TGIF-QA HOT 1
- Fail to execute Video-ChatGPT offline demo (when click "Upload Video") HOT 7
- Code for Dataset Generation HOT 1
- Train errors: some videos in train_video_ids.txt do not appear in activity_clip-14L_spatio_temporal_356 HOT 2
- What is the formal name of the benchmark? HOT 3
- GPT Judge does not work for the Prompt
- 13B, 70B models HOT 1
- RuntimeError: Internal: src/sentencepiece_processor.cc(1101) [model_proto->ParseFromArray(serialized.data(), serialized.size())] HOT 5
- I encountered an error while using "Run the Demo". HOT 3
- Insufficient VRAM on a single card, how to utilize multiple cards for inference HOT 2
- Question about consistency evaluation HOT 3
- The responses from the offline_demo are garbled HOT 2
- Why TGIF videos missed and can't be processed HOT 3
- How to download the ready LLaVA-Lightening-7B weights HOT 5
- Download videos HOT 1
- How many frames are sampled per video for the training and testing process? HOT 5
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from video-chatgpt.