open-sora-dataset's People
open-sora-dataset's Issues
更高分辨率的原始视频
非常感谢搜集并开源如此大规模的数据集
不知道是否可以放出处理之前的高分辨率的视频或者视频的source link?
开源的数据集只有64512512的分辨率
期待您的回答
Will you provide the download link?
I saw that you did an excellent job of crawling videos from diverse sources. Will you provide the download link?
关于使用ShareGPT4V-Captioner-7B生成视频字幕的问题
在你们的报告中提到关于使用ShareGPT4V-Captioner-7B
我想知道你们在驱动GPT进行图生文的时候使用了什么提问文本
重复的视频
我发现在pixabay的3万多条视频中,有1万多条的视频是重复的(这些视频的pixabay video id相同)。建议去重。
在mixikit和pexels中也有类似的现象,pexels中约有100多条,mixikit约有30多条
训练数据的json文件格式
请问训练数据的json文件格式是什么?有无可以参考的对象?
question for video spliting
Thanks for the wonderful work! I am confused with the video splitting pipeline.
According to the readme and my comprehension, the videos are splitted by Panda70M to get the no-transition clips, firstly, and then these clips are further splitted to get the final 2s clips.
Is that right?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.