laion-ai / audio-dataset Goto Github PK
View Code? Open in Web Editor NEWAudio Dataset for training CLAP and other models
Audio Dataset for training CLAP and other models
@marianna13 Hi Mariana,
It seems like there are several options to have a Music Dataset. However, could you recommend me one (or many) for training the MuLaN model?
They used 44 million music recordings (almost 370K hours). The following table show some examples of their texts of 3 different types.
Assigned: Knoriy
Where do we get the following files from?
Any help would be appreciated.
Thanks in advance.
Hi, can you share some ways to download Freesound? e.g. How to use Linux scripts to download these audio.
Assigned: Knoriy
Current location AWS S3 bucket, not yet prepared
Most of the urls in the Freesound (no overlap)train+test.csv files are invalid. When I visit the url, I find the result like this:
https://freesound.org/apiv2/sounds/621393/download/?format=api
How can I download the dataset correctly? Thank you!
This repo is great. I always wanted to benchmark webdataset for audio. A couple of questions:
Hi,
Thanks for sharing the wonderful code.
According to the readme of data preprocess (here)
there should be a key of 'tag' (containing labels) in the output JSON file after preprocessing.
This tag extraction/creation is missing in the preprocess_FSD50K.py file.
Am I understanding something incorrectly or there is 'tag' creation missing in the file?
Thanks,
Saksham
@rvencu @rom1504
We need more data in the next step. The data we need in the ranking of priority is:
For audio data with natural text description, we further need:
For audio data with other labels, we need to collect new large datasets while converting our current dataset with tag labels.
The datasets in top priority are those with large size and easy to turn labels into a text description:
(The following datasets all are those with tag labels of the audio)
The datasets we currently have that need converting labels to text are:
We should come up with a unified way of converting tags to text. We could reference how CLIP did that (in converting classification to natural text).
current location on AWS S3 bucket
How can we get the dataset you are using, I can not rerun the preprocessing code for the dataset downloaded from their website
Hi, I want to ask how to download Epidemic Sound dataset. Following the link in the audiostock csv file, e.g. https://audiostock.net/audio/1111457/play , I find that I can download it by Windows browser, but it cannot used on linux system.
Similarly to image datasets, it's better to first save a url + metadata file as parquet
That can be distributed without copyright issue
Then a tool like img2dataset can handle the download
Let's add that in the readme here
Congratulations for executing the herculean effort of putting together this dataset!
Where can one find the access information for the data in s3://s-laion-audio/?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.