audioset / ontology Goto Github PK

View Code? Open in Web Editor NEW

635.0 635.0 150.0 87 KB

The Audio Set Ontology aims to provide a comprehensive set of categories to describe sound events.

ontology's People

Contributors

Stargazers

Watchers

Forkers

ankitshah009 vangao happyday630 ieyer chagge benjamesbabala parety ilibx greathillzhu alex1226 760chong ml-lab olivererwang zhengneng dim25 stephanerenouard alexxnica kryndex chenxinglili hades210 thearchiver gaoyiyeah mengqhui coco1905 yummywoo blank-wang picopoco vidhijain casinoyl jordipons ferasos nazifberat holm-xie gnanaoly drasted maggie0830 abdylan123 kevd1337 anuttarra dougouk priyankamakwana dataspock shubhampachori12110095 jazzzchan ygbwwqsi adrienriou ryanjfdeng breadbasket95 icefire-luo hmdo abhishekpratapa shayben mikeperrotta eraoul wangfeng-skymind genadee bertros stevenlol amy10260903 ginking lyulianghui kzhai smallfade shilpakk95 yamlong javelir italoadler qoboty luvsheryl leora-betesh faudil fanezhang jimyo1002 backwardn xjia520 fanofjava hadryan ghyuanmeng pabul motus anirudh58 lara-hdr habibzadeh mengze-96 lauriechen dimple-bansal abdullaeff-agency veenavijai rbroc srk92 ugotsuyokunaru atul1234anand xiongmaoxia alex007sirois nandan-sridhar lizyazpin ardasahiner hdavidethan ismita98 rendongfa

ontology's Issues

citation_uris that are not correct

Bassoon's (/m/01c3q) citation_uri: http://en.wikipedia.org/wiki/Oboe is not correct. It should be: https://en.wikipedia.org/wiki/Bassoon.

I propose this amendment on behalf of the Freesound Datasets team.

Buzz parents

"Buzz" (/m/07pjwq1) parents are:

Animal > Wild animals > Insect > Fly, housefly
Animal > Wild animals > Insect > Bee, wasp, etc.
Source-ambiguous sounds > Onomatopoeia > Brief tone

And its definition is: "The sound of rapid vibration, commonly the wings of a flying insect."

Therefore the "Buzz" class does not seem to include, e.g., "Engine" (/m/02mk9) or "Guitar amplifier" (/m/01vfsf). Is it worth including these parents? Or these kinds of sounds should not be included into "Buzz" (/m/07pjwq1)?

In case of willing to include more kinds of sounds into "Buzz", we think that it is more correct that it is an "Onomatopoeia" (/m/05n1m) and, as such, it is a sibling of "Hum" (/m/07rcgpl) - which is very similar. In this case, shall we change the parent "Source-ambiguous sounds > Onomatopoeia > Brief tone" to a more generic parent "Source-ambiguous sounds > Onomatopoeia"?

I propose this discussion on behalf of the Freesound Datasets team.

Release tag + DOI?

I know that the ontology is meant to evolve over time, but for archival purposes, it would be useful to have tagged revisions (ie, v1) and DOI's (via Zenodo).

Is there any chance of this happening?

Where do imaginary animals go?

For example: dragons, demons, banshees, aliens, things like that? This is an issue for sound effects libraries.

Missing Some Entities from Temporally-Strong Audioset

Hi everyone

The following entities of the temporally strong labeled part of Audioset are missing:

/m/0174k2 Washing machine
/m/018p4k Cart
/m/01j2bj Bathroom sounds
/m/01j3j8 Studio recording, Music
/m/01lynh Stairs
/m/02417f Windscreen wiper, windshield wiper
/m/0269r2s Chain
/m/02f9f_ Shower
/m/02ll1_ Lock
/m/040b_t Refrigerator
/m/04ctx Knife
/m/056r_1 Keypress tone
/m/0641k Paper rustling
/m/06cyt0 Mechanical bell
/m/07pqmly Slurp, drinking straw
/m/07s13rg Sweeping
/m/07sk0jz Stomp, stamp
/m/08dckq Carbon monoxide detector, CO detector
/m/098_xr Error signal
/m/0bcdqg Ringing tone, ringback tone
/m/0bzvm2 Video game sound
/m/0c1tlg Electric rotor drone, quadcopter
/m/0d4wf Kitchen and dining room sounds
/m/0fw86 Tap dance
/m/0hgq8df Crockery breaking and smashing
/m/0md09 Power saw, circular saw, table saw
/t/dd00138 Brief tone
/t/dd00141 Pant (dog)
/t/dd00142 Audio logo
/t/dd00143 Unknown sound
/t/dd00144 Alert
/t/dd00147 Dong, bong

Do you have plans to add them to the ontology?

Missing MIDs more than 9 in newly released strong labels

Hi Dan:
Sorry for bugging again!
"The file audioset_eval_strong.tsv describes 139,538 segments across the 16,996 excerpts from the evaluation set. There are 416 MIDs, 9 of which are not present in the train labels."

In my experiment, there seem to be 35 MIDS that are different than the original weakly labeled 527 MIDS:
{'/m/0bzvm2', '/t/dd00139', '/t/dd00098', '/m/07q8f3b', '/m/0c1tlg', '/m/0md09', '/t/dd00091', '/m/093_4n', '/m/01sb50', '/m/0174k2', '/m/01j423', '/m/0hgq8df', '/t/dd00099', '/m/05mxj0q', '/t/dd00141', '/m/01lynh', '/m/0fw86', '/m/0dgw9r', '/t/dd00061', '/t/dd00109', '/m/09l8g', '/m/07sk0jz', '/t/dd00133', '/m/0d4wf', '/m/018p4k', '/t/dd00143', '/m/0bcdqg', '/m/09hlz4', '/m/0zmy2j9', '/t/dd00138', '/t/dd00142', '/m/02f9f_', '/m/02021', '/m/01j3j8', '/m/0641k'}

Just want to confirm this is the expected behavior.

This is very similar to Issue 6, but not quite the same.

Thank you in advance!

Quality Assessment and rerating File Not Downloadable

Hi, there:
The Quality Assessment and rerating file at:
https://research.google.com/audioset/download.html
is not downloadable.
Namely, qa_true_counts.csv rerated_video_ids.txt are both not available. Could you fix this?

Thank you!

How about decoding the label?

Hi, there:

Just wondering how to decode the labels:

/m/06mb1,/m/0jb2l,/m/0ngt1,/t/dd00038

Thanks.

Test Set missing files causing variation in published research

Hi, Dan and the other contributors:
Thank you for maintaining the repo so far!
AudioSet, to me, is a great resource, and still the best resource to understand the nature of sound.
We did a recent study: paper and code, where we found the recent research papers are having a whooping +- 5% difference in performance due to test set missing files when downloading. Plus, the difference in label quality also contributed to the performance variation and making it less fair (see figure 2 in our paper).
I understand you guys have legal constraints on youtube licensing, but guess this issue could be easier for the original authors to address. Either to advocate the community to use a common subset, or release an updated test set? given you guys already released updated strong labels.
Looking forward to your thoughts.

audioset / ontology Goto Github PK

ontology's People

Contributors

Stargazers

Watchers

Forkers

ontology's Issues

citation_uris that are not correct

Buzz parents

Release tag + DOI?

Where do imaginary animals go?

Missing Some Entities from Temporally-Strong Audioset

Missing MIDs more than 9 in newly released strong labels

Quality Assessment and rerating File Not Downloadable

How about decoding the label?

Test Set missing files causing variation in published research

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent