Giter Club home page Giter Club logo

common-voice's Introduction

Common Voice

This is the web app for Mozilla Common Voice, a platform for collecting speech donations in order to create public domain datasets for training voice recognition-related tools.

Upcoming releases

Type Release Cadence More info
Platform code & sentences Monthly, or as needed Release notes
Dataset Quarterly Dataset metadata

Quick links

How to contribute

๐ŸŽ‰ First off, thanks for taking the time to contribute! This project would not be possible without people like you. ๐ŸŽ‰

There are many ways to get involved with Common Voice - you don't have to know how to code to contribute!

  • To add or correct the translation of the web interface, please use the Mozilla localization platform Pontoon. Please note, we do not accept any direct pull requests for changing localization content.
  • For information on how to add or edit sentences to Common Voice, see SENTENCES.md
  • For instructions on setting up a local development environment, see DEVELOPMENT.md
  • For information on how to add a new language to Common Voice, see LANGUAGE.md
  • For information on how to get in contact with existing language communities, see COMMUNITIES.md

For more general guidance on building your own language community using Mozilla voice tools, please refer to the Mozilla Voice Community Playbook.

Discussion

For general discussion (feedback, ideas, random musings), head to our Discourse Category.

For bug reports or specific feature, please use the GitHub issue tracker.

For live chat, join us on Matrix.

Licensing and content source

This repository is released under MPL (Mozilla Public License) 2.0.

The majority of our sentence text in /server/data comes directly from user submissions in our Sentence Collector or they are scraped from Wikipedia using our extractor tool, and are released under a CC0 public domain Creative Commons license.

Any files that follow the pattern europarl-VERSION-LANG.txt (such as europarl-v7-de.txt) were extracted with our thanks from the Europarl Corpus, which features transcripts from proceedings in the European parliament.

Citation

If you use the data in a published academic work we would appreciate if you cite the following article:

  • Ardila, R., Branson, M., Davis, K., Henretty, M., Kohler, M., Meyer, J., Morais, R., Saunders, L., Tyers, F. M. and Weber, G. (2020) "Common Voice: A Massively-Multilingual Speech Corpus". Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020). pp. 4211โ€”4215

The BiBTex is:

@inproceedings{commonvoice:2020,
  author = {Ardila, R. and Branson, M. and Davis, K. and Henretty, M. and Kohler, M. and Meyer, J. and Morais, R. and Saunders, L. and Tyers, F. M. and Weber, G.},
  title = {Common Voice: A Massively-Multilingual Speech Corpus},
  booktitle = {Proceedings of the 12th Conference on Language Resources and Evaluation (LREC 2020)},
  pages = {4211--4215},
  year = 2020
}

Cross Browser Testing

This project is tested with Browserstack

common-voice's People

Contributors

abelardo-py avatar ag12r avatar alidev9901 avatar andy-moz avatar bbeshir avatar danielinux7 avatar fjoerfoks avatar fu-lab avatar gozer avatar gregoor avatar ilnarselimcan avatar jimsp472000 avatar karm46 avatar koehlermichael avatar kripoks avatar marceloghelman avatar marixime avatar markh-bz avatar mikehenrty avatar milupo avatar mohsin274 avatar mpqadir avatar petercpg avatar phirework avatar piotrdrag avatar rprys avatar sahran avatar theochevalier avatar txopi avatar yaitelmouden avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

common-voice's Issues

[Bug] Cannot listen to m4a on desktop, cannot listen to ogg on iphone

We'll need to do client detection and server conversion.

Perhaps one thing we could do is add extension to the url: /upload/random.m4a to give the server a hint about what to serve.

Better though would be client side audio conversion. Need to see what is easier, converting ogg -> m4a on webkit, or the other way around on everything else, or neither.

[iOS App] Don't take up full screen

Right now the app displays underneath the system statusbar (which is transparent). This looks a little weird. Can we make it so that the app displays inside a frame and not underneath the statusbar?

Basic flow, Wire Frames

Landing Page (main)

  • Show overall progress towards goal
  • Big Donate now button

Leaderboard

  • show top 3
  • show where you are
  • show bottom 2

Rewards Page

  • List any rewards the user has (with unclaimed ones at the top)
  • Ability to add name for leaderboard, new sentence, login

Contributors

  • Dynamic list of all users who reached level 2

Voice Donation Page

  • experience points (voice-coins), perhaps count of how many submissions from your user
  • Next reward widget - a

Profile Page

[Listen Screen] Flow

There are several issues here I'm not sure if I should break them up in different issues.

  • The general flow should be the user only needs to press on Yes / No to move on to the next sentence. No separate Submit button.
  • Instead of the browser sound controls, we should have a Play button, which becomes Repeat after the first time it is pressed.
  • After submitting, the next sentence should be loaded and the audio played automatically. The goal is to have as few user interactions as possible.
  • A user should not hear the same (sentence, audio) pair more than once.
  • There seems to several types of recorded audio samples:
    • Accurate readout of the sentence
    • Medium quality, perhaps a word or two are missing
    • Nothing was recorded at all
    • Background noise
    • Something else was recorded entirely, perhaps the used accidentally pressed the record button and left it on, posing a privacy concern.
    • Profanity, or vulgar content that should be flagged.

Text for initial iOS app and website

Home page:
Build the world's most diverse set of voice data that researchers and others can use for free to create better voice technologies for the Internet.

[maybe button here?]
Click 'Record' to start donating your voice.

Your voice donations will be made available for researchers and others to use under a Creative Commons license [https://creativecommons.org/publicdomain/zero/1.0/]. Your name or any other identifying information will not be associated with this voice data.

This project is governed by Mozilla's Privacy Policy [https://mozilla.org/privacy/websites/]

About page:
Project Common Voice is brought to you by Mozilla, the proudly non-profit champions of the Internet.

Today's technologies that allow learning from data are freely available for anyone to use, and are resulting a wave of innovation online. However, voice technologies (for example, speech recognition) are not seeing the same innovation because little data is freely available to train machine learning technologies. The data that is available is from a set of speakers with limited diversity of accents and languages.

Our aim with Project Common Voice is to enable "voice donors" to build the world's largest and most diverse set of voice data that is freely available for anyone to use. Our vision is that researchers and others will be able to use this data to increase innovation in voice related technologies. This will help everyone have access to a new wave of voice technologies, and ensure that people aren't locked-in to using services from a small number of Internet giants.

First time use incorrectly initializes User

If one goes to the website without ever previously visiting the site, one hits the error

Uncaught TypeError: Cannot read property 'userId' of null
    at User.Component.setState (bundle.js:404)
    at User.restore (bundle.js:487)
    at new User (bundle.js:475)
    at new App (bundle.js:1430)
    at HTMLDocument.<anonymous> (bundle.js:301)

due to restore() incorrectly initializing userId on first time use.

Add voice verification functionality

Users can help us validate voice input. Right now we don't have this scoped, so we would need to do that if this becomes a priority.

This was originally brought up in issue #3.

Hire a UX/UI person

Need a UX/UI contractor for roughly 1-2 weeks for interaction and visual design of the responsive website layout.

[iOS app] Disable overscroll

Right now you can drag the content of the webview up and down due to Safari's overscroll ability. Can we disable this in the app?

[First Time Experience] Need User Flow for FTE

Here's a proposal for a simple first time experience that introduces all the features. Obviously this is still open to suggestions and/or complete rewrites.

User starts at Homepage:
(Text needs to be short and clear, and have call to action.)

Welcome to Voice Commons!
Our goal is to create a Public Domain (CC0) database of voices in every language with any accent. We believe this data can empower universities, researchers, buinesses, non-profits (like ourselves), or anyone who is interested in Voice Recognition technology to be creators in this emerging space. You can learn more about Common Voice through our Mision Statement.

Take our guided tour:
Button text could be Lend your Voice or Donate or Try it out

(button takes you to donate screen)

Donate Screen

insert brief description of why we need "labeled" voice clips.

(pehaps we can call out that your data won't be uploaded yet)

User clicks records their voice, and clicks Submit.

(here we could have some sort of nice indication they just contributed)
(submitting their voice clip takes them to the listening screen.)

Listen Screen

Explain that we need to make sure that what everyone says is correct
"Play this sound clip, and tell is if what they said matches the senstence"

(here users could listen to your own clip, or we could have a pre-canned example)

User selects yes or no:
Yes || no

(Whether they say yes or no, move on to registration)

Don't worry, what you just recorded has not been added to our database yet. First, we would like you to take a look at our Privacy Policy.
tl;dr When we publish the voice database, we will strip all personally identifying information about you. But while using this site, you may share as much or as little personal information as you'd like with other users.

To use all the features of this site, you need to register. Again, your email will not be tied to any published voice data.
Registration Form
(perhaps make it clear they can still donate as Guest)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.