Giter Club home page Giter Club logo

languagesgallery's Introduction

LanguagesGallery

Lingualibre Languages Gallery is a VueJS searchable gallery of Lingualibre languages, with relevant statistics, call to action, and access to download our open licence datasets.

Why a language gallery ?

The Lingualibre Languages Gallery is the fruits of 2 years of periodic research and developement. It started in late 2021, with the writing of Help:SPARQL which aimed to explore per language statistics, including on gender coverage and geographic distribution. Advanced SPARQL queries confirmed our suspicions, the major geographic and gender biaises found raised the need for an human friendly, searchable Languages gallery able to communicate those findings to both the general public and Wikimedia policies makers.

Solution

The language gallery stands upon data consolidation pipeline and daily publication systems, the Sparql2data bash script. Using refined SPARQL queries, both Lingualibre and Wikidata's data are fetched daily, their JSON responses saved in persistant files and served online. Responsive HTML/CSS code was adapted from Common Voice Project, with whom further collaboration would be welcome.

The whole does the job elegantly, providing per language key metrics, with a search function, call to action, datasets download and sharable links.

Design

Image This page presents Lingualibre.org's statistics on a per languages basis with:
  • Language name (Wikidata)
  • Number of native speakers worldwide. (Wikidata)
  • Speakers : number of Lingualibre speakers, aka voices. (Lingualibre)
  • Gender split : number of females, other (non-binary and undefined), males voices. (Lingualibre)
  • Unique words vs recordings ratio. (Lingualibre)
  • Recordings gender split: number of females, other (non-binary and undefined), males recordings. (Lingualibre)
  • Contribute button: points to our online rapid recording studio.
  • Download button: download access to all open licence audios in this language.

Screenshot from 2023-06-04 21-59-20

Advanced features

  • Search and filter by English or native language name
  • Share a filtered url by appending ?search=YourSearch to the url

Datasets reuse

Each language's card has a download button to get a zip of all our open licence files. Their filenames should be used to point to the rightful Wikimedia file page, containing the file's licence.

Contribute

This Single Page Web Application works with:

  • HTML
  • CSS
  • VueJS

Credits

Sorted by involvement: Yug, Elfix, VIGNERON, Poslovitch, Envel.

License

See index.html's footer and LICENSE MPL.

languagesgallery's People

Contributors

alphaleadership avatar belett avatar hugolpz avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

languagesgallery's Issues

Simple refactor

Idea

Friendly : we want to be junior devs friendly, in code, project structure, and npm depencies.
Simple : we want to keep this project's code simple. Simple to maintain even in current open source / volunteer-based context.
Open to new visual representation : the code should allow easy implementation multiple data representations in the future. This could include simple alternative template, or inclusion of a (vuejs) datavisualization library.

Proposition

The coder decides the reorganization. For now I suggest a simple split into :

  • index.html
  • css/styles.css
  • js/main.js

Long term view

Possible representations of our daily consolidated Lingualibre data :

lang-oci

Incorrect amount of speakers worldwide for languages

Ukrainian language is shown to have 27M speakers worldwide while in reality about 40M use it.
German is shown to have 77M while in reality it is about 140M.
It seems that there are problems with this number in a lot of languages


[EDIT] Issue closes, the following were disccussed.

  • Wikidata inconsistency for Speakers (P1098) : native or all.
  • Speaker of Ukrainian : 27M according to ethnologue.
  • Better legend need to be coded / designed, issue #10 opened.

Enable interface translation

It would be great to be able to translate this interface.

At present, it is only available in English, which potentially limits its adoption by Lingua Libre users.

Notes

First, the current texts strings were all wrote by one single contributor ( hugolpz ). It would be better if some other writing styles and minds review, improve, merge or remove those strings, consolidating those. Then, we can start translating around.

Second, Wikimédia France is considering a full stack Lingualibre revamp. Their stack and approach is unknow to me at the moment. Due to limited JS dev human resources at the moment I would prefer to handle all translations with the same tech, same process. Preferably for me: Translatewiki + npm banana-i18n , as demonstrated in Lingualibre SignIt.

Todo

On github :

  • Create a i18n directory with base files
  • Install banana-i18n
  • Implement banana-i18n

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.