language-data's Introduction

Language data and utilities

This library contains language-related data, together with utility libraries written in PHP and Node.js for interacting with that data.

The language-related data comprises the following:

  1. The script in which a language is written
  2. The script code
  3. The language code
  4. The regions in which the language is spoken
  5. The autonym: the language name written in its own script
  6. The directionality of the text

This data is populated from the current version of CLDR supplemental data and various other sources.

Documentation

  1. Full documentation
  2. Using the PHP library
  3. Using the Node.js library
  4. Adding Languages

Release schedule

Like MLEB, this library follows a quarterly release schedule and is released along with MLEB. Intermediate releases will be made for important bug fixes.

Changelog

The full changelog is available in CHANGELOG.md.

language-data's People

Contributors

abijeet, amire80, cscott, jayprakash-se, jdforrester, jhsoby, kartikm, kizule, legoktm, lucaswerkmeister, marcoaureliowm, nikerabbit, nikkiwd, pl217, revi, robinvandervliet, santhoshtr, srish, zabemath

language-data's Issues

Name for ASL is incorrectly capitalised

The name used here for ASL is incorrectly capitalised. It's a proper noun, so each word should be capitalised: American Sign Language is not the only American sign language.

`php src/util/ulsdata2json.php` doesn't update language-data.json

I updated an autonym in YAML, and ran php src/util/ulsdata2json.php. Nothing changed in the language-data.json file.

I'm running this version of PHP:

aaharoni@macdavid language-data % php --version                
PHP 7.3.11 (cli) (built: Feb 29 2020 02:50:36) ( NTS )
Copyright (c) 1997-2018 The PHP Group
Zend Engine v3.3.11, Copyright (c) 1998-2018 Zend Technologies

Check the usage of parentheses in names of Chinese variants

Many names of Chinese variants here have parentheses. Here's the full current list:

cjy-hans: [Hans, [AS], 晋语(简化字)]
gan-hans: [Hans, [AS], 赣语(简体)]
hak-hans: [Hans, [AS], 客家语(简体)]
hak-hant: [Hant, [AS], 客家語(繁體)]
zh-cn: [Hans, [AS], 中文(中国大陆)]
zh-hans: [Hans, [AS], 中文(简体)]
zh-hant: [Hant, [AS], 中文(繁體)]
zh-hk: [Hant, [AS], 中文(香港)]
zh-mo: [Hant, [AS], 中文(澳門)]
zh-my: [Hans, [AS], 中文(马来西亚)]
zh-sg: [Hans, [AS], 中文(新加坡)]
zh-tw: [Hant, [AS], 中文(台灣)]

One small issue is in cjy-hans: it has an ASCII closing parenthesis, while all the rest have fullwidth parentheses on both sides.

But a bigger issue is that I wonder whether the parentheses are needed at all. Can these names be written without parentheses? Or is it more natural with parentheses?

Unfortunately I don't know Chinese. For what it's worth, cdo-hani has no parentheses ("閩東語漢字"), and I can find some others without parentheses, e.g. "客家語繁體" (even if I remove Wikipedia and translatewiki from the results).
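
The parenthesis inconsistency described for cjy-hans could be checked mechanically. The following is only a rough sketch: the function name parenthesisStyle and the sample strings are made up for illustration, and the samples are written with explicit escapes to match the description in this issue rather than copied from the data file.

```javascript
// Classify which kind of parentheses a name uses.
// ASCII parentheses are U+0028 and U+0029; fullwidth ones are U+FF08 and U+FF09.
function parenthesisStyle(name) {
  const hasAscii = /[()]/.test(name);
  const hasFullwidth = /[\uFF08\uFF09]/.test(name);
  if (hasAscii && hasFullwidth) return 'mixed';
  if (hasAscii) return 'ascii';
  if (hasFullwidth) return 'fullwidth';
  return 'none';
}

// Hypothetical samples; the parenthesis characters follow the issue's description.
const samples = {
  'cjy-hans': '晋语\uFF08简化字)',        // fullwidth opening, ASCII closing
  'zh-hans': '中文\uFF08简体\uFF09',      // fullwidth on both sides
  'cdo-hani': '閩東語漢字',               // no parentheses
};

for (const [code, name] of Object.entries(samples)) {
  console.log(code, parenthesisStyle(name));
}
```

Any name reported as mixed would be a candidate for cleanup, whichever convention is eventually chosen.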

npm package does not work

09ab602 moved the entry point file index.js to the src folder, but that non-default path is not given in package.json. This makes the package unusable.
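
To illustrate why the missing path matters, here is a deliberately simplified model of the classic CommonJS lookup: when package.json has a "main" field it is used, otherwise index.js in the package root is assumed. The function resolveEntryPoint and the package name example-package are hypothetical, and the real Node.js resolution has more rules (for instance the "exports" field) that this sketch ignores.

```javascript
// Simplified model: pick the entry file from a parsed package.json object.
// Falls back to "index.js" in the package root when "main" is absent.
function resolveEntryPoint(pkg) {
  const hasMain = typeof pkg.main === 'string' && pkg.main !== '';
  return hasMain ? pkg.main : 'index.js';
}

// Hypothetical manifests, before and after declaring the moved entry point.
const withoutMain = { name: 'example-package' };
const withMain = { name: 'example-package', main: 'src/index.js' };

console.log(resolveEntryPoint(withoutMain)); // index.js (no longer exists in the root)
console.log(resolveEntryPoint(withMain));    // src/index.js
```

Under this model, declaring the src path in the "main" field of package.json would be one way to make the package loadable again.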

Add sign languages

I am trying to use $.uls.data.getScript() to determine which languages should be rendered as SignWriting, but the only sign language it knows about is ase for ASL, which means I'm having to maintain my own list of languages instead.

getScript/getDir/isRtl return incorrect data for unrecognised codes containing a script tag

If a language code contains a script tag, getScript, getDir and isRtl should not need the code to be defined, because it already contains the information they need.

For example, for mis-arab (uncoded language, Arabic script):

$.uls.data.getScript("mis-arab"); currently returns Zyyy instead of Arab.

$.uls.data.getDir("mis-arab"); currently returns ltr instead of rtl.

$.uls.data.isRtl("mis-arab"); currently returns false instead of true.
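
One possible shape for the requested fallback is sketched below. It is not the library's implementation: scriptFromCode, dirFromCode and RTL_SCRIPTS are hypothetical names, and the list of right-to-left scripts is intentionally incomplete. The sketch assumes that a four-letter alphabetic subtag after the language subtag is a script code, and it stops at single-character subtags so that private-use parts are not misread.

```javascript
// Hypothetical, incomplete set of right-to-left scripts (ISO 15924 codes).
const RTL_SCRIPTS = new Set(['Arab', 'Hebr', 'Syrc', 'Thaa', 'Nkoo']);

// Return the script subtag of a language code in title case, or null if none.
function scriptFromCode(code) {
  const subtags = code.split('-');
  // Skip the primary language subtag and scan the rest.
  for (const subtag of subtags.slice(1)) {
    // A single-character subtag starts an extension or private-use section.
    if (subtag.length === 1) break;
    if (/^[a-z]{4}$/i.test(subtag)) {
      return subtag[0].toUpperCase() + subtag.slice(1).toLowerCase();
    }
  }
  return null;
}

// Derive the direction from the script subtag alone; null means "no script tag,
// fall back to the normal data lookup".
function dirFromCode(code) {
  const script = scriptFromCode(code);
  if (script === null) return null;
  return RTL_SCRIPTS.has(script) ? 'rtl' : 'ltr';
}

console.log(scriptFromCode('mis-arab')); // Arab
console.log(dirFromCode('mis-arab'));    // rtl
```

A lookup function could try this first for codes it does not recognise and only then return its generic default.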
