Comments (14)
@gsilvapt Please refer to the Unicode CLDR document here: http://cldr.unicode.org/translation/country-names
Customary Names
The ISO names and the "official" names are often not the best ones. The goal is the most customary name used in your language, even if it is not the official name. For example, for the territory name in English you would use "Switzerland" instead of "Swiss Confederation", and use "United Kingdom" instead of "The United Kingdom of Great Britain and Northern Ireland". One of the best sources for customary usage is to look at what common major publications such as newspapers and magazines do, the equivalents of The Economist, NY Times, BBC, WSJ, etc. in your language. You can look at style guides if available or at a sampling of pages, but favor publications’ rather than academic style guidelines. For example, to see how "Congo" is used in French, one might search for Congo on Le Monde and on other publications.
Practical benefits include not only the "Venezuela" case @JoshMcCullough mentions but also includes examples such as:
- Korea, Democratic People's Republic of
- Korea, Republic of
The above official names provided by the ISO may not be useful as the customary names such as North Korea and South Korea as listed in Unicode CLDR.
This is very important difference especially if the data is meant to be visualized or designed into interactive infographics where shorter customary names are better than longer official names.
from country-codes.
@rgrp @gsilvapt I agree on using the English customary names (as defined/updated by Unicode CLDR). It is also a good advice by gsilvapt to point users to an official dataset (for official names e.g. http://unstats.un.org/unsd/methods/m49/m49alpha.htm) or an industry-focus dataset (for customary names for localization purposes such as Unicode CLDR). This can be mentioned simply in the Readme or similar file.
@JoshMcCullough I hope that the English customary names, as defined/updated by Unicode CLDR, match what you and your users will expect.
If you all agree, I can help submitting a pull request (PR) with just the English customary names.
from country-codes.
@hanteng this seems a great suggestion - @ewheeler any thoughts as maintainer?
@hanteng if we did do this would you be up for turning your fork into a pull request?
Also apologies for delay in responding - we somehow missed this comment!
from country-codes.
@rgrp Sure. Please let me know how and what to add a field of CLDR customary name. I just recently do some work based on Babel python module
from country-codes.
@hanteng can you submit a PR so we can review ...
from country-codes.
@hanteng @rgrp yes this would be fantastic to include. PR would be most welcome!
from country-codes.
Would this fix the "Venezuela, Bolivarian Republic of"
issue? (E.g. let's just call this country "Venezuela" -- as everyone knows it.)
from country-codes.
@JoshMcCullough yes i imagine it would.
@gsilvapt - could you take a look at @hanteng work in https://github.com/hanteng/country-codes/tree/master/data/country-names and see what it would take to integrate here - or whether it should even be a separate data package.
from country-codes.
@rgrp I think we can create a package from this and even add another script to the guides to help users easily replace country codes. However, getting used to it may be troublesome.
I am not sure what this fixes though? @hanteng can you enlighten me please?
from country-codes.
I appreciate this dataset but, honestly, until the country names are as @hanteng suggested, I can't use it. I need the country names I'm using to be what people would expect, otherwise, most users would be confused. Perhaps we just need a new column for the "customary name"?
from country-codes.
@JoshMcCullough we are 👍 on getting this in and adding the relevant column.
@hanteng can you submit a PR with just the english customary names for now?
from country-codes.
I think we could advise the usage of the English customary names in our core datasets and forward people to an official dataset if they need the country names in their local language. How does that sound, @hanteng @rgrp ?
from country-codes.
@hanteng Yes, this sounds good.
from country-codes.
@hanteng sounds good and please submit a PR :-)
Note, in terms of way of doing this i would recommend the following:
- Store list of customary names (plus a country identifier that would allow merging into the main file) in a separate CSV
- Join this into the main country code file using a script
This was the process is easily repeatable in future and follows the current build process.
from country-codes.
Related Issues (20)
- Validation errors HOT 2
- MYSQL hyphen problem HOT 1
- Include column for Wikidata identifier, suggestion HOT 7
- Wrong value in Dial field HOT 1
- Include official long names
- CLDR name for Sark should not be 'Namibia' HOT 3
- Update link in repository description HOT 3
- Use Goodtables to continuous data validation HOT 10
- "." not part of TLD HOT 1
- Request: Add timezone columns
- ISO-3166-1 out of date at source; consider switching source HOT 1
- Missing 3 country codes HOT 2
- use the git releases
- About Taiwan,China HOT 3
- New Venezuelan currency
- Not deploying to datahub.io
- Missing currency data for Taiwain
- Add currency symbols like "€" or "$" to country codes
- Change of country name TURKEY to TÜRKİYE
- Unable to download https://datahub.io/core/country-list/r/data.json HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from country-codes.