tube42 / wordlists Goto Github PK
View Code? Open in Web Editor NEWMulti-language word lists for use in open source games.
License: GNU General Public License v2.0
Multi-language word lists for use in open source games.
License: GNU General Public License v2.0
While searching for a Swedish word list to add to https://github.com/lexica/lexica , this is the best collection of dictionaries I have found. See for example this: lexica/lexica#179
What games currently use these lists? Collaborating on keeping them up to date seems like a very reasonable thing to do. For Swedish, there are regular updates of words to add and remove, as collected for example here: http://scrabbleforbundet.se/ordlistor/
Here are some categories that could be useful for some games, to allow the player to choose what types of games to play with:
Beginner words (200)
Common words (1000)
Expanded common words (10 000)
City, country, continent, area names
Flora names
Fauna names
Celebrity names
Hard/unusual words
Abbreviations
Scrabble
Massive list of everything
But to do something like this there would need to be more restructuring done. The original word lists look like you want to keep them as they are, without changes. But is that really the best idea? Language is changing, keeping them more up to date is likely better. At least for Swedish, the included list stopped being updated 9 years ago when there was heavy discussion about how it was used in a popular app game. There are some ways that the list was set up that was useful for its original purpose, but not suitable for use in a word game. As the stated purpose of this repo is to maintain word lists for use in word games, it might make sense to adapt the list for this use.
I'm a native german speaker and i notice really odd words in the german word list, which i am certain do positively not exist in the german language. for example:
Aa
lormest
losbräch
Lunden
lustrier
Maa
macklich
MacGuffin
Also I find words that are obscure / very specialised, so that no average speaker would know them without being an expert in a very narrow field. They indeed exist (e.g. found on de.wikipedia.org) but it is questionable whether they should appear in a game. For example:
Aalhamen
Kyu
kyanisieren
Labantzen
Machorka
Where did the words come from?
Any ambitions to curate suitable words?
(manually importing this issue from gitlab since it may be useful for reference later)
Martin Quinson
Add some german word list?
Hello,
you may want to add some german word list, for example one listed in lexica/lexica#69 (comment)
Thanks, Mt
tube42 @tube42 · 3 months ago
Maintainer
Waiting for this issue to resolve
enz/german-wordlist#1
tube42
tube42 @tube42 · 3 months ago
Maintainer
No idea about the quality of the words, but German is now added.
Note that the file format and the way unicode is handled has changed in this release.
tube42 @tube42 closed 3 months ago
Martin Quinson
Martin Quinson @mquinson · 3 months ago
Thanks a lot. I can tell you that almost all words containing a œ in their list are actually French words. I guess that they are usable in German too, but I'm not fluent enough to say for sure. So I kinda agree with you even if I'd prefer a native German speaker to comment on the bug you opened on their repo.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.