Giter Club home page Giter Club logo

Comments (2)

tube42 avatar tube42 commented on September 28, 2024 1

My idea was to start with a large enough list and later improve it by adding/removing words using XX.add/XX.remove files. Anyone can contribute to this. So far people have mostly contributed by sending me new languages but very few have tried to improve the existing word lists.

Categorizing words sounds like a great idea but is a huge undertaking. Maybe someone could come up with an efficient way to do that?

from wordlists.

RustanHakansson avatar RustanHakansson commented on September 28, 2024 1

Indeed it is a huge undertaking. I have mainly looked for Swedish word lists, and the closest I have found that is categorized like above is Saldom: https://spraakbanken.gu.se/resurser/saldom
It is an XML file, with attributes that shows what the word is. It is grouped by word stem, and with the variations on that word listed as a group, with each variation defined by the tags. Some of the variations are very strange, words I have never have seen and would be very surprised if I ever saw, although logical according to how Swedish works. I have discussed this with the people at the University that maintain Saldom, and this is what this lexicon intends to do. So for word game use, part of these unusual word forms need to be split off into a "cruel and unusual" word list that can be used at your own peril. The rest would work great to separate into different files that can be combined according to the game or the player.
I have not really looked for anything similar for other languages, but I guess they exist.

from wordlists.

Related Issues (3)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.