Giter Club home page Giter Club logo

Comments (5)

dan-zeman avatar dan-zeman commented on May 20, 2024 1

I have added a link to the Download section from each treebank's section. Hope this helps to find it in the future.

from lemmatag.

foxik avatar foxik commented on May 20, 2024

If I recall, the only official source of UD data is the LINDAT release -- the Github repos are usually used only for development (i.e., they are not required to contain branch or tag with latest release).

@dan-zeman Am I right, or is it possible to get the stable releases from Github?

from lemmatag.

Hyperparticle avatar Hyperparticle commented on May 20, 2024

@foxik I was not aware that UD is hosted on LINDAT. Going through the UD website, I could not find any links to LINDAT datasets, there were just the GitHub repos.

from lemmatag.

dan-zeman avatar dan-zeman commented on May 20, 2024

Hmm, maybe we should think of making this more explicit and visible on the UD website. I can see how you can overlook it if you are looking just for one language and never go further once you click on the language... But in fact, the information is quite explicit on the title page below the flags. If you scroll long enough, or if you hit CTRL+F and type "download", you will end up at the Download section and see the link to Lindat. And you get all languages in one big package, you cannot download just one selected language.

Otherwise, it is actually possible to get stable releases from Github, although it is not the preferred way (because we want download statistics at one place, i.e., Lindat). Since we learned the first time that some people just take their data from Github and write papers about it, we reversed the branch logic and now we try to make sure that the contents of the master branch of each repo always corresponds to the most recent official release, while all fixes in the meantime happen in the dev branch. You still don't have 100% certainty that you get the right data if a treebank was released in the past, then became invalid due to stricter validation rules, was not fixed and was not included in the last release.

from lemmatag.

Hyperparticle avatar Hyperparticle commented on May 20, 2024

@dan-zeman Ah, I never noticed the download section at the bottom of the page, thanks! This should simplify things. And I agree, the download section should be more prominent (perhaps mentioned at or moved to the top of the page).

from lemmatag.

Related Issues (3)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.