Giter Club home page Giter Club logo

Centre for Language Technology, University of Copenhagen's Projects

affixtrain icon affixtrain

Using supervised learning, create a set of affix rules for use by the CSTlemma lemmatiser.

all2lower icon all2lower

Converts input text (UTF-8 encoded) to lowercase. Usage: all2lower <input> <output>

anvil-facetracker icon anvil-facetracker

OpenCV-based Plugin for the Anvil annotation software that tracks faces and creates annotations when velocity or acceleration thresholds are transgressed.

cstlemma icon cstlemma

Lemmatiser for Danish, Dutch, English, German, Polish, Romanian, Russian and tens of other languages, that uses affix rules (affix: prefix, infix, suffix, circumfix). Rules are obtained by supervised learning from a full form - lemma list.

cuphic icon cuphic

Transform or scrape Hiccup with a declarative DSL.

dannet icon dannet

The Danish WordNet as an RDF graph.

dk5 icon dk5

Fetch the DK5 dataset and store it as EDN.

hashmap icon hashmap

Simple implementation of a hash map using separate chaining. The table allocates more buckets if the load factor is more than 100% and frees buckets if the loadfactor falls below 20%.

head_movement_detection icon head_movement_detection

Jupyter notebooks and training data containing manual head movement annotations, speech data and velocity, acceleration and jerk data.

jerk icon jerk

Analyses the movement of two points in x-y plane, in casu nose tips data from OpenPoseDemo.exe, and computes velocity, acceleration and jerk of the points.

korp-setups icon korp-setups

Docker setups for all Korp installations maintained by NorS.

lemmax icon lemmax

Lemmatiser with an extra. Predict lemmas as well as classes (e.g. Parts of Speech), based on the morphology of the input word.

letterfunc icon letterfunc

Functions for upper/lower casing, for testing whether a character is a letter and for conversion between Unicode encodings UTF-8 and UTF-16

makeutf8 icon makeutf8

Converts UTF-16 (BE/LE), UTF-32 (BE/LE), ISO-8859-N to UTF-8. Removes BOM and surrogate pairs from UTF-8, converting a codepoint between U-D800 and U-DBFF followed by a codepoint between U-DC00 and U-DFFF to one valid codepoint > U-FFFF.

mate-parser icon mate-parser

Web service that wraps around Bernd Bohnet's graph based parser

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.