Giter Club home page Giter Club logo

concepticon's Introduction

Concepticon in CLLD

Since the data and mappings provided by the concepticon project are an important building block to interoperability between lexical resources, they should be published as clld app (edited by Cysouw and List), thereby providing an API which can be used by tools and methods.

Data model

The main entities of the data model are

  • ConcepticonConcept: An "authoritative" or "hub" concept, which can be used to relate other concepts to or directly by referencing it from wordlists. A ConcepticonConcept is identified by its unique English gloss. All ConcepticonConcepts have a description or example, possibly imported from OmegaWiki. (maps to Parameter)
  • ConceptList: A list of concepts typically created to collect wordlists; e.g. the swadesh list, but maybe also OmegaWiki's DefinedMeanings. (maps to Contribution)
  • SourceConcept: A concept as it appears in a ConceptList. (maps to ValueSet, the relation from ValueSet to Parameter is interpreted as the canonical sameAs relation (see below).)
  • Relation (better term?): Relates SourceConcepts to ConcepticonConcepts, specifying a relation from "sameAs|narrower|broader". Each SourceConcept is assumed to be related to exactly one ConcepticonConcept via a sameAs relation.

clld app

With this data model, a clld app can provide:

  • URLs identifying SourceConcepts and ConcepticonConcepts (as well as ConceptLists).
  • an API allowing discovery of related concepts given either a Source- or ConcepticonConcept.

Data curation

Data curation could proceed pretty much as done now, i.e. the data can be curated as a set of csv files maintained in a GitHub repository. The clld app can import well-defined releases of this repository. The set of files may look as follows:

  • One csv file listing ConcepticonConcepts: ID Gloss Description
  • One csv file per ConceptList, e.g. swadesh-1955.csv: ID Gloss ConcepticonConceptID
  • One csv file per ConceptList specifying additional relations: ID ConcepticonConceptID relation

The data repository could be equipped with a small python package providing data integrity checks which can be run after each push as Travis-CI jobs.

concepticon's People

Contributors

xrotwang avatar chrzyki avatar lingulist avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.