Giter Club home page Giter Club logo

corpusingestionandassembly's Introduction

Corpus Ingestion and Assembly

General information about the WorldModelers repos related to reading and assembly

Additional documentation is available at the website for this repo.

Reading and Assembly Software Components

This section describes the machine reading and assembly repositories.

Eidos

Eidos is the machine reading system developed by the CLU lab at University of Arizona. This repository includes the reading software as well as code for integrating with DART.

Concept Discovery

This is the concept identification component, which is used by the ontology-in-a-day (OIAD) system.

HUME

Hume is BBN's machine reading system that extracts CAGs and supports the OIAD clustering. It leverages the following software

Text-Open contains Java and Python APIs for reading and writing BBN's SerifXML format, which is BBN's internal representation of documents and information.

LearnIt is a tool for customizing Machine Readers (a.k.a., Information Extraction algorithms) with human in the loop. Within WM, we also use it as a pattern-based extractor for event extraction.

NLPLingo is BBN's Deep Learning toolkit for event extraction and causal relation extraction.

CSerif contains C/C++ code for pre-reading of texts into BBN's SerifXML format. We are working on open sourcing this.

Sofia

INDRA World

Ontology

The reading and assembly software developed under World Modelers program share the same underlying ontology.

corpusingestionandassembly's People

Contributors

bgyori avatar chanyees avatar chanys avatar jmacbrid avatar johnhungerford avatar kwalcock avatar mihaisurdeanu avatar reynoldsm88 avatar spilioeve avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.