Giter Club home page Giter Club logo

dbpedia-spotlight's Introduction

DBpedia Spotlight Build Status

Shedding Light on the Web of Documents

DBpedia Spotlight looks for ~3.5M things of unknown or ~320 known types in text and tries to link them to their global unique identifiers in DBpedia.

Demonstration

Go to our Demonstration page, copy+paste some text and play with the parameters to see how it works.

Call our web service

You can use our demonstration Web Service directly from your application.

curl http://spotlight.sztaki.hu:2222/rest/annotate \
  --data-urlencode "text=President Obama called Wednesday on Congress to extend a tax break
  for students included in last year's economic stimulus package, arguing
  that the policy provides more generous assistance." \
  --data "confidence=0.35"

or for JSON:

curl http://spotlight.sztaki.hu:2222/rest/annotate \
  --data-urlencode "text=President Obama called Wednesday on Congress to extend a tax break
  for students included in last year's economic stimulus package, arguing
  that the policy provides more generous assistance." \
  --data "confidence=0.35" \
  -H "Accept: application/json"

Run your own server

If you need service reliability and lower response times, you can run DBpedia Spotlight in your own In-House Server. Just download a model and Spotlight from here to get started.

wget http://spotlight.sztaki.hu/downloads/dbpedia-spotlight-latest.jar
wget http://spotlight.sztaki.hu/downloads/latest_models/en.tar.gz
tar xzf en.tar.gz
java -jar dbpedia-spotlight-latest.jar en http://localhost:2222/rest

Models and data

Models and raw data for most languages are available here.

Citation

If you use DBpedia Spotlight in your research, please cite the following paper:

@inproceedings{isem2013daiber,
  title = {Improving Efficiency and Accuracy in Multilingual Entity Extraction},
  author = {Joachim Daiber and Max Jakob and Chris Hokamp and Pablo N. Mendes},
  year = {2013},
  booktitle = {Proceedings of the 9th International Conference on Semantic Systems (I-Semantics)}
}

Licenses

All the original code produced for DBpedia Spotlight is licensed under Apache License, 2.0. Some modules have dependencies on LingPipe under the Royalty Free License. Some of our original code (currently) depends on GPL-licensed or LGPL-licensed code and is therefore also GPL or LGPL, respectively. We are currently cleaning up the dependencies to release two builds, one purely GPL and one purely Apache License, 2.0.

The documentation on this website is shared as Creative Commons Attribution-ShareAlike 3.0 Unported License.

More information on citation and how to cite the deprecated Lucene version can be found here.

Documentation

More documentation is available from the DBpedia Spotlight wiki.

FAQ

Check the FAQ here

dbpedia-spotlight's People

Contributors

dav009 avatar dbrambilla avatar dirkweissenborn avatar golicar avatar hajokeffer avatar halin avatar hunterhector avatar jgrivolla avatar jimkont avatar jodaiber avatar kenda avatar maxjakob avatar mvnural avatar pablomendes avatar poke19962008 avatar sandroacoelho avatar sjuyal avatar zaknarfen avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.