<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

Move DMP extension here about dataid-ontology HOT 4 CLOSED

dbpedia commented on July 4, 2024

Move DMP extension here

from dataid-ontology.

Comments (4)

seebi commented on July 4, 2024

If you plan to release and version the ontology core and its extensions separately, you should not put it in one repository.

from dataid-ontology.

jimkont commented on July 4, 2024

dmp is mostly stable now and we can align the versioning if needed.
Keeping them in separate repositories will add more overhead for publishing

from dataid-ontology.

chile12 commented on July 4, 2024

DMP is quiet closely connected with the purposes of (core) dataid. This has
manifested itself by properties seeping from dmp to dataid (eg similar
data, software requirement). DMP is not stable yet since there are still
details to figure out (like repository descriptions). See draft version for
more.
@dimitris: i'm not sure about your question, since DMP has already a branch in this repo

from dataid-ontology.

chile12 commented on July 4, 2024

Hello everyone,

as you might have noticed we had some troubling issues with abstracts files
in general and English abstracts in particular.

We have remedied those issues by rerunning the full abstract extractions
for the 10 languages most affected by these issues
(de,en,es,fr,it,ja,ko,nl,pl,pt).

Secondarily, we used this as an opportunity to test the the NLP Interchange
Format (NIF)
http://persistence.uni-leipzig.org/nlp2rdf/ontologies/nif-core/nif-core.htmlextraction
on the abstracts of those languages, extraction three new datasets in the
process:

nif-context: the full text of a page as context (including begin and
end index)
nif-page-structure: the structure of the page in sections and
paragraphs (titles, subsections etc.)
nif-text-links: all in-text links to other DBpedia resources as well
as external references

While for this test run we only include the first section (the abstract) of
every page in the context, we are trying (hopefully by the next release) to
extend the context to the full text of all Wikipedia pages, portraying its
structure and providing the foundation for future NLP fact extraction tasks.

You can download these files from here
http://wiki.dbpedia.org/nif-abstract-datasetsor directly here
http://downloads.dbpedia.org/2016-04/ext/nif-abstracts/.

Furthermore, Magnus discovered that all Wikidata normalized files
(wkd_uris) for the English language edition had faulty predicates, so we
reproduced these as well.

We hope to have covered all shortcomings of the last release by this
measure.

Please note: Patrick from Open Link is still in the process of updating the
public endpoint of DBpedia with the new abstracts while I'm writing this
message.

Markus Freudenberg

Release Manager, DBpedia http://wiki.dbpedia.org

from dataid-ontology.

Move DMP extension here about dataid-ontology HOT 4 CLOSED

Comments (4)

Related Issues (8)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent