nheist / caligraph Goto Github PK
View Code? Open in Web Editor NEWA Large Semantic Knowledge Graph from Wikipedia Categories and Listings
Home Page: http://caligraph.org
License: GNU General Public License v3.0
A Large Semantic Knowledge Graph from Wikipedia Categories and Listings
Home Page: http://caligraph.org
License: GNU General Public License v3.0
We would love to use the CaLiGraph data for research purposes over here in Amsterdam (VU / Triply), but unfortunately bumped into reusability issues due to invalid IRIs that contain blank spaces.
We don't have a complete overview, but we did look extensively into caligraph-ontology.nt
downloaded and extracted from here.
riot --validate caligraph-ontology.nt
results in:
ERROR riot :: [line: 4510843, col: 77] Bad character in IRI (space): <http://caligraph.org/ontology/RestrictionHasValue_birthPlace_Trentino-Alto[space]...>
The complete subject of the triple in this line is:
<http://caligraph.org/ontology/RestrictionHasValue_birthPlace_Trentino-Alto Adige/S%C3%BCdtirol>.
Blank spaces in IRIs seem to be the only issue in this file.
Hi, your work is very intersting, but I meet some problems when reproduce the result.
I input
python tune_entity_dismabiguation.py
but I'm required to add some parameters
usage: tune_entity_disambiguation.py [-h] [--approach APPROACH] [--corpus {LIST,NILK}] [-ss {5,10,15,20,25,30,35,40,45,50,55,60,65,70,75,80,85,90,95,100}]
[--mm_approach MM_APPROACH] [--me_approach ME_APPROACH] [--mm_threshold MM_THRESHOLD] [--me_threshold ME_THRESHOLD]
[--path_threshold PATH_THRESHOLD] [--me_cluster_threshold ME_CLUSTER_THRESHOLD] [--cpus CPUS]
config_file
tune_entity_disambiguation.py: error: the following arguments are required: config_file
Could you please give me the full instruction or give me the parameters required?
Thanks a lot!
Thank you for this great resource!
There is a minor issue with unescaped spaces appearing in an IRI in file caligraph-ontology.nt.bz2
line 2.405.570:
<http://caligraph.org/ontology/RestrictionHasValue_location_Allentown,_Pennsylvania> <http://www.w3.org/2000/01/rdf-schema#label> <Restriction onProperty=location hasValue=Allentown,_Pennsylvania> .
We are unable to read file caligraph-instances_labels.nt.bz2
because a single-quoted literal on line 912,449 contains an unescaped newline:
<http://caligraph.org/resource/Analogy_of_the_divided_line> <http://www.w3.org/2004/02/skos/core#altLabel> "Divided line of
Plato" .
There are two possible fixes here:
\n
).<http://caligraph.org/resource/Analogy_of_the_divided_line> <http://www.w3.org/2004/02/skos/core#altLabel> """Divided line of
Plato""" .
Hi team,
Thank you for nice work on semantic typing of concepts from wikipedia. I was struggling with same task for specific problem in my team.
I have one question:
What is the source of kind on inconsistency in sense that in class clgo:Organization there are many peoples?
Thank you
Sergei
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.