Comments (6)
All these things sound good and should be done. However, just to scale down the expectations of us: The reality is that we currently have one PhD student (@chengsijin0817) who can work some part of her time on LinGBM, and I can advise and moderate a bit. So, we can focus only on specific pieces out of the many directions into which this benchmark project can be taken. If we are lucky, we may find a Master's student from time to time who may help by working on a smaller project within the context of LinGBM (properly extending LinGBM to RDF-based data sources would be one such project, but we don't have such a student at the moment). So, any external help we can get is appreciated.
from lingbm.
@hartig running this over RDF is not yet in our sprint plans, but will likely be included. So we'll extend it to RDF.
You changed the LUBM gen to produce RDBMS data, is it isomorphic to the original RDF?
Yes, the RDBMS data produced by our dataset generator is isomorphic to the RDF data. Our extension to the dataset generator intercepts the produced RDF data and turns it into SQL DML statements (INSERT INTO and UPDATE) for an SQL import script. The actual data is not changed in the process. (@chengsijin0817 can you confirm?)
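To make the interception step concrete, here is a minimal sketch of how a stream of RDF triples could be turned into INSERT INTO and UPDATE statements. It is not the actual LinGBM generator code: the function names, the `id` column, the use of class names as table names and predicate names as column names, and the assumption that each entity's `rdf:type` triple arrives before its other triples are all hypothetical.

```python
from typing import Dict, Iterable, List, Tuple

# A triple is (subject, predicate, object), using short local names for brevity.
Triple = Tuple[str, str, str]

RDF_TYPE = "rdf:type"


def sql_quote(value: str) -> str:
    """Quote a string literal for SQL by doubling embedded single quotes."""
    return "'" + value.replace("'", "''") + "'"


def triples_to_dml(triples: Iterable[Triple]) -> List[str]:
    """Translate triples into SQL DML without altering the data values.

    Assumption (hypothetical schema): one table per class, one column per
    predicate, and the rdf:type triple of an entity comes first.
    """
    table_of: Dict[str, str] = {}  # subject -> table it was inserted into
    statements: List[str] = []
    for subject, predicate, obj in triples:
        if predicate == RDF_TYPE:
            # The type triple creates the row.
            table_of[subject] = obj
            statements.append(
                f"INSERT INTO {obj} (id) VALUES ({sql_quote(subject)});"
            )
        else:
            # Every other triple fills in one column of the existing row.
            table = table_of[subject]
            statements.append(
                f"UPDATE {table} SET {predicate} = {sql_quote(obj)} "
                f"WHERE id = {sql_quote(subject)};"
            )
    return statements


if __name__ == "__main__":
    demo = [
        ("Student0", RDF_TYPE, "GraduateStudent"),
        ("Student0", "name", "O'Neil"),
    ]
    for statement in triples_to_dml(demo):
        print(statement)
```

Because every triple maps to exactly one statement and the values are passed through unchanged, a translation of this shape keeps the relational data in one-to-one correspondence with the RDF data.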
After discussing the question of isomorphism internally, I have to backtrack a bit: While it is indeed true that our extension to the dataset generator intercepts the produced RDF data in order to turn it into SQL DML statements, we actually also add a bit of additional data; namely, for each publication, we add a title and abstract, and for each student, we add an age. These additional attributes allow us to cover some of our choke points that we could not cover based on the original LUBM data. Currently, these additional attributes are present only in the RDBMS data that our extended dataset generator produces. We can check whether we can move the generation of these attributes deeper into the dataset generator such that they also end up as additional datatype properties in the generated RDF data.
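One way to picture "moving the generation deeper" is to enrich the triple stream itself, before any SQL translation happens, so that the RDF output and the RDBMS output both see the extra attributes. The following is only a sketch under assumptions: the `enrich` function, the predicate names `age`, `title`, and `abstract`, the age range, and the placeholder text values are hypothetical and not taken from the LinGBM generator.

```python
import random
from typing import Iterable, Iterator, Tuple

# A triple is (subject, predicate, object), using short local names for brevity.
Triple = Tuple[str, str, str]

RDF_TYPE = "rdf:type"
# LUBM class names for students; treated here as the trigger for adding an age.
STUDENT_CLASSES = {"UndergraduateStudent", "GraduateStudent"}


def enrich(triples: Iterable[Triple], seed: int = 0) -> Iterator[Triple]:
    """Pass triples through and append extra datatype-property triples.

    Assumption: the extra attributes are attached when the rdf:type triple
    of a student or publication is seen. A seeded generator keeps the
    output reproducible across runs.
    """
    rng = random.Random(seed)
    for subject, predicate, obj in triples:
        yield (subject, predicate, obj)
        if predicate != RDF_TYPE:
            continue
        if obj in STUDENT_CLASSES:
            # Hypothetical age range, chosen only for illustration.
            yield (subject, "age", str(rng.randint(18, 40)))
        elif obj == "Publication":
            # Placeholder text; a real generator would produce varied strings.
            yield (subject, "title", f"Title of {subject}")
            yield (subject, "abstract", f"Abstract of {subject}")


if __name__ == "__main__":
    demo = [
        ("Publication0", RDF_TYPE, "Publication"),
        ("Student0", RDF_TYPE, "GraduateStudent"),
    ]
    for triple in enrich(demo):
        print(triple)
```

With the attributes added at this level, any downstream serializer (RDF or SQL) would receive the same enriched data, which is what would restore the one-to-one correspondence between the two outputs.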
Extending LUBM with more choke points is a great idea.
Does anyone maintain LUBM, or will you be taking that role?
> Does anyone maintain LUBM

I don't know.

> or will you be taking that role?

No time.