Giter Club home page Giter Club logo

Comments (6)

hartig avatar hartig commented on July 23, 2024

All these things sound good and should be done. However, just to scale down the expectations in us: The reality is that we currently have one PhD student (@chengsijin0817) who can work some part of her time on LinGBM, and I can advise and moderate a bit. So, we can focus only on specific pieces out of the many directions into which this benchmark project can be taken. If we are lucky, we may find a Master's student from time to time who may help by working on a smaller project within the context of LinGBM (properly extending LinGBM to RDF-based data sources would be one such project, but we don't have such a student at the moment). So, any external help we can get is appreciated.

from lingbm.

VladimirAlexiev avatar VladimirAlexiev commented on July 23, 2024

@hartig running this over RDF is not yet in our sprint plans, but will likely be included. So we'll extend it to RDF.

You changed the LUBM gen to produce RDBMS data, is it isomorphic to the original RDF?

from lingbm.

hartig avatar hartig commented on July 23, 2024

Yes, the RDBMS data produced by our dataset generator is isomorphic to the RDF data. Our extension to the dataset generator intercepts the produced RDF data and turns it into SQL DML statements (INSERT INTO and UPDATE) for an SQL import script. The actual data is not changed in the process. (@chengsijin0817 can you confirm?)

from lingbm.

hartig avatar hartig commented on July 23, 2024

After discussing the question of isomorphism internally, I have to backtrack a bit: While it is indeed true that our extension to the dataset generator intercepts the produced RDF data in order to turn it into SQL DML statements, we actually also add a bit of additionally data; namely, for each publication, we add a title and abstract, and for each student, we add an age. These additional attributes allow us to cover some of our choke points that we could not cover based on the original LUBM data. Currently, these additional attributes are present only in the RDBMS data that our extended dataset generator produces. We can check whether we can move the generation of these attributes deeper into the dataset generator such that they also end up as additional datatype properties in the generated RDF data.

from lingbm.

VladimirAlexiev avatar VladimirAlexiev commented on July 23, 2024

Extending LUBM with more choke points is a great idea.

Does anyone maintain LUBM, or will you he taking that role?

from lingbm.

hartig avatar hartig commented on July 23, 2024

Does anyone maintain LUBM

I don't know.

or will you he taking that role?

No time.

from lingbm.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.