Giter Club home page Giter Club logo

ldbc_snb_bi's Introduction

LDBC_LOGO

LDBC SNB Business Intelligence (BI) workload implementations

Build Status

Implementations for the BI workload of the LDBC Social Network Benchmark.

To get started with the LDBC SNB benchmarks, check out our introductory presentation: The LDBC Social Network Benchmark (PDF).

๐Ÿ“œ If you wish to cite the LDBC SNB, please refer to the documentation repository (bib snippet).

Implementations

The repository contains the following implementations:

All implementations use Docker containers for ease of setup and execution. However, the setups can be adjusted to use a non-containerized DBMS.

Reproducing SNB BI experiments

Running an SNB BI experiment requires the following steps.

  1. Pick a system, e.g. Umbra. Make sure you have the required binaries and licenses available.

  2. Generate the data sets using the SNB Datagen according to the format described in the system's README.

  3. Generate the substitution parameters using the paramgen tool.

  4. Load the data set: set the required environment variables and run the tool's scripts/load-in-one-step.sh script.

  5. Run the benchmark: set the required environment variables and run the tool's scripts/benchmark.sh script.

  6. Collect the results in the output directory of the tool.

โš ๏ธ Note that deriving official LDBC results requires commissioning an audited benchmark, which is a more complex process as it entails code review, cross-validation, etc. See the specification's Auditing chapter for details.

Cross-validation

To cross-validate the results of two implementations, use two systems. Load the data into both, then run the benchmark in validation mode, e.g. Cypher and Umbra results. Then, run:

export SF=10

cd cypher
scripts/benchmark.sh --validate
cd ..

cd umbra
scripts/benchmark.sh --validate
cd ..

scripts/cross-validate.sh cypher umbra

Note that the cross-validation uses the numdiff CLI tool.

Usage

See .circleci/config.yml for an up-to-date example on how to use the projects in this repository.

Data sets

Pre-generated data sets and parameters are now available for SF1-SF10,000 (projected FK variant) and SF1-SF1,000 (merged FK variant).

Data sets are also being uploaded to the SURF CWI repository and expected to be published there in 2023.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.