
jnidzwetzki / bboxdb


BBoxDB is a scalable, highly available, and distributed data store for multi-dimensional big data. The software supports operations like multi-dimensional range queries and spatial joins. In addition, data streams are supported.

Home Page: http://bboxdb.org

License: Apache License 2.0

Java 98.24% Shell 1.45% Python 0.26% Gnuplot 0.02% Dockerfile 0.04%
bigdata key-value-database multi-dimensional storage-engine storage-manager multidimensional-data key-value-store sstables range-query distributed-storage-manager

bboxdb's Introduction


Join the chat at https://gitter.im/bboxdb/Lobby

Please note: The master branch may be in an unstable state during development. Please use our releases for production environments.

What is BBoxDB?

BBoxDB is a highly available distributed storage manager designed to handle multi-dimensional big data. In contrast to existing key-value stores, BBoxDB can handle multi-dimensional data efficiently. Existing key-value stores use one-dimensional keys to address the values. Finding a proper key for multi-dimensional data is challenging and often impossible; this is especially true when the data has an extent (non-point data / regions). To retrieve multi-dimensional data from a key-value store, a full data scan is often required. BBoxDB was developed to avoid the expensive full data scan and to make working with multi-dimensional data more convenient. User-defined filters are supported to process custom data formats, and BBoxDB also supports the handling of data streams.
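To make the cost of a full data scan concrete, here is a minimal Java sketch of a range query over bounding boxes that has no index to rely on. It is not BBoxDB code: the class `BoundingBoxSketch` and the method `fullScan` are hypothetical names chosen for illustration, and intervals are assumed to be closed. Every stored box has to be tested against the query box, which is the work an index is meant to avoid.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical n-dimensional bounding box; not a class from the BBoxDB code base. */
final class BoundingBoxSketch {
    final double[] low;   // lower bound per dimension
    final double[] high;  // upper bound per dimension

    BoundingBoxSketch(double[] low, double[] high) {
        if (low.length != high.length) {
            throw new IllegalArgumentException("dimension mismatch");
        }
        this.low = low.clone();
        this.high = high.clone();
    }

    /** Two boxes overlap only if their (closed) intervals overlap in every dimension. */
    boolean overlaps(BoundingBoxSketch other) {
        if (other.low.length != low.length) {
            throw new IllegalArgumentException("dimension mismatch");
        }
        for (int d = 0; d < low.length; d++) {
            if (high[d] < other.low[d] || other.high[d] < low[d]) {
                return false;
            }
        }
        return true;
    }

    /** Naive range query: tests every stored box and returns the indices of the hits. */
    static List<Integer> fullScan(List<BoundingBoxSketch> data, BoundingBoxSketch query) {
        List<Integer> hits = new ArrayList<>();
        for (int i = 0; i < data.size(); i++) {
            if (data.get(i).overlaps(query)) {
                hits.add(i);
            }
        }
        return hits;
    }

    public static void main(String[] args) {
        List<BoundingBoxSketch> data = List.of(
            new BoundingBoxSketch(new double[] {0, 0}, new double[] {2, 2}),
            new BoundingBoxSketch(new double[] {5, 5}, new double[] {6, 8}),
            new BoundingBoxSketch(new double[] {1, 7}, new double[] {3, 9}));
        BoundingBoxSketch query =
            new BoundingBoxSketch(new double[] {1, 1}, new double[] {5, 5});
        System.out.println(fullScan(data, query));
    }
}
```

The scan is linear in the number of stored boxes for every query, regardless of how small the query region is.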

Key features

  • ✅ A distributed and fault-tolerant data store for n-dimensional data.
  • ✅ Data (point and non-point) of any dimension is supported.
  • ✅ The data is indexed, which enables efficient range query processing.
  • ✅ Big data is supported by spreading the data across a cluster of systems. Each node stores only a small part of the whole dataset.
  • ✅ Multi-dimensional shards are created dynamically based on the actual distribution of the data (automatic scale-up/scale-down).
  • ✅ Data of multiple tables is stored co-partitioned, and spatial joins can be executed efficiently without data shuffling between nodes.
  • ✅ Data is re-distributed in the background without any service interruption.
  • ✅ Multi-dimensional data streams can be processed and continuous queries (range queries and spatial joins) are supported.
  • ✅ User-defined filters for query processing on custom data types.
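
The co-partitioning point in the list above can be illustrated with a small Java sketch. It assumes that two tables share the same partition, so all candidate pairs for a spatial join are already on one node. The names `LocalSpatialJoinSketch`, `Tuple`, and `join` are hypothetical, and the nested-loop strategy is a deliberately simple stand-in; it is not a description of how BBoxDB executes joins.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical sketch of a node-local spatial join; not taken from BBoxDB. */
final class LocalSpatialJoinSketch {

    /** A key plus a bounding box given as per-dimension lower and upper bounds. */
    static final class Tuple {
        final String key;
        final double[] low;
        final double[] high;

        Tuple(String key, double[] low, double[] high) {
            this.key = key;
            this.low = low;
            this.high = high;
        }
    }

    /** Boxes overlap if their closed intervals overlap in every dimension. */
    static boolean overlaps(Tuple a, Tuple b) {
        for (int d = 0; d < a.low.length; d++) {
            if (a.high[d] < b.low[d] || b.high[d] < a.low[d]) {
                return false;
            }
        }
        return true;
    }

    /** Nested-loop join over two tables that live in the same partition. */
    static List<String> join(List<Tuple> left, List<Tuple> right) {
        List<String> pairs = new ArrayList<>();
        for (Tuple l : left) {
            for (Tuple r : right) {
                if (overlaps(l, r)) {
                    pairs.add(l.key + "-" + r.key);
                }
            }
        }
        return pairs;
    }

    public static void main(String[] args) {
        List<Tuple> roads = List.of(
            new Tuple("road1", new double[] {0, 0}, new double[] {4, 1}),
            new Tuple("road2", new double[] {10, 10}, new double[] {12, 11}));
        List<Tuple> forests = List.of(
            new Tuple("forestA", new double[] {3, 0}, new double[] {6, 5}),
            new Tuple("forestB", new double[] {20, 20}, new double[] {25, 25}));
        System.out.println(join(roads, forests));
    }
}
```

Because both inputs are read from the same partition, no tuple has to be sent to another node before the overlap test runs.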

Documentation

The documentation of the project is located at https://bboxdb.org/documentation/. The documentation also contains the changelog of the project.

Getting started

For a guided tour through the features of BBoxDB, see the getting started chapter in the documentation. We also recommend reading the creating client code section. The install guide explains the steps needed to deploy your own BBoxDB cluster. The guide also describes how to set up a virtualized cluster with 5 BBoxDB nodes in under two minutes by using Docker and Docker Compose.

Screenshots

BBoxDB ships with a GUI that allows observing the global index structure. Below are two screenshots of the GUI. The screenshots show how the space is partitioned. In addition, some details about the discovered nodes are shown. When two-dimensional bounding boxes with WGS 84 coordinates are used, the GUI supports a map overlay visualization. For the top right picture, some spatial data about Germany was imported, and the figure shows how Germany is partitioned after the import. In addition, the GUI provides operations to explore two-dimensional GeoJSON-encoded data.




(The screenshots contain content from OpenStreetMap - CC-BY-SA 2.0)

BBoxDB is also able to handle data streams. The first screenshot shows the buses in Sydney fetched from a real-time GTFS feed. The data is provided by the Transport for New South Wales website. The second screenshot shows the aircraft traffic in the area of Berlin. The data is fetched from the Automatic Dependent Surveillance–Broadcast (ADS–B) data feed of the ADSBHub website. For more details, see our tutorial on the handling of real-world data streams.


(The screenshots contain content from OpenStreetMap - CC-BY-SA 2.0)

Contact / Stay informed

License

BBoxDB is licensed under the Apache 2.0 license. See the LICENSE file for details.

bboxdb's People

Contributors

dependabot-preview[bot], dependabot-support, dependabot[bot], jlleitschuh, jnidzwetzki


bboxdb's Issues

ConcurrentModificationException when delete distribution group is called

291116 [pool-1-thread-6] WARN de.fernunihagen.dna.jkn.scalephant.network.server.ClientConnectionHandler - Error while delete distribution group
java.util.ConcurrentModificationException
    at java.util.HashMap$HashIterator.nextEntry(HashMap.java:922)
    at java.util.HashMap$KeyIterator.next(HashMap.java:956)
    at de.fernunihagen.dna.jkn.scalephant.storage.StorageInterface.deleteAllTablesInDistributionGroup(StorageInterface.java:110)
    at de.fernunihagen.dna.jkn.scalephant.network.server.ClientConnectionHandler.handleDeleteDistributionGroup(ClientConnectionHandler.java:308)
    at de.fernunihagen.dna.jkn.scalephant.network.server.ClientConnectionHandler.handleNextPackage(ClientConnectionHandler.java:681)
    at de.fernunihagen.dna.jkn.scalephant.network.server.ClientConnectionHandler.run(ClientConnectionHandler.java:169)
    at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)
    at java.util.concurrent.FutureTask.run(FutureTask.java:262)
    at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145)
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615)
    at java.lang.Thread.run(Thread.java:745)
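
The trace shows the exception being raised from a `HashMap` key iterator inside `deleteAllTablesInDistributionGroup`. The source of that method is not part of this report, so the following Java sketch only reproduces one common cause of this exception: removing entries from a `HashMap` while iterating over its key set. The class name, method names, and table names are hypothetical. If the map is instead modified by another thread, iterating over a snapshot is not sufficient and a concurrent map or explicit locking would be needed.

```java
import java.util.ArrayList;
import java.util.ConcurrentModificationException;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical reproduction; not the actual StorageInterface code. */
final class DeleteWhileIteratingSketch {

    /** Buggy variant: removes entries from the map while iterating its key set. */
    static void deleteGroupUnsafe(Map<String, Object> tables, String groupPrefix) {
        for (String name : tables.keySet()) {
            if (name.startsWith(groupPrefix)) {
                tables.remove(name); // structural modification during iteration
            }
        }
    }

    /** Safer variant: iterate over a snapshot of the keys, then remove from the map. */
    static void deleteGroupSafe(Map<String, Object> tables, String groupPrefix) {
        List<String> snapshot = new ArrayList<>(tables.keySet());
        for (String name : snapshot) {
            if (name.startsWith(groupPrefix)) {
                tables.remove(name);
            }
        }
    }

    public static void main(String[] args) {
        Map<String, Object> tables = new HashMap<>();
        tables.put("group1_tableA", new Object());
        tables.put("group1_tableB", new Object());
        tables.put("group1_tableC", new Object());

        try {
            deleteGroupUnsafe(tables, "group1_");
        } catch (ConcurrentModificationException e) {
            System.out.println("unsafe variant failed: " + e);
        }

        deleteGroupSafe(tables, "group1_");
        System.out.println("tables left: " + tables.size());
    }
}
```

The unsafe variant fails as soon as the iterator advances after the first removal, which matches the `KeyIterator.next` frame at the top of the trace.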
