Giter Club home page Giter Club logo

.github's Introduction

PLATON

Overview

Processing of large collections of real-world data series is nowadays one of the most challenging and critical problems for a wide range of diverse application domains, including finance, seismology and other earth sciences, astrophysics, neuroscience, engineering, etc. Due to recent advances in the development of modern scientific instruments in earth sciences and the dominance of the Internet of Things, data series collections experience an unprecedented growth in size. Therefore, one of the most pressing issues in data series processing is achieving enhanced performance and high scalability.

PLATON (Platform-aware LArge-scale Time-Series prOcessiNg) brings together a highly-experienced researcher in the field of the theory of concurrent and distributed computing with a hosting group which has world-leading expertise on data series management, indexing, and analysis, to harness the difficulties of large-scale data series processing by realizing the data series processing performance and scalability goals. Specifically, PLATON aspires to build, for the first time, the necessary methods, algorithms and tools for highly-efficient, scalable, and fault-tolerant processing of huge collections of data series. Scalability will be accomplished by exploiting the full computational capacity (multiple nodes, multiple cores, accelerators) of modern computing platforms. To ensure fault-tolerance, PLATON is designed to run in a way agnostic to the number and identities of the computing elements provided by the platform. PLATON demonstrates its value proposition using two real datasets from different domains. The proposed research project has the potential of great economic and social impact in Europe as multiple scientific and industrial fields are currently in need of the right tools, in order to handle their massive collections of data series, a fact acknowledged by the EU strategy on Big Data.

Objectives

  • PLATON will develop new algorithms and techniques for highly-efficient data series processing in a multi-node setting. This encompasses the design and implementation of 1) low-cost, query-aware data partitioning and mapping techniques for answering queries on large collections of data series in heterogeneous computing platforms, as well as 2) load balancing and communication primitives for multi-node query processing that will result in much better performance, high scalability, and fault-tolerance in large-scale data series processing.
  • The development of new fault-tolerant multi-threading indexes and query processing schemes for large data series collections.
  • PLATON will achieve enhanced performance by combining the power of general purpose CPUs with accelerators, such as Graphical Processing Units (GPUs).
  • PLATON will compile its different components into a powerful index for large-scale data series processing, coming up with a highly-efficient and scalable solution to facilitate processing of datasets that will be orders of magnitude larger (depending on the number of nodes utilized during the computation) than the current datasets tested by state-of-the-art such indexes.

.github's People

Watchers

Eleftherios Kosmas avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.