S-Graffito

Streaming Graph Processing At Scale

S-Graffito is a Streaming Graph Management System that addresses the processing of OLTP and OLAP queries on very large graphs with high streaming rates. These graphs are increasingly being deployed to capture relationships between entities (e.g., customers and catalog items in an online retail environment), both for transactional processing and for analytics (e.g., recommendation systems).

Streaming Graph Querying

Existing work on streaming graph systems, by and large, focuses on either:

  1. Maintenance of graph snapshots under a stream of updates for iterative graph analytic workloads, or
  2. Specialized systems for persistent query workloads that are tailored for the task at hand.

In a large number of applications, the unbounded nature of streaming graphs and the need for real-time answers on recent data make it impractical to employ snapshot-based techniques. Specialized systems, on the other hand, provide satisfactory performance for the task at hand, but they lack the flexibility to support a wide range of real-world scenarios.

The primary focus of this component is the efficient processing of persistent graph queries over large streaming graphs with very high edge arrival rates. We investigate query execution techniques and robust system architectures for an efficient and scalable Streaming Graph Management System (SGMS). In particular, we tackle the following problems for efficient persistent query evaluation over streaming graphs:

  1. Design and development of non-blocking algorithms for persistent evaluation of path navigation queries over streaming graphs.
  2. Query processing techniques for persistent graph queries with both structural and attribute-based predicates.
  3. Scale-out system architectures and distributed query evaluation techniques to scale to large streaming graphs arising in real-world applications.
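To make the idea of non-blocking persistent evaluation concrete, here is a minimal sketch in Python. It is not S-Graffito's algorithm: the class `TwoHopQuery`, its method `on_edge`, and the edge labels are hypothetical names chosen for illustration. The sketch registers one fixed two-hop path query and, on every edge arrival, immediately emits the new result pairs that the edge completes within a time-based window. It assumes the two labels are distinct and that timestamps arrive in non-decreasing order.

```python
from collections import defaultdict


class TwoHopQuery:
    """Hypothetical persistent query: report (x, z) whenever edges
    x -first_label-> y and y -second_label-> z are both inside the window."""

    def __init__(self, first_label, second_label, window):
        self.first_label = first_label
        self.second_label = second_label
        self.window = window
        self.by_target = defaultdict(list)  # y -> [(x, ts)] for first-label edges x -> y
        self.by_source = defaultdict(list)  # y -> [(z, ts)] for second-label edges y -> z

    def _live(self, entries, now):
        # Keep only edges that have not yet expired from the window.
        return [(v, ts) for v, ts in entries if now - ts < self.window]

    def on_edge(self, src, label, dst, ts):
        """Process one arriving edge and return the results it completes."""
        results = []
        if label == self.first_label:
            self.by_target[dst].append((src, ts))
            # Expire stale partner edges lazily, only for the touched vertex.
            self.by_source[dst] = self._live(self.by_source[dst], ts)
            results.extend((src, z) for z, _ in self.by_source[dst])
        if label == self.second_label:
            self.by_source[src].append((dst, ts))
            self.by_target[src] = self._live(self.by_target[src], ts)
            results.extend((x, dst) for x, _ in self.by_target[src])
        return results


query = TwoHopQuery("follows", "likes", window=10)
r1 = query.on_edge("a", "follows", "b", 1)
r2 = query.on_edge("b", "likes", "p", 5)
r3 = query.on_edge("c", "follows", "b", 8)
r4 = query.on_edge("b", "likes", "q", 12)  # the edge from "a" has expired by now
```

Results are produced per arriving edge rather than after a full pass over a snapshot, which is what makes the evaluation non-blocking. The sketch expires edges only for vertices it touches, so a real system would also need a background purge to bound memory.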

Streaming Graph Analytics

Graph analytics is concerned with estimating properties of the graph or finding patterns within a graph (e.g., finding cliques or densely connected clusters, subgraph matching, and finding frequent patterns/motifs). Running analytics tasks over streaming graphs is particularly challenging because of the unboundedness of the graph (i.e., sequential access to the unbounded structural events in the graph) as well as the potentially bursty and high-velocity arrivals. The growing need to process streaming graphs, with their ever-changing nature, has brought about a resurgence of interest in prediction-based analytics over streaming graphs (e.g., link prediction, node prediction, event time prediction, and pattern prediction).

The primary focus of this component is creating an analytics engine that ingests streaming records, batches them using sliding window semantics, and performs (several) machine learning-aided analytics tasks on each batch before retiring the corresponding window and ingesting the next batch. To this end, we design efficient algorithms for a generic analytics engine that is based on time-based windows (as the computation methodology) and low-dimensional vertex embeddings (as the analytics primitives). In particular, we tackle the following problems for efficient analytics over streaming graphs:

  1. Exploratory analysis of real-world streaming graphs
  2. Representation learning over streaming graphs
  3. Prediction-based analytics over streaming graphs
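The window-driven loop described for the analytics engine can be sketched roughly as follows. This is an illustrative Python sketch, not the project's engine: `sliding_windows` and `degree_counts` are hypothetical names, and a simple per-window degree count stands in for the embedding-based analytics tasks. It assumes edges are `(src, dst, ts)` tuples in timestamp order and that `slide` is no larger than `size`.

```python
from collections import Counter, deque


def sliding_windows(edges, size, slide):
    """Yield (start, end, batch) for time-based windows [start, start + size)."""
    buffer = deque()
    start = None
    for edge in edges:
        ts = edge[2]
        if start is None:
            start = ts  # the first window opens at the first timestamp
        # Close every window that ends at or before this edge's timestamp.
        while ts >= start + size:
            yield start, start + size, list(buffer)
            start += slide
            # Retire edges that fall before the next window's start.
            while buffer and buffer[0][2] < start:
                buffer.popleft()
        buffer.append(edge)
    if start is not None and buffer:
        # Flush the last, possibly partial, window at end of input.
        yield start, start + size, list(buffer)


def degree_counts(batch):
    """Placeholder analytics task: count edge endpoints per vertex in one batch."""
    counts = Counter()
    for src, dst, _ in batch:
        counts[src] += 1
        counts[dst] += 1
    return counts


edges = [("a", "b", 0), ("b", "c", 2), ("a", "c", 4), ("c", "d", 6)]
windows = list(sliding_windows(edges, size=4, slide=2))
per_window = [degree_counts(batch) for _, _, batch in windows]
```

Because consecutive windows overlap when `slide` is smaller than `size`, each edge can appear in more than one batch. A quiet period in the stream yields empty batches in this sketch, which a real engine might prefer to skip.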

Talks

Streaming Graph Processing and Analytics (Slides, Video)

Keynote at the 14th International Conference on Distributed and Event-Based Systems, 2020

Publications

Streaming Graph Analytics
Streaming Graph Querying

Artifacts

Streaming Graph Analytics

sGrapp: Butterfly Approximation in Streaming Graphs

sGrow: Explaining the Scale-Invariant Strength Assortativity of Streaming Butterflies

Transient Concepts in Streaming Graphs

Streaming Graph Querying

Evaluating Complex Queries on Streaming Graphs

People

M. Tamer Özsu

Angela Bonifati (Collaborator at Lyon 1 University)

Anil Pacaci (Former PhD Student)

Aida Sheshbolouki (Former PhD Student, Postdoctoral Fellow)

Kerem Akillioglu (MMath Student)
