Giter Club home page Giter Club logo

Comments (4)

robcowart avatar robcowart commented on June 13, 2024 1

I wanted to provide an update on this issue. I have done some testing in an environment where the ingestion rate averages 500 flows/sec over 24 hours (day time peaks are over 1000/sec). Reports covering a 24 hour period, which is 42 million records, are still rendering within 5 secs.

This was a 3 nodes cluster installed on bare-metal. 64GB RAM and SSD storage, which is really a "starter cluster".

I hope this gives you a bit more insight into how the solution can scale.

from elastiflow.

robcowart avatar robcowart commented on June 13, 2024

This is a question that can only be answered with "it depends". While you could have 1TB of data on a single Elasticsearch node, spreading it across 3 nodes will provide much better ingest and query performance (and 3 nodes is also the starting point if high availability and data redundancy are important.

Keep in mind that indexing data in Elasticsearch is not the same as writing records to a traditional SQL DB. The process of indexing data optimizes it for search... i.e. querying. So instead of having to scan through millions (or even billions) of rows to find those that contain data for a particular server, the index contains direct pointers to documents which refer to the server. This allows many types of queries to complete extremely fast compared to SQL DBs or many other storage mechanisms.

The guidance that I can provide is that most of the dashboards will render within 5 secs, even when querying 50-60 million flow records. For some environments that may be 30 days of data, in others that may be only 30 minutes (or less). Fortunately the Elastic Stack provides deployment options for either case, but you need to have an idea of your requirements first.

from elastiflow.

simPod avatar simPod commented on June 13, 2024

KK, thanks for quick reply! I think it might be worth giving it a try then

from elastiflow.

simPod avatar simPod commented on June 13, 2024

Thanks for update. I definitelly have better insight now. Currently, I'm experimenting with ClickHouse but want to give this a try too.

from elastiflow.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.