Comments (4)
I wanted to provide an update on this issue. I have done some testing in an environment where the ingestion rate averages 500 flows/sec over 24 hours (day time peaks are over 1000/sec). Reports covering a 24 hour period, which is 42 million records, are still rendering within 5 secs.
This was a 3 nodes cluster installed on bare-metal. 64GB RAM and SSD storage, which is really a "starter cluster".
I hope this gives you a bit more insight into how the solution can scale.
from elastiflow.
This is a question that can only be answered with "it depends". While you could have 1TB of data on a single Elasticsearch node, spreading it across 3 nodes will provide much better ingest and query performance (and 3 nodes is also the starting point if high availability and data redundancy are important.
Keep in mind that indexing data in Elasticsearch is not the same as writing records to a traditional SQL DB. The process of indexing data optimizes it for search... i.e. querying. So instead of having to scan through millions (or even billions) of rows to find those that contain data for a particular server, the index contains direct pointers to documents which refer to the server. This allows many types of queries to complete extremely fast compared to SQL DBs or many other storage mechanisms.
The guidance that I can provide is that most of the dashboards will render within 5 secs, even when querying 50-60 million flow records. For some environments that may be 30 days of data, in others that may be only 30 minutes (or less). Fortunately the Elastic Stack provides deployment options for either case, but you need to have an idea of your requirements first.
from elastiflow.
KK, thanks for quick reply! I think it might be worth giving it a try then
from elastiflow.
Thanks for update. I definitelly have better insight now. Currently, I'm experimenting with ClickHouse but want to give this a try too.
from elastiflow.
Related Issues (20)
- Import PCAP file from a USB File to ElasticFlow HOT 1
- RISKIQ behind proxy HOT 1
- Can't access ElastiFlow 5.0.0 in Docker HOT 2
- Centos 7 Install Logstsh sFlow codec fails HOT 1
- ElasticFlow : Netflow VLAN data is not populated in elastiflow HOT 2
- Elastiflow : Not generating enough data in Elastic HOT 2
- ElasticSearch 7.12 ... just checking :) HOT 3
- I can't see data (elastiflow) in kibana HOT 1
- docker image - flowcoll should recover once elasticsearch become available HOT 3
- index-pattern remain elastiflow empty HOT 8
- Invalid request payload JSON format HOT 4
- Elastic upgrade from 7.10 to 7.12 HOT 5
- source.as.organization.name always public HOT 2
- logstash is running but not able to see Sflow data HOT 3
- netflowv5 mikrotik no data in dashboard HOT 6
- kibana terms list might be incomplete because the request is taking to long HOT 1
- Mapper_parsing_exception HOT 2
- No matching indices found: No indices match pattern "elastiflow-flow-codex-*" HOT 2
- no data in threats HOT 2
- The legacy ElastiFlow is deprecated. TRY THE NEW ELASTIFLOW!!!
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from elastiflow.