Comments (4)
Hi,
You should have at least 32 GB of RAM per node, and configure Elassandra with half the total RAM as heap, up to a maximum of 30.5 GB. But if you are not aggregating on analyzed string fields, you can probably use a smaller heap and leave more memory for the file system cache.
For writes, Elassandra throughput is roughly half that of Cassandra if you index all columns. In this scenario, Cassandra and Elasticsearch files will be roughly the same size: for example, 100 GB of Cassandra data generates 100 GB of Elasticsearch index files if you map all columns. But again, write throughput and Elasticsearch disk usage will depend on your Elasticsearch mapping.
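As a rough illustration of the heap rule above (half the RAM, capped at 30.5 GB), here is a minimal Python sketch. The function name `recommended_heap_gb` is hypothetical and not part of Elassandra; it only encodes the arithmetic.

```python
def recommended_heap_gb(total_ram_gb: float, max_heap_gb: float = 30.5) -> float:
    """Return half of the node's RAM, capped at max_heap_gb (hypothetical helper)."""
    return min(total_ram_gb / 2, max_heap_gb)


if __name__ == "__main__":
    for ram in (32, 64, 128):
        print(f"{ram} GB RAM -> {recommended_heap_gb(ram)} GB heap")
```

Treat the result as an upper bound: as noted, a smaller heap may work better when you do not aggregate on analyzed string fields.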
In your cassandra schema, avoid huge indexed wide rows, because locking on wide rows can dramatically affect write performance.
For good search performance, keep shards under 50 GB each, and use a partitioned index to split a Cassandra table into more than one shard per node; see http://doc.elassandra.io/en/latest/mapping.html#partitioned-index.
Finally, depending on your hardware, you should have a few TB of data per node, and add more nodes to improve performance (index/search throughput) and/or increase the volume of data. Keep in mind that Elassandra should require fewer nodes than separate Elasticsearch + Cassandra nodes providing the same service.
Thanks,
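To get a feel for how many partitions that implies, here is a minimal sizing sketch in Python. It assumes the index is about the same size as the Cassandra data (the all-columns-mapped case above) and treats 50 GB as a per-shard ceiling; `min_shards_per_node` and its parameters are hypothetical names, not an Elassandra API.

```python
import math


def min_shards_per_node(cassandra_data_gb: float,
                        index_ratio: float = 1.0,
                        max_shard_gb: float = 50.0) -> int:
    """Estimate how many shards one node needs so that no shard exceeds max_shard_gb.

    index_ratio is the assumed index size relative to the Cassandra data
    (1.0 when all columns are mapped; your mapping may give a different ratio).
    """
    index_gb = cassandra_data_gb * index_ratio
    return max(1, math.ceil(index_gb / max_shard_gb))


if __name__ == "__main__":
    # 2 TB of Cassandra data on a node, all columns indexed
    print(min_shards_per_node(2000))
```

With a few TB per node this quickly reaches dozens of shards per node, which is the case a partitioned index is meant for. Pick a somewhat higher count if you want headroom below the 50 GB limit.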
Vincent
from elassandra.
Why do you say fewer nodes compared to Cassandra + Elasticsearch (assuming you don't store "_source" in Elasticsearch)? Or is that from lowering per-shard overhead, since you can put everything in one shard, while in normal Elasticsearch you have to over-provision the number of shards?
from elassandra.
"For write, elassandra write throughput is roughly half the throughput of cassandra if you index all columns."
It would be nice to be able to count on highly-optimized Cassandra write efficiency for significant operations on large data sets. Cassandra is UPSERT optimised, e.g. update a single property without reading the existing record. If a single property change is appended in Cassandra, then will Elassandra be smart enough to only reindex the one property, or will it reindex all properties for much less than half the throughput on single-property updates?
from elassandra.
@RayWhitmer it will reindex the full document.
from elassandra.