Comments (6)
For existing cluster, if you already have all required hbase-secondary index related configurations configured in your cluster machines(HMaster+Regionservers, else after making all configuration changes restart tour cluster) then you can make use of class "org.apache.hadoop.hbase.index.mapreduce.TableIndexer" to create index on existing user tables:
./hbase org.apache.hadoop.hbase.index.mapreduce.TableIndexer -Dtablename.to.index=<table_name> -Dtable.columns.index='IDX1=>cf1:[q1->datatype&length];cf2:[q1->datatype&length],[q2->datatype&length],[q3->datatype& lenght]#IDX2=>cf1:q5,q5'
Here,
tablename.to.index: Table name to create index.
table.columns.index : Table columns on which index to be created.
The format used here is:
IDX1 - Name of the Index given by user
cf1 - Column family name of user table
q1 - qualifier name
datatype - datatype of column values "cf1:q1"
[Int, String, Double, Float]
length - Maximum length of the values of "cf1:q1"
# is used to separate between two index details
from hindex.
may be you rowkey is too long, i think.
from hindex.
I have created a hase table and index table with hindex framework, but when we are uploading more data into same table, it keeps on increasing the size of index table only and no actual data is appearing in Hbase table. In this case my input data is 80 GB and the index table has grown to 200+ GB and no new data appearing in the main table.
Can rowkey size be a reason for such huge table size ?
from hindex.
index table rowkey contains the index column/value and user table rowkey. As you said, your user table data size has no change, so your index table affect data size.
from hindex.
Is there any detail description in how to implement hindex in an existing Cluster?
from hindex.
Thanks for your kind answer, abhi-kr. I did it successfully.
from hindex.
Related Issues (20)
- hindex write issues HOT 7
- Support secure access to the index tables HOT 2
- migrate hindex to hbase 0.96 or higher HOT 5
- deadlock in put operation HOT 3
- issues in put operation HOT 10
- how to use the index? HOT 8
- Error when compile with maven.
- How to use hindex for scanning data? HOT 20
- is there any new release based on hbase0.98.1+? HOT 3
- ERRORs in building with CDH5.1 HOT 10
- Indexing arbitrary column qualifier HOT 4
- Pending items in Arbitrary index feature
- Unify index details validation in preCreateTable and preModifyTable to allow proper indexes
- To determine the split threshold
- Perfomance about hindex in range search
- No compile document? HOT 3
- does hindex only support hbase 0.94.8?
- Your project Huawei-Hadoop hindex is using buggy third-party libraries [WARNING]
- 请求项目如何启动?
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from hindex.