Please tell me how the HBase partition table divides the regions.
For example, let them say that my string keys are integers from 0 to 10 M, and I have 10 registers.
Does this mean that the first register server will save all lines with keys with values 0 - 10M, the second 1M - 2M, the third 2M - 3M, ... the tenth 9M - 10M?
I would like my string key to be timestamp, but in most cases requests will be applied to the latest dates, all requests will be processed by only one register server, is this true?
Or maybe this data will be distributed differently?
Or maybe I can somehow create more regions than the servers of the region, so (according to this example) server 1 will have the keys 0 - 0.5M and 3M - 3.5M, so my data will be distributed more evenly possible?
Update
I just found that there is an option hbase.hregion.max.filesize , do you think this will solve my problem?
parallel-processing hbase hadoop
wlk
source share