I am considering the following scenario. A data file is sent to me daily, and I load it into HBase as a table named filename-yyyyMMdd. Over time I therefore end up with many tables, for example:
tempdb-20121220 tempdb-20121221 tempdb-20121222 tempdb-20121223 tempdb-20121224 tempdb-20121225
Now, given a specific date range, I want to get the list of tables that fall within that range so that I can build indexes over them. I am using hbase-0.90.6.
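To make the table selection concrete, here is a minimal sketch of the filtering I have in mind. It uses plain Java with no HBase calls; the class name TableRangeFilter and the method tablesInRange are hypothetical names made up for illustration, and it assumes every table name is a fixed prefix followed by a yyyyMMdd date and that java.time (Java 8+) is available. In a real job the names would presumably come from HBaseAdmin.listTables() rather than a hard-coded list.

```java
import java.time.LocalDate;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper: picks the table names whose date suffix lies in [from, to].
class TableRangeFilter {

    static List<String> tablesInRange(List<String> tableNames, String prefix,
                                      LocalDate from, LocalDate to) {
        List<String> matches = new ArrayList<>();
        for (String name : tableNames) {
            if (!name.startsWith(prefix)) {
                continue; // not one of the daily tables
            }
            String suffix = name.substring(prefix.length());
            try {
                // BASIC_ISO_DATE parses dates written as yyyyMMdd, e.g. 20121220
                LocalDate date = LocalDate.parse(suffix, DateTimeFormatter.BASIC_ISO_DATE);
                if (!date.isBefore(from) && !date.isAfter(to)) {
                    matches.add(name);
                }
            } catch (DateTimeParseException e) {
                // suffix is not a valid date, so skip this table
            }
        }
        return matches;
    }

    public static void main(String[] args) {
        // Hard-coded here; assumed to come from the cluster in practice.
        List<String> names = Arrays.asList(
                "tempdb-20121220", "tempdb-20121221", "tempdb-20121222",
                "tempdb-20121223", "tempdb-20121224", "tempdb-20121225");
        List<String> selected = tablesInRange(names, "tempdb-",
                LocalDate.of(2012, 12, 21), LocalDate.of(2012, 12, 23));
        System.out.println(selected);
        // prints [tempdb-20121221, tempdb-20121222, tempdb-20121223]
    }
}
```

Both ends of the range are treated as inclusive, and names with a different prefix or an unparseable suffix are silently skipped.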
From my research, TableMapReduceUtil.initTableMapperJob accepts only a single table name:
TableMapReduceUtil.initTableMapperJob(
    tableName,                // input HBase table name
    scan,                     // Scan instance to control CF and attribute selection
    HBaseIndexerMapper.class, // mapper
    null,                     // mapper output key
    null,                     // mapper output value
    job
);
I managed to get the list of tables and run the job over them in a loop, but what I really want is to go through all the tables, scan them (or something similar), and end up with merged/combined results for indexing purposes.
Any pointers on how to achieve this would be greatly appreciated.