Jar containing org.apache.hadoop.hive.dynamodb

I tried to programmatically upload the dynamodb table to HDFS (via java, not the hive), I could not find examples on the Internet how to do this, so I thought that I was loading a jar containing org.apache.hadoop.hive.dynamodb and reconstructing process.

Unfortunately, I could not find the file: (.

Can anyone answer the following questions for me (listed in priority order).

  • An example of Java that loads a dynamodb table into HDFS (which can be passed to the cartographer as a table input format).
  • a jar containing org.apache.hadoop.hive.dynamodb.

Thanks!

+1
source share
2 answers

In hive-bigbird-handler.jar . Unfortunately, AWS does not provide any sources, or at least Java Docs. But you can find the jar on any node of the EMR cluster:

 /home/hadoop/.versions/hive-0.8.1/auxlib/hive-bigbird-handler-0.8.1.jar 

You can check this article:

Unfortunately, Amazon havent released sources for hive-bigbird-handler.jar, which is a shame given its usefulness. Of particular note, it also includes built-in support for Hadoops input and output formats, so you can write directly to MapReduce Jobs, writing directly to DynamoDB.

Tip: find hive-bigbird-handler.jar to get to the interesting parts ...; -)

+1
source

1- I don't know any such examples, but you may find this library useful. It provides InputFormats, OutputFormats, and Writable classes for reading and writing data to Amazon DynamoDB tables.

2- I do not think they made it publicly available.

0
source

All Articles