Look for a general overview of Hadoop

I am looking for some Hadoop performance reviews (cluster of 300-600 boxes, commercial equipment), especially on the following aspects:

  • High concurrent read and write
  • Web Crawl
  • Mapreduce parallel computing
  • Inverted index
+5
source share
1 answer

This is not a specific question, maybe that's why no one has answered yet. Performance on a cluster of 3-600 nodes is best analyzed using tests.

However, I found some really interesting articles about Hadoop and its implementations in production:

, .

+2

All Articles