Large-scale dataset processing

From online discussion groups and blogs, I have seen that many interview questions are about processing large-scale datasets. I wonder if there is a systematic approach to analyzing this type of question. Or, more specifically, are there any data structures or algorithms that can be used to solve such problems? Any suggestions are really appreciated.

+5
4 answers

Large-scale datasets fall into a few categories that I have seen, each of which presents its own challenges.

  • Data that is too big to fit in memory, but fits on a single disk. Here you can:
    • stream the data, processing it in a single pass,
    • sort it externally with a bounded-memory merge sort (see the sketch after this list; watch your buffer sizes, though!)
    • build an on-disk index so you only read the records you actually need.
  • Data that fits in memory, but where the obvious algorithm is too slow. Before reaching for heavy machinery, pick a better algorithm or data structure; asymptotics and cache behavior matter long before hardware does.
  • Data that is too big for a single machine. This is the fun case, because you get to play with distributed systems :-) .
    • Partition the work and move the computation to the data. Map/reduce-style systems such as CouchDB and Hadoop are built for exactly this.
  • Data that arrives as an unbounded stream (so you cannot store it all) and must be handled online (more or less in real time). Since you cannot keep everything, you keep a compact summary instead. Two common approaches:
    • sampling (e.g., reservoir sampling)
    • sketching (e.g., Bloom filters, count-min sketches)
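To make the external-sorting bullet concrete, here is a minimal Python sketch (mine, not from the original answer; the one-integer-per-line format and the run size are assumptions for illustration). It sorts runs that fit in memory, spills each sorted run to a temporary file, and then lazily k-way-merges the runs with heapq.merge, so memory use stays bounded regardless of input size:

    import heapq
    import os
    import tempfile

    def external_sort(in_path, out_path, run_size=1_000_000):
        """Sort a file of integers (one per line) using bounded memory."""
        run_paths = []
        with open(in_path) as src:
            while True:
                # Read at most run_size lines: one in-memory "run".
                run = [line for _, line in zip(range(run_size), src)]
                if not run:
                    break
                run.sort(key=int)
                with tempfile.NamedTemporaryFile("w", delete=False) as tmp:
                    tmp.writelines(run)
                    run_paths.append(tmp.name)
        # Lazily k-way merge the sorted runs; heapq.merge never loads a
        # whole run into memory, only one line per run at a time.
        readers = [open(p) for p in run_paths]
        try:
            with open(out_path, "w") as dst:
                dst.writelines(heapq.merge(*readers, key=int))
        finally:
            for r in readers:
                r.close()
            for p in run_paths:
                os.remove(p)

The same two-phase shape, bounded-memory runs followed by a streaming merge, is what Unix sort(1) and database engines use for inputs larger than RAM.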

When one of these comes up in an interview, the interviewer usually cares less about a polished final answer than about how you scope the problem:

  • First pin down the scale: how big is the data, and does it fit in memory, on one disk, or on neither? The right approach changes completely at each boundary.
  • Then pin down the workload: do you need a single pass, repeated random access, or continuous queries? How up-to-date does the answer have to be?
+8

"" - , "" ( , SMP - ).

+1

Think in terms of external-memory algorithms. Once the data no longer fits in RAM, disk I/O dominates the running time, so you want access patterns and data structures that are designed for it, such as sequential scans and B+ trees.
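One easy way to see the effect without writing a B+ tree yourself (a sketch of mine with made-up table and file names, not part of the original answer): SQLite stores its tables and indexes as on-disk B-trees, so an indexed lookup reads a handful of pages where an unindexed one scans the whole file.

    import sqlite3

    conn = sqlite3.connect("events.db")  # hypothetical database file
    conn.execute("CREATE TABLE IF NOT EXISTS events (user_id INTEGER, payload TEXT)")
    conn.executemany(
        "INSERT INTO events VALUES (?, ?)",
        ((i % 1000, f"event-{i}") for i in range(100_000)),
    )
    # The index is an on-disk B-tree: lookups become O(log n) page reads
    # instead of a full table scan.
    conn.execute("CREATE INDEX IF NOT EXISTS idx_user ON events (user_id)")
    conn.commit()
    rows = conn.execute(
        "SELECT payload FROM events WHERE user_id = ?", (42,)
    ).fetchall()
    print(len(rows))
    conn.close()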

+1

It depends on the problem, but a few general techniques cover most of these questions. The common thread is that you rarely need all of the data in memory at once.

Partitioning - split the input into chunks that fit in memory, process each chunk independently, and then merge the partial results.

Sorting - an externally sorted file lets you find duplicates, join datasets, and answer range queries using nothing but sequential reads.

Hashing - route each record to a bucket file by its hash; records that ever need to be compared land in the same bucket, so each bucket can then be handled entirely in memory (a sketch follows below).
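Here is a minimal sketch of the hashing technique (mine, not the answerer's; the bucket count and file-naming scheme are arbitrary): deduplicating a file too large for memory in two passes. Equal lines always hash to the same bucket, so each bucket can be deduplicated with an ordinary in-memory set.

    import hashlib
    import os

    def dedup_large_file(in_path, out_path, buckets=64):
        # Pass 1: route every line to one of `buckets` files by its hash.
        outs = [open(f"{in_path}.bucket{i}", "w") for i in range(buckets)]
        try:
            with open(in_path) as src:
                for line in src:
                    digest = int(hashlib.md5(line.encode()).hexdigest(), 16)
                    outs[digest % buckets].write(line)
        finally:
            for fh in outs:
                fh.close()
        # Pass 2: duplicates share a bucket, so each bucket (assuming a
        # reasonably uniform hash) is small enough to dedup in memory.
        with open(out_path, "w") as dst:
            for i in range(buckets):
                bucket_path = f"{in_path}.bucket{i}"
                seen = set()
                with open(bucket_path) as bucket:
                    for line in bucket:
                        if line not in seen:
                            seen.add(line)
                            dst.write(line)
                os.remove(bucket_path)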

Alternatively, a DBMS can greatly simplify data access, but it adds some system overhead of its own.

0
