Google's MapReduce is a new parallelism framework for processing large amounts of data. Some recommended links are:
- Mining of Massive Datasets by Jure Leskovec, Anand Rajaraman, Jeff Ullman (http://www.mmds.org/).
- Wu-Jun Li's course at Shangai Jiao Tong University: http://cs.nju.edu.cn/lwj/course/mmds.html
Hey Alex
ReplyDeleteyou may like this book about about large scale text processing with mapreduce
http://www.umiacs.umd.edu/~jimmylin/book.html
Thanks, Andrei. Seems very good book.
ReplyDelete