Google has been granted a patent on “System and method for efficient large-scale data processing” http://patft.uspto.gov/netacgi/nph-Parser?Sect1=PTO1&Sect2=HITOFF&d=PALL&p=1&u=%2Fnetahtml%2FPTO%2Fsrchnum.htm&r=1&f=G&l=50&s1=7,650,331.PN.&OS=PN/7,650,331&RS=PN/7,650,331
I’m not going to discuss the fact of patenting itself. But when I read about it I started to click around and found a couple of amusing discussions which I hadn’t read before. A few years ago, when Google started to hype MapReduce, many bloggers (and non-bloggers) were spreading that hype without understanding what MapReduce really is. I argued with many folks and was trying to make the following simple points:
- MapReduce is not used by Google as a distributed DBMS, or as any sort of data retrieval / query execution system.
- It’s probably not a good idea to use MapReduce as a DBMS.
- MapReduce is based on trivial ideas. It does not deserve the hype.
At that time I wasn’t aware of these heated discussions (that are now even documented in Wikipedia):
Another interesting thing: Microsoft presents Dryad to Google . (11/01/07)
MapReduce with Dryad is one line of code: .SelectMany(mapper).GroupBy(selector, reducer)
Anagram of the day: Peace Drum = MapReduce, via Xworder