{"id":2825,"date":"2010-07-07T16:51:11","date_gmt":"2010-07-07T16:51:11","guid":{"rendered":"http:\/\/www.redmonk.com\/jgovernor\/?p=2825"},"modified":"2010-07-07T16:51:11","modified_gmt":"2010-07-07T16:51:11","slug":"emc-big-data-play-continues-greenplum-acquisition","status":"publish","type":"post","link":"https:\/\/redmonk.com\/jgovernor\/emc-big-data-play-continues-greenplum-acquisition\/","title":{"rendered":"EMC Big Data Play Continues: Greenplum Acquisition"},"content":{"rendered":"<p>I wrote recently about <a href=\"http:\/\/www.redmonk.com\/jgovernor\/2010\/04\/21\/vmwares-springsource-redis-and-rabbit-acquisitions-a-database-play-is-emerging\/\">VMware&#8217;s emerging Data Management play<\/a> after the announcement the firm was <a href=\"http:\/\/blogs.vmware.com\/console\/2010\/03\/vmware-hires-key-developer-for-redis.html\">hiring Redis lead developer Salvatore Sanfillipo<\/a>.<\/p>\n<blockquote><p>While [CEO Paul] Maritz may say VMware isn\u2019t getting into the database business,  he means not the <em>relational<\/em> database market. The fact is  application development has been dominated by relational- Oracle on  distributed, IBM on the mainframe \u2013 models. Cloud apps are changing  that. As alternative data stores become natural targets for new  application workloads VMware does indeed plan to become a database  player, or NoSQL player, or data store, or whatever you want to call it.<\/p>\n<p>We have been forcing round holes into square pegs with  object\/relational mapping for years, but the approach is breaking down.  Tools and datastores are becoming heterodox. something RedMonk has  heralded for years.<\/p><\/blockquote>\n<p>Now comes another interesting piece of the puzzle. EMC is acquiring Greenplum &#8211; and building a new division around the business, dubbed Data Computing Product Division. While Redis is a &#8220;NoSQL&#8221; data store, Greenplum represents a massively parallel processing architecture designed to take advantage of the new multicore architectures with pots of RAM: its designed to process data into chunks for parallel processing across these cores. While Greenplum has a somewhat traditional &#8220;datawarehouse&#8221; play &#8211; it also supports MapReduce processing. EMC will be competing with the firms like <a href=\"http:\/\/redmonk.com\/sogrady\/2009\/10\/02\/hadoopworld\/\">Hadoop packager Cloudera<\/a> [client] and <a href=\"http:\/\/redmonk.com\/sogrady\/2010\/03\/16\/rod-smith-interviews\/\">its partners such as IBM<\/a> [client]. Greenplum customers include Linkedin, which uses the system to support its new &#8220;People You May Know&#8221; function.<\/p>\n<p>There is a grand convergence beginning between NoSQL and distributed cache systems (see <a href=\"http:\/\/blogs.forrester.com\/mike_gualtieri\">Mike Gualtieri&#8217;s Elastic Cache piece<\/a>). It seems EMC plans to be a driver, not a fast follower. The Hadoop wave is just about ready to crash onto the enterprise, driven by the likes of EMC and IBM. Chuck Hollis, for example, <a href=\"http:\/\/chucksblog.emc.com\/chucks_blog\/2010\/07\/emc-to-acquire-greenplum.html\">points out Greenplum would make a great pre-packaged component VBlock<\/a> for VMware\/EMC\/Cisco&#8217;s VCE alliance &#8211; aimed at customers building private clouds. Of course Cisco is likely to make its own Big Data play anytime soon&#8230; That&#8217;s the thing with emergent, convergent markets- they sure make partnering hard. But for the customer the cost of analysing some types of data is set to fall by an order of magnitude, while query performance improves by an order of magnitude. Things are getting very interesting indeed.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I wrote recently about VMware&#8217;s emerging Data Management play after the announcement the firm was hiring Redis lead developer Salvatore Sanfillipo. While [CEO Paul] Maritz may say VMware isn\u2019t getting into the database business, he means not the relational database market. The fact is application development has been dominated by relational- Oracle on distributed, IBM<\/p>\n","protected":false},"author":5,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"spay_email":"","footnotes":"","jetpack_publicize_message":"","jetpack_is_tweetstorm":false},"categories":[1],"tags":[416,177,236],"class_list":["post-2825","post","type-post","status-publish","format-standard","hentry","category-uncategorized","tag-emc","tag-hadoop","tag-mapreduce"],"jetpack_featured_media_url":"","jetpack_publicize_connections":[],"jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p9wfjh-Jz","_links":{"self":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/posts\/2825","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/users\/5"}],"replies":[{"embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/comments?post=2825"}],"version-history":[{"count":0,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/posts\/2825\/revisions"}],"wp:attachment":[{"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/media?parent=2825"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/categories?post=2825"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/redmonk.com\/jgovernor\/wp-json\/wp\/v2\/tags?post=2825"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}