{"id":645631,"date":"2013-03-07T08:00:37","date_gmt":"2013-03-07T13:00:37","guid":{"rendered":"http:\/\/gigaom.com\/?p=616972"},"modified":"2013-03-07T08:00:37","modified_gmt":"2013-03-07T13:00:37","slug":"5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking","status":"publish","type":"post","link":"https:\/\/mereja.media\/index\/645631","title":{"rendered":"5 reasons why the future of Hadoop is real-time (relatively speaking)"},"content":{"rendered":"<p>In some ways, Hadoop is a like a fine wine: It gets better with age as rough edges (or flavor profiles) are smoothed out, and those who wait to consume it will probably have a better experience. The only problem with this is that Hadoop exists in a world that\u2019s more about <a href=\"http:\/\/www.urbandictionary.com\/define.php?term=md+20%2F20\">MD 20\/20<\/a> than it is about <a href=\"http:\/\/www.winespectator.com\/display\/show?id=47374\">Relentless Napa Valley 2008<\/a>: Companies often want to drink their big data fast, get drunk on insights, and then have some more \u2014 maybe something even stronger. And with data \u2014 unlike technology and tannins \u2014 it turns out older isn\u2019t always better.<\/p>\n<p>That\u2019s a crude analogy, of course, but it gets at the essence of what\u2019s currently plaguing Hadoop adoption and what will propel it forward in the next couple years. The work being done by companies like Cloudera and Hortonworks at the distribution level is great and important, as is MapReduce as a processing framework for certain types of batch workloads. But not every company can afford to be concerned about managing Hadoop on a day-to-day basis. And <a href=\"http:\/\/gigaom.com\/2012\/07\/07\/why-the-days-are-numbered-for-hadoop-as-we-know-it\/\">not every analytic job pairs well with MapReduce<\/a>.<\/p>\n<p>In Part I of our four-part series on Hadoop, we <a href=\"http:\/\/gigaom.com\/2013\/03\/04\/the-history-of-hadoop-from-4-nodes-to-the-future-of-data\/\">looked at how the technology was born<\/a> and grew into the juggernaut it is today. In Part II, <a href=\"http:\/\/gigaom.com\/2013\/03\/05\/the-hadoop-ecosystem-the-welcome-elephant-in-the-room-infographic\/\">we laid out the map of the current products and projects<\/a> that comprise the Hadoop ecosystem. In this installment, we\u2019ll take a closer look at some of them and how they\u2019re positioning themselves to be important players down the road.<\/p>\n<p>If there\u2019s one big Hadoop theme at our <a href=\"http:\/\/event.gigaom.com\/structuredata\/?utm_source=data&#38;utm_medium=editorial&#038;%2338;utm_campaign=intext&#038;%2338;utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&#038;%2338;utm_content=dharrisstructure\">Structure: Data conference<\/a> March 20-21 in New York, it\u2019s the new realization that people shouldn\u2019t be asking \u201cWhat\u2019s next after Hadoop?\u201d but rather \u201cWhat will Hadoop become next?\u201d. Based on what\u2019s transpiring today, the answer to that question is that Hadoop will become faster in all regards and more useful as a result.<\/p>\n<h2 id=\"interactivity-big-data-style\">Interactivity, big-data-style<\/h2>\n<div id=\"attachment_612788\" class=\"wp-caption alignright\" style=\"width: 310px\"><img loading=\"lazy\" decoding=\"async\" alt=\"Source: Shutterstock user hauhu.\" src=\"http:\/\/gigaom2.files.wordpress.com\/2013\/02\/shutterstock_37622056.jpg?w=300&#038;h=225\" width=\"300\" height=\"225\" class=\"size-medium wp-image-612788\"><\/p>\n<p class=\"wp-caption-text\">Source: Shutterstock user hauhu.<\/p>\n<\/div>\n<p>As I explained with some detail a couple weeks ago, <a href=\"http:\/\/gigaom.com\/2013\/02\/21\/sql-is-whats-next-for-hadoop-heres-whos-doing-it\/\">SQL is what\u2019s next for Hadoop<\/a>, and that\u2019s not because of familiarity alone or the types of queries permitted by SQL <del datetime=\"2013-03-07T02:30:39+00:00\"><\/del>on relational data<del datetime=\"2013-03-07T02:30:39+00:00\"><\/del>. It\u2019s also because the types of massively parallel processing engines developed to analyze relational data over the years are very fast. That means analysts can ask questions and get answers at speeds much closer to the speed of their intuitions than is possible when querying entire data sets using standard MapReduce.<\/p>\n<p>But just as SQL and its processing techniques bring something to Hadoop, Hadoop (the Hadoop Distributed File System, specifically) brings something to the table, too. Namely, it brings scale and flexibility that don\u2019t exist in the traditional data warehouse world, where new hardware and licenses can be expensive; so only the \u201cvaluable\u201d data makes its way inside and only after it has been fitted to a pre-defined structure. Hadoop, on the other hand, provides virtually unlimited scale and schema-free storage, so companies can store however much information they want in whatever format they want and worry later about what they\u2019ll actually use it for. (Actually, though, most Hadoop jobs do require some sort of structure in order to run, and Hadoop co-creator Mike Cafarella is <a href=\"http:\/\/cloudera.github.com\/RecordBreaker\/\">working on a project called RecordBreaker<\/a> that aims to automate this process for certain data types.)<\/p>\n<p>How hot is SQL-on-Hadoop space? I profiled the companies and projects working on it on Feb. 21, and since then EMC Greenplum <a href=\"http:\/\/gigaom.com\/2013\/02\/25\/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya\/\">announced a completely rewritten Hadoop distribution<\/a> that fuses its analytic database to Hadoop, and an entirely new player called <a href=\"http:\/\/jethrodata.com\/\">JethroData<\/a> emerged along with $4.5 million in funding. Even if there\u2019s a major shakeout, there will be a few lucky companies left standing to capitalize on a shift to Hadoop as\u00a0<em>the<\/em> center of data gravity that EMC Greenplum\u2019s Scott Yara (albeit a biased source) thinks will be the data equivalent of the mainframe\u2019s demise.<\/p>\n<h2 id=\"this-is-your-database-this-is-\">This is your database. This is your database on HDFS<\/h2>\n<p>The SQL versus NoSQL debate appears to be dying down as companies and developers begin to realize there\u2019s definitely a place for both in most environments, but a new debate \u2014 with Hadoop at the center \u2014 might be about to start up. At its core is <a href=\"http:\/\/datagravity.org\/\">the concept of data gravity<\/a> and the large, attractive (in a gravitational sense) entity that is HDFS. Here\u2019s the underlying question that might be posed: If I\u2019m already storing my unstructured data in HDFS and am expected to replace my data warehouse with it, too, why would I also run a handful of other databases that require a separate data store?<\/p>\n<p>This is in part why <a href=\"http:\/\/hbase.apache.org\/\">HBase<\/a> has attracted such a strong following despite its relative technical and commercial immaturity compared with comparable NoSQL database <a href=\"http:\/\/cassandra.apache.org\/\">Cassandra<\/a>. For applications that would benefit from a relational database, startups such as <a href=\"http:\/\/gigaom.com\/2012\/07\/24\/how-one-startup-wants-to-inject-hadoop-into-your-sql\/\">Drawn to Scale<\/a> and <a href=\"http:\/\/gigaom.com\/2012\/10\/17\/batten-down-the-analysts-its-a-big-data-bi-storm\/http:\/\/gigaom.com\/2012\/10\/17\/batten-down-the-analysts-its-a-big-data-bi-storm\/\">Splice Machine<\/a> have turned HBase into a transactional SQL system. Wibidata, the <a href=\"http:\/\/gigaom.com\/2012\/02\/07\/hadoop-startup-wibidata-raises-5m-to-power-web-analytics\/\">new startup from Cloudera C0-founder Christophe Bisciglia and Aaron Kimball<\/a>, is <a href=\"http:\/\/gigaom.com\/2012\/11\/14\/wibidata-open-sources-kiji-to-make-hbase-more-useful\/\">pushing an open source framework called Kiji<\/a> to make it easier to develop applications that use HBase.<\/p>\n<p>\u201cIf you talk to anyone from Cloudera or any of the platform vendors, I think they will tell you that a large percentage of their customers use HBase,\u201d Bisciglia said. \u201cIt\u2019s something that I only expect to see increasing.\u201d<\/p>\n<p>MapR seems to think so, too: the Hadoop-distribution vendor is getting ahead of the game by <a href=\"http:\/\/www.mapr.com\/products\/mapr-editions\/m7-edition\">selling an enterprise-grade version of HBase called M7<\/a>.\u00a0Should hot startups such as <a href=\"http:\/\/gigaom.com\/2012\/04\/13\/meet-tempodb-a-database-startup-with-an-eye-for-time\/\">TempoDB<\/a> and <a href=\"http:\/\/gigaom.com\/2013\/01\/16\/has-ayasdi-turned-machine-learning-into-a-magic-bullet\/\">Ayasdi<\/a> decide to take their HBase-reliant cloud services into the data center, they\u2019ll tap into Hadoop clusters, too.<\/p>\n<p>And the National Security Agency built <a href=\"http:\/\/accumulo.apache.org\/\">Apache Accumulo<\/a>, a key-value database similar to HBase but designed for fine-grained security and massive scale. It\u2019s now <a href=\"http:\/\/sqrrl.com\/\">being sold commercially by a startup called Sqrrl<\/a>. There\u2019s even a graph-processing project called <a href=\"http:\/\/incubator.apache.org\/giraph\/\">Giraph<\/a> that relies on HBase or Accumulo as the database layer.<\/p>\n<h2 id=\"whatever-real-time-means-to-yo\">Whatever \u201creal-time\u201d means to you<\/h2>\n<p>Real-time is one of those terms that means different things to different people and different applications. The interactivity that SQL-on-Hadoop technologies promise is one definition, as is the type of stream processing <a href=\"http:\/\/gigaom.com\/2011\/08\/04\/twitter-to-open-source-hadoop-like-tool\/\">enabled by technologies like Storm<\/a>. When it comes to the latter, there\u2019s a lot of excitement around YARN as the innovation will make it happen.<\/p>\n<p><a href=\"http:\/\/hortonworks.com\/blog\/introducing-apache-hadoop-yarn\/\">YARN, aka MapReduce 2.0<\/a>, is a resource scheduler and distributed application framework that allows Hadoop users to run processing paradigms other than MapReduce. This could mean things, from traditional parallel-processing methods such as MPI to graph processing to newly developed stream-processing engines such as Storm and <a href=\"http:\/\/incubator.apache.org\/s4\/\">S4<\/a>. Considering for how many years <em>Hadoop <\/em>meant <em>HDFS and MapReduce<\/em>, this type flexibility is certainly a big deal.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" alt=\"figure1\" src=\"http:\/\/gigaom2.files.wordpress.com\/2013\/03\/figure1.gif?w=300&#038;h=216\" width=\"300\" height=\"216\" class=\"size-medium wp-image-617741 alignleft\">Stream processing, of course, is the antithesis of batch processing, for which Hadoop is known, and which is inherently too slow for workloads such as serving real-time ads or monitoring sensor data. And even if Storm and other stream-processing platforms somehow don\u2019t make their way onto Hadoop clusters, <a href=\"http:\/\/gigaom.com\/2013\/02\/14\/hstreaming-ready-to-show-the-world-its-real-time-hadoop\/\">a startup called HStreaming has made it its mission<\/a> to deliver stream processing to Hadoop, and <a href=\"http:\/\/www.continuuity.com\/technology\">it\u2019s on other companies\u2019 radars, as well<\/a>.<\/p>\n<p>For what it\u2019s worth, though, <a href=\"http:\/\/verticloud.com\/\">VertiCloud<\/a> Founder and CEO and former Yahoo CTO Raymie Stata thinks we should do away with terms such as\u00a0<em><\/em>batch, real-time and interactive altogether. Instead, he prefers the terms synchronous and asynchronous to describe the human experience with the data rather than the speed of processing it. Synchronous computing happens at the speed of human activity, generally speaking, while asynchronous computing is largely decoupled from the idea of someone sitting in front of a computer screen awaiting a result.<\/p>\n<p>The change in terms is associated with a change in how you manage SLAs for applications. Uploading photos to Flickr: synchronous. Running a MapReduce job: most likely asynchronous. Ironically, according to Stata, stream processing data with Storm is often asynchronous, too. That\u2019s because there\u2019s probably not someone on the other end waiting for a page to update or a query to return. And unless you\u2019re doing something where guaranteed real-time latency is <em>necessary<\/em>, the occasional difference between milliseconds and 1 second probably isn\u2019t critical.<\/p>\n<p> <iframe loading=\"lazy\" width=\"100%\" height=\"166\" scrolling=\"no\" frameborder=\"no\" src=\"http:\/\/w.soundcloud.com\/player?url=http%3A%2F%2Fapi.soundcloud.com%2Ftracks%2F80972108%253Fsecret_token%253Ds-1QBTa\"><\/iframe> <\/p>\n<h2 id=\"time-to-insight-starts-at-the-\">Time to insight starts at the planning phase<\/h2>\n<p>Even when MapReduce is the answer, though, not everyone is game for a long Hadoop deployment process coupled with a consulting deal to identify uses and build applications or workflows. Sometimes, you just want to buy some software and get going.<\/p>\n<p>Already, companies such as Wibidata and Continuuity are trying to make it easier for companies to build Hadoop applications specific to their own needs, and Wibidata\u2019s Bisciglia said his company is doing less and less customization the more it deals with customers in the same vertical markets. \u201cI think it\u2019s still a couple years out before you can buy a generic application that runs on Hadoop,\u201d he told me, but he does see opportunity for billion-dollar businesses at this level, possibly selling the Hadoop equivalent of an ERP or CRM application.<\/p>\n<div id=\"attachment_603561\" class=\"wp-caption alignright\" style=\"width: 310px\"><img loading=\"lazy\" decoding=\"async\" alt=\"Structure Data 2012: Michael Olson \u2013 CEO, Cloudera\" src=\"http:\/\/gigaom2.files.wordpress.com\/2013\/01\/1z5o1503.jpg?w=300&#038;h=200\" width=\"300\" height=\"200\" class=\"size-medium wp-image-603561\"><\/p>\n<p class=\"wp-caption-text\">Cloudera CEO Mike Olson at Structure: Data 2012<br \/>(c) 2012 Pinar Ozger pinar@pinarozger.com<\/p>\n<\/div>\n<p>And Cloudera CEO Mike Olson <a href=\"http:\/\/gigaom.com\/2012\/03\/21\/cloudera-structure-data-2012\/\">told the audience at our Structure: Data conference last year<\/a> that he\u2019ll connect startups trying to build Hadoop-based applications with funding opportunities. In fact, Cloudera backer Accel Partners <a href=\"http:\/\/gigaom.com\/2011\/11\/08\/accel-forms-100m-fund-to-feed-big-data-apps\/\">launched a Big Data Fund in 2011<\/a> with the sole purpose of funding application-level big data startups.<\/p>\n<p>But maybe Cloudera, like database vendor Oracle before it, will just get into the application space itself: According to Hadoop creator and Cloudera chief architect Doug Cutting:<\/p>\n<blockquote id=\"quote-i-wouldnt-be-surpris\">\n<p>\u201cI wouldn\u2019t be surprised if you see vendors, like Cloudera, starting to creep up the stack and sell some applications. You\u2019ve seen that before from Red Hat, from Oracle. You could argue that the relational database is a platform for Oracle and they\u2019ve sold a lot of applications on top. So I think that happens as the market matures. When it\u2019s young, we don\u2019t want to stomp on potential collaborators at this point, we want to open that up to other people to really enhance the platform.\u201d<\/p>\n<\/blockquote>\n<p>Cloud computing is proving to be a big help in getting Hadoop projects off the ground, too. Even low-level services such as Amazon Elastic MapReduce can <a href=\"http:\/\/gigaom.com\/2012\/02\/22\/how-infochimps-wants-to-become-heroku-for-hadoop\/\">ease the burden of managing a physical Hadoop cluster<\/a>, and there are already a handful of cloud services <a href=\"http:\/\/gigaom.com\/2012\/04\/05\/kontagent-turns-data-mining-into-saas-for-mobile-apps\/\">exposing Hadoop as a SaaS application<\/a> for business intelligence and analytics. The easier it gets to store, process and analyze data in the cloud, the more appealing Hadoop looks to potential users who can\u2019t be bothered to invest in yet another IT project.<\/p>\n<h2 id=\"google-and-microsoft-a-guiding\">Google (and Microsoft): A guiding light<\/h2>\n<p>Lest we forget, Hadoop is based on a set of Google technologies, and it seems likely its future will also be influenced by what Google is doing. Already, <a href=\"http:\/\/hadoop.apache.org\/docs\/current\/hadoop-project-dist\/hadoop-hdfs\/Federation.html\">improvements to HDFS<\/a> seem to mirror <a href=\"http:\/\/www.theregister.co.uk\/2009\/08\/12\/google_file_system_part_deux\/\">changes to the Google File System a few years bac<\/a>k, and YARN will enable some new types of non-MapReduce processing similar to what <a href=\"http:\/\/research.google.com\/pubs\/pub36726.html\">Google\u2019s new Percolator framework<\/a> does. (Google claims Percolator lets it \u201cprocess the same number of documents per day, while reducing the average age of documents in Google search results by 50%.\u201d) The MapR-led Apache Drill project <a href=\"http:\/\/gigaom.com\/2012\/08\/17\/for-fast-interactive-hadoop-queries-drill-may-be-the-answer\/\">is a Hadoop-based version of Google\u2019s Dremel tool<\/a>; Giraph was likely inspired by Google\u2019s <a href=\"http:\/\/googleresearch.blogspot.com\/2009\/06\/large-scale-graph-computing-at-google.html\">Pregel graph-processing technology<\/a>.<\/p>\n<p>Cutting is particularly excited about Google Spanner, a database system that <a href=\"http:\/\/gigaom.com\/2012\/09\/17\/googles-spanner-a-database-that-knows-what-time-it-is\/\">spans data geographies while still maintaining transactional consistency<\/a>. \u201cIt\u2019s a matter of time before somebody implements that in the Hadoop ecosystem,\u201d he said. \u201cThat\u2019s a huge change.\u201d<\/p>\n<p>It\u2019s possible Microsoft could be an inspiration to the Hadoop community, too, especially if it begins to surface pieces of its Bing search infrastructure as products like a couple of company executives have told me it will. Bing <a href=\"http:\/\/research.microsoft.com\/en-us\/events\/fs2011\/helland_cosmos_big_data_and_big_challenges.pdf\">runs on a combination of tools called Cosmos, Tiger and Scope<\/a>, and it\u2019s part of the Online Services division ran by former Yahoo VP and Hadoop backer Qi Lu. Lu said that Microsoft (like Google) is looking beyond just search \u2014 Hadoop\u2019s original function \u2014 and into building an information fabric that changes how data is indexed, searched for and presented.<\/p>\n<p>However it evolves, though, it\u2019s becoming pretty obvious that Hadoop is no longer just a technology for doing cheap storage and some MapReduce processing. \u201cI think there\u2019s still some doubt in people\u2019s minds about whether Hadoop is a flash in the pan \u2026 and I think they\u2019re missing the point,\u201d Cutting said. \u201cI think that\u2019s going to be proven to people in the next year.\u201d<\/p>\n<p> <img loading=\"lazy\" decoding=\"async\" alt=\"\" border=\"0\" src=\"http:\/\/stats.wordpress.com\/b.gif?host=gigaom.com&#038;blog=14960843&#038;%23038;post=616972&#038;%23038;subd=gigaom2&#038;%23038;ref=&#038;%23038;feed=1\" width=\"1\" height=\"1\" \/><\/p>\n<p><a href=\"http:\/\/pubads.g.doubleclick.net\/gampad\/jump?iu=\/1008864\/GigaOM_RSS_300x250&#038;sz=300x250&#038;%23038;c=459529\"><img decoding=\"async\" src=\"http:\/\/pubads.g.doubleclick.net\/gampad\/ad?iu=\/1008864\/GigaOM_RSS_300x250&#038;sz=300x250&#038;%23038;c=459529\" \/><\/a><\/p>\n<p><strong>Related research and analysis from GigaOM Pro:<\/strong><br \/>Subscriber content. <a href=\"http:\/\/pro.gigaom.com\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&#038;utm_content=dharrisstructure\">Sign up for a free trial<\/a>.<\/p>\n<ul>\n<li><a href=\"http:\/\/pro.gigaom.com\/2012\/03\/a-near-term-outlook-for-big-data\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&#038;utm_content=dharrisstructure\">A near-term outlook for big data<\/a><\/li>\n<li><a href=\"http:\/\/pro.gigaom.com\/2012\/07\/scaling-hadoop-clusters-the-role-of-cluster-management\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&#038;utm_content=dharrisstructure\">Scaling Hadoop clusters: the role of cluster management<\/a><\/li>\n<li><a href=\"http:\/\/pro.gigaom.com\/2012\/04\/sector-roadmap-hadoop-platforms-2012\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=616972+5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking&#038;utm_content=dharrisstructure\">2012: The Hadoop infrastructure market booms<\/a><\/li>\n<\/ul>\n<p><img width='1' height='1' src='http:\/\/gigaom.feedsportal.com\/c\/34996\/f\/646446\/s\/294ea515\/mf.gif' border='0'\/><\/p>\n<div class='mf-viral'>\n<table border='0'>\n<tr>\n<td valign='middle'><a href=\"http:\/\/share.feedsportal.com\/viral\/sendEmail.cfm?lang=en&#038;title=5+reasons+why+the+future+of+Hadoop+is+real-time+%28relatively+speaking%29&#038;link=http%3A%2F%2Fgigaom.com%2F2013%2F03%2F07%2F5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking%2F\" ><img decoding=\"async\" src=\"http:\/\/res3.feedsportal.com\/images\/emailthis2.gif\" border=\"0\" \/><\/a><\/td>\n<td valign='middle'><a href=\"http:\/\/res.feedsportal.com\/viral\/bookmark.cfm?title=5+reasons+why+the+future+of+Hadoop+is+real-time+%28relatively+speaking%29&#038;link=http%3A%2F%2Fgigaom.com%2F2013%2F03%2F07%2F5-reasons-why-the-future-of-hadoop-is-real-time-relatively-speaking%2F\" ><img decoding=\"async\" src=\"http:\/\/res3.feedsportal.com\/images\/bookmark.gif\" border=\"0\" \/><\/a><\/td>\n<\/tr>\n<\/table>\n<\/div>\n<p><a href=\"http:\/\/da.feedsportal.com\/r\/159490600826\/u\/49\/f\/646446\/c\/34996\/s\/294ea515\/a2.htm\"><img decoding=\"async\" src=\"http:\/\/da.feedsportal.com\/r\/159490600826\/u\/49\/f\/646446\/c\/34996\/s\/294ea515\/a2.img\" border=\"0\"\/><\/a><img loading=\"lazy\" decoding=\"async\" width=\"1\" height=\"1\" src=\"http:\/\/pi.feedsportal.com\/r\/159490600826\/u\/49\/f\/646446\/c\/34996\/s\/294ea515\/a2t.img\" border=\"0\"\/><\/p>\n<div class=\"feedflare\">\n<a href=\"http:\/\/feeds.feedburner.com\/~ff\/OmMalik?a=3wNWjn7ZTpM:6u2eiOGR5Yc:yIl2AUoC8zA\"><img decoding=\"async\" src=\"http:\/\/feeds.feedburner.com\/~ff\/OmMalik?d=yIl2AUoC8zA\" border=\"0\"><\/img><\/a>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/feeds.feedburner.com\/~r\/OmMalik\/~4\/3wNWjn7ZTpM\" height=\"1\" width=\"1\"\/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In some ways, Hadoop is a like a fine wine: It gets better with age as rough edges (or flavor profiles) are smoothed out, and those who wait to consume it will probably have a better experience. The only problem with this is that Hadoop exists in a world that\u2019s more about MD 20\/20 than [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[],"class_list":["post-645631","post","type-post","status-publish","format-standard","hentry","category-news"],"_links":{"self":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/posts\/645631","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/comments?post=645631"}],"version-history":[{"count":0,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/posts\/645631\/revisions"}],"wp:attachment":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/media?parent=645631"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/categories?post=645631"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/tags?post=645631"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}