{"id":649255,"date":"2013-03-27T20:53:38","date_gmt":"2013-03-28T00:53:38","guid":{"rendered":"http:\/\/gigaom.com\/?p=618320"},"modified":"2013-03-27T20:53:38","modified_gmt":"2013-03-28T00:53:38","slug":"why-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen","status":"publish","type":"post","link":"https:\/\/mereja.media\/index\/649255","title":{"rendered":"Why Apple, eBay, and Walmart have some of the biggest data warehouses you\u2019ve ever seen"},"content":{"rendered":"<p>In an age of Hadoop and a general analytics revolution, it&#8217;s easy to poke fun at legacy data warehouse vendors such as Teradata. Some people might even call it fun. After all, they sell expensive appliances and weren&#8217;t built from the ground up to handle the unstructured data that most people think of when they think of &#8220;big data.&#8221;<\/p>\n<p>But whatever you think about Teradata&#8217;s approach to handling big data workloads, make no mistake about the company&#8217;s clout: It has been around for decades, and it&#8217;s still analyzing boatloads of data for some of the biggest names in business. I spent a day in February touring the Teradata Labs facility in San Diego, and although I heard all about the technology and the company&#8217;s vision for a Teradata-Hadoop-Aster analytics super-environment, the thing that stuck out most were the users. Walmart, eBay, Continental &#8230; Apple.<\/p>\n<p>Here&#8217;s how they&#8217;re all using Teradata and at what scale (try not to faint when you think of the bill):<\/p>\n<ul>\n<li><strong>Apple: <\/strong>Apple is operating a multiple-petabyte Teradata system (that <a href=\"http:\/\/feedproxy.google.com\/~r\/OmMalik\/~3\/z1srCE4iHWk\/gigaom.com\/2011\/06\/08\/the-webs-watchful-eye-fixes-on-apples-cloud-gear\/\">became apparent during its iCloud launch<\/a> in 2011) and, I learned, was Teradata&#8217;s &#8220;fastest ever customer to a petabyte.&#8221; Apple uses the data warehouse to get a better understanding of its customers across product groups. Now every piece of identifiable information &#8212; and those iTunes interactiona generate a lot of data &#8212; goes into the system so the company knows who&#8217;s who and what they&#8217;re up to.<\/li>\n<\/ul>\n<div id=\"attachment_625071\" class=\"wp-caption aligncenter\" style=\"width: 718px\"><a href=\"http:\/\/gigaom2.files.wordpress.com\/2013\/03\/apple-teradata.jpg\"><img loading=\"lazy\" decoding=\"async\" alt=\"Rows of Teradata appliances.\" src=\"http:\/\/gigaom2.files.wordpress.com\/2013\/03\/apple-teradata.jpg?w=708&#038;h=397\" width=\"708\" height=\"397\" class=\"size-full wp-image-625071\" \/><\/a><\/p>\n<p class=\"wp-caption-text\">Rows of Teradata appliances.<\/p>\n<\/div>\n<ul>\n<li><strong>Walmart:<\/strong> The retail giant deployed Teradata&#8217;s first-ever terabyte-scale database in 1992, and it has grown, uh, a bit since then. Its operational system <a href=\"http:\/\/www.computerworld.com\/s\/article\/9117159\/Teradata_creates_elite_club_for_petabyte_plus_data_warehouse_customers\">was at 2.5 petabytes as of 2008<\/a>, and is certainly leaps and bounds bigger by now &#8212; likely well into the double digits when you consider it operates separate ones for Walmart and Sam&#8217;s Club as well as a backup system. The analytics efforts have essentially helped Walmart become a massive consignment shop. It tells suppliers, &#8220;You have three feet of shelf space. Optimize it.&#8221; And then it gives them any data they could possibly need to determine what&#8217;s selling, how fast and even whether they should redesign their packaging to fit more on the shelves.<\/li>\n<li><strong>eBay: <\/strong>eBay (e ebay) has two systems in place, and they&#8217;re both big. Its primary data warehouse is 9.2 petabyes; its &#8220;singularity system&#8221; that stores web clicks and other &#8220;big&#8221; data is more than 40 petabytes. It has a single table that&#8217;s 1 trillion rows. Yes, this is smaller than the <a href=\"http:\/\/gigaom.com\/2012\/04\/06\/making-the-web-more-efficient-a-thousand-servers-at-a-time\/\">50 petabytes worth of Hadoop capacity eBay added last year<\/a>, but Teradata is quick to point out that all of its systems support data into and out of Hadoop, so <a href=\"http:\/\/gigaom.com\/2012\/01\/31\/under-the-covers-of-ebays-big-data-operation\/\">it&#8217;s not as if eBay is operating two entirely distinct data environments<\/a>.<\/li>\n<\/ul>\n<p>Of course, Teradata has lots of other petabyte-scale customers, with Verizon, AT&#38;T and Bank of America among them. Here are a few more interesting use cases:<\/p>\n<ul>\n<li>Harrah&#8217;s (now part of the Caesar&#8217;s Entertainment casino empire) understands how much money particular gamblers can afford to lose in a day before they won&#8217;t come back the next day.<\/li>\n<li>Disney is rolling out new bracelet tickets equipped with GPS and NFC that track everything visitors do while inside Disney&#8217;s amusement parks. The <em>New York Times <\/em><a href=\"http:\/\/www.nytimes.com\/2013\/01\/07\/business\/media\/at-disney-parks-a-bracelet-meant-to-build-loyalty-and-sales.html?pagewanted=all&#38;_r=0\">detailed the privacy implications of this move in a January article<\/a>.<\/li>\n<li>A manufacturing customer generates 20 terabytes of data per hour while testing products, although that volume is ultimately reduced to about 1 terabyte after the valuable data is filtered out.<\/li>\n<li>At some point, Continental Airlines decided it wanted to keep its customers happy and began assessing them by lifetime value (which, it turns out, is often inversely related to frequent-flyer status) and began making alternative arrangements for them as soon as the airline realized flights would be delayed.<\/li>\n<li>A luxury car company used Aster Data to analyze the pattern of failures for various components inside its cars. It found out that lighting, seats and infotainment often failed together (they&#8217;re on the same circuit) and began inspecting all three when a customer comes in for service on any of them.<\/li>\n<\/ul>\n<p><a href=\"http:\/\/gigaom2.files.wordpress.com\/2013\/03\/bmw.jpg\"><img loading=\"lazy\" decoding=\"async\" alt=\"bmw\" src=\"http:\/\/gigaom2.files.wordpress.com\/2013\/03\/bmw.jpg?w=708&#038;h=393\" width=\"708\" height=\"393\" class=\"aligncenter size-large wp-image-625076\" \/><\/a><\/p>\n<p>None of this means Teradata is destined to continue being a huge name in analytics (Scott Yara, co-founder of rival EMC Greenplum, recently <a href=\"http:\/\/gigaom.com\/2013\/02\/25\/emc-to-hadoop-competition-see-ya-wouldnt-wanna-be-ya\/\">called data warehouses this generation&#8217;s mainframe<\/a>), but it&#8217;s still interesting to learn how big companies are analyzing their data, regardless what they&#8217;re running on. And with exabytes worth of data no doubt residing in customer systems across the world, Teradata isn&#8217;t going anywhere soon.<\/p>\n<p> <img loading=\"lazy\" decoding=\"async\" alt=\"\" border=\"0\" src=\"http:\/\/stats.wordpress.com\/b.gif?host=gigaom.com&#038;blog=14960843&#038;%23038;post=618320&#038;%23038;subd=gigaom2&#038;%23038;ref=&#038;%23038;feed=1\" width=\"1\" height=\"1\" \/><\/p>\n<p><a href=\"http:\/\/pubads.g.doubleclick.net\/gampad\/jump?iu=\/1008864\/GigaOM_RSS_300x250&#038;sz=300x250&#038;%23038;c=755894\"><img decoding=\"async\" src=\"http:\/\/pubads.g.doubleclick.net\/gampad\/ad?iu=\/1008864\/GigaOM_RSS_300x250&#038;sz=300x250&#038;%23038;c=755894\" \/><\/a><\/p>\n<p><strong>Related research and analysis from GigaOM Pro:<\/strong><br \/>Subscriber content. <a href=\"http:\/\/pro.gigaom.com\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=618320+why-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen&#038;utm_content=dharrisstructure\">Sign up for a free trial<\/a>.<\/p>\n<ul>\n<li><a href=\"http:\/\/pro.gigaom.com\/2012\/05\/the-importance-of-putting-the-u-and-i-in-visualization\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=618320+why-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen&#038;utm_content=dharrisstructure\">The importance of putting the U and I in visualization<\/a><\/li>\n<li><a href=\"http:\/\/pro.gigaom.com\/2012\/04\/sector-roadmap-hadoop-platforms-2012\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=618320+why-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen&#038;utm_content=dharrisstructure\">2012: The Hadoop infrastructure market booms<\/a><\/li>\n<li><a href=\"http:\/\/pro.gigaom.com\/2012\/03\/a-near-term-outlook-for-big-data\/?utm_source=data&#038;utm_medium=editorial&#038;utm_campaign=auto3&#038;utm_term=618320+why-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen&#038;utm_content=dharrisstructure\">A near-term outlook for big data<\/a><\/li>\n<\/ul>\n<p><img width='1' height='1' src='http:\/\/gigaom.feedsportal.com\/c\/34996\/f\/646446\/s\/2a1221c5\/mf.gif' border='0'\/><\/p>\n<div class='mf-viral'>\n<table border='0'>\n<tr>\n<td valign='middle'><a href=\"http:\/\/share.feedsportal.com\/viral\/sendEmail.cfm?lang=en&#038;title=Why+Apple%2C+eBay%2C+and+Walmart+have+some+of+the+biggest+data+warehouses+you%E2%80%99ve+ever+seen&#038;link=http%3A%2F%2Fgigaom.com%2F2013%2F03%2F27%2Fwhy-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen%2F\" ><img decoding=\"async\" src=\"http:\/\/res3.feedsportal.com\/images\/emailthis2.gif\" border=\"0\" \/><\/a><\/td>\n<td valign='middle'><a href=\"http:\/\/res.feedsportal.com\/viral\/bookmark.cfm?title=Why+Apple%2C+eBay%2C+and+Walmart+have+some+of+the+biggest+data+warehouses+you%E2%80%99ve+ever+seen&#038;link=http%3A%2F%2Fgigaom.com%2F2013%2F03%2F27%2Fwhy-apple-ebay-and-walmart-have-some-of-the-biggest-data-warehouses-youve-ever-seen%2F\" ><img decoding=\"async\" src=\"http:\/\/res3.feedsportal.com\/images\/bookmark.gif\" border=\"0\" \/><\/a><\/td>\n<\/tr>\n<\/table>\n<\/div>\n<p><a href=\"http:\/\/da.feedsportal.com\/r\/161990725746\/u\/49\/f\/646446\/c\/34996\/s\/2a1221c5\/a2.htm\"><img decoding=\"async\" src=\"http:\/\/da.feedsportal.com\/r\/161990725746\/u\/49\/f\/646446\/c\/34996\/s\/2a1221c5\/a2.img\" border=\"0\"\/><\/a><img loading=\"lazy\" decoding=\"async\" width=\"1\" height=\"1\" src=\"http:\/\/pi.feedsportal.com\/r\/161990725746\/u\/49\/f\/646446\/c\/34996\/s\/2a1221c5\/a2t.img\" border=\"0\"\/><\/p>\n<div class=\"feedflare\">\n<a href=\"http:\/\/feeds.feedburner.com\/~ff\/OmMalik?a=z1srCE4iHWk:diZJvey7KGY:yIl2AUoC8zA\"><img decoding=\"async\" src=\"http:\/\/feeds.feedburner.com\/~ff\/OmMalik?d=yIl2AUoC8zA\" border=\"0\"><\/img><\/a>\n<\/div>\n<p><img loading=\"lazy\" decoding=\"async\" src=\"http:\/\/feeds.feedburner.com\/~r\/OmMalik\/~4\/z1srCE4iHWk\" height=\"1\" width=\"1\"\/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>In an age of Hadoop and a general analytics revolution, it&#8217;s easy to poke fun at legacy data warehouse vendors such as Teradata. Some people might even call it fun. After all, they sell expensive appliances and weren&#8217;t built from the ground up to handle the unstructured data that most people think of when they [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[7],"tags":[],"class_list":["post-649255","post","type-post","status-publish","format-standard","hentry","category-news"],"_links":{"self":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/posts\/649255","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/comments?post=649255"}],"version-history":[{"count":0,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/posts\/649255\/revisions"}],"wp:attachment":[{"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/media?parent=649255"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/categories?post=649255"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mereja.media\/index\/wp-json\/wp\/v2\/tags?post=649255"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}