{"id":67181,"date":"2015-04-06T11:48:52","date_gmt":"2015-04-06T11:48:52","guid":{"rendered":"http:\/\/www.esri.com\/arcgis-blog\/products\/product\/uncategorized\/an-introduction-to-big-data\/"},"modified":"2021-08-03T00:45:52","modified_gmt":"2021-08-03T07:45:52","slug":"an-introduction-to-big-data","status":"publish","type":"blog","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data","title":{"rendered":"An Introduction to Big Data"},"author":3981,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"open","ping_status":"closed","template":"","format":"standard","meta":{"_acf_changed":false,"_searchwp_excluded":""},"categories":[23851],"tags":[25351,25671,25391],"industry":[],"product":[],"class_list":["post-67181","blog","type-blog","status-publish","format-standard","hentry","category-data-management","tag-big-data","tag-github","tag-hadoop"],"acf":{"short_description":"Two years ago the Big Data team released GIS Tools for Hadoop on GitHub. GIS Tools for Hadoop is an open source project that allows users...","flexible_content":[{"acf_fc_layout":"content","content":"<p><a href=\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2015\/04\/ArcGISHadoop.png\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft\" src=\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2015\/04\/ArcGISHadoop.png\" alt=\"\" width=\"289\" height=\"272\" \/><\/a><\/p>\n<p>Two years ago the <a href=\"http:\/\/blogs.esri.com\/esri\/arcgis\/2013\/03\/25\/gis-tools-for-hadoop\/\">Big Data team released<\/a> GIS Tools for Hadoop on GitHub. <a href=\"http:\/\/esri.github.io\/gis-tools-for-hadoop\/\">GIS Tools for Hadoop<\/a> is an open source project that allows users to integrate <a href=\"https:\/\/hadoop.apache.org\/\">Hadoop<\/a> (a distributed big data platform) with <a href=\"https:\/\/www.esri.com\/en-us\/arcgis\/products\/spatial-analytics-data-science\/overview\">big spatial data<\/a>, complete distributed spatial analysis, and move data between the Hadoop Distributed Filing System (HDFS) and ArcGIS Desktop.<\/p>\n<p>Until now, it has been difficult for many GIS users to take full advantage of these tools, or even just try them out (and see what all this big data talk is about). We know that not everyone has a cluster sitting around (<a href=\"http:\/\/blogs.esri.com\/esri\/arcgis\/2014\/07\/28\/setting-up-a-small-budget-hadoop-cluster-for-big-data-analysis\/\">although they are cheaper than you\u2019d think<\/a>) so we have put together a <a href=\"https:\/\/github.com\/Esri\/gis-tools-for-hadoop\/wiki\/GIS-Tools-for-Hadoop-for-Beginners\">tutorial for beginners<\/a> &#8211; no cluster or development experience needed!<\/p>\n<p>This tutorial takes you through the steps of downloading and starting up a virtual machine (<a href=\"http:\/\/hortonworks.com\/products\/hortonworks-sandbox\/#overview\">a self-contained portable Hadoop environment<\/a>), accessing GIS Tools for Hadoop through GitHub, and pointing you towards <a href=\"https:\/\/github.com\/Esri\/gis-tools-for-hadoop\/wiki\">tutorials<\/a> and <a href=\"https:\/\/github.com\/Esri\/gis-tools-for-hadoop\/tree\/master\/samples\">samples<\/a> that teach you how to complete analyses on your big spatial data.<\/p>\n<p>Check out the <a href=\"https:\/\/github.com\/Esri\/gis-tools-for-hadoop\/wiki\">tutorial<\/a> on GitHub, and let us know if you have any questions, or other tutorials you want to see on our <a href=\"https:\/\/geonet.esri.com\/groups\/big-data\">GeoNet page<\/a>.<\/p>\n<p><em>(Post submitted by Sarah Ambrose, Big Data Team)<\/em><\/p>\n"}],"authors":[{"ID":3981,"user_firstname":"Jonathan","user_lastname":"Murphy","nickname":"Jonathan Murphy","user_nicename":"jonmurphy","display_name":"Jonathan Murphy","user_email":"jonathan_murphy@esri.com","user_url":"","user_registered":"2018-03-02 00:15:37","user_description":"Product Owner, UX Designer and Content Strategist on the Geodatabase team at Esri. \r\nWriter, musician, cockatiel whisperer and prolific world traveler.","user_avatar":"<img data-del=\"avatar\" src='https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/04\/J_Mu-213x200.png' class='avatar pp-user-avatar avatar-96 photo ' height='96' width='96'\/>"}],"related_articles":"","card_image":false,"wide_image":false},"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.9 (Yoast SEO v25.9) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>An Introduction to Big Data<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"An Introduction to Big Data\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data\" \/>\n<meta property=\"og:site_name\" content=\"ArcGIS Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/esrigis\/\" \/>\n<meta property=\"article:modified_time\" content=\"2021-08-03T07:45:52+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@ESRI\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":[\"Article\",\"BlogPosting\"],\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data\"},\"author\":{\"name\":\"Jonathan Murphy\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/dec789ad68db472c6018c1c9068998be\"},\"headline\":\"An Introduction to Big Data\",\"datePublished\":\"2015-04-06T11:48:52+00:00\",\"dateModified\":\"2021-08-03T07:45:52+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data\"},\"wordCount\":5,\"commentCount\":0,\"publisher\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#organization\"},\"keywords\":[\"Big Data\",\"GitHub\",\"Hadoop\"],\"articleSection\":[\"Data Management\"],\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"CommentAction\",\"name\":\"Comment\",\"target\":[\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#respond\"]}]},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data\",\"name\":\"An Introduction to Big Data\",\"isPartOf\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#website\"},\"datePublished\":\"2015-04-06T11:48:52+00:00\",\"dateModified\":\"2021-08-03T07:45:52+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.esri.com\/arcgis-blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"An Introduction to Big Data\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#website\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/\",\"name\":\"ArcGIS Blog\",\"description\":\"Get insider info from Esri product teams\",\"publisher\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.esri.com\/arcgis-blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#organization\",\"name\":\"Esri\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png\",\"contentUrl\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png\",\"width\":400,\"height\":400,\"caption\":\"Esri\"},\"image\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/esrigis\/\",\"https:\/\/x.com\/ESRI\",\"https:\/\/www.linkedin.com\/company\/5311\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/dec789ad68db472c6018c1c9068998be\",\"name\":\"Jonathan Murphy\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/04\/J_Mu-213x200.png\",\"contentUrl\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/04\/J_Mu-213x200.png\",\"caption\":\"Jonathan Murphy\"},\"description\":\"Product Owner, UX Designer and Content Strategist on the Geodatabase team at Esri. Writer, musician, cockatiel whisperer and prolific world traveler.\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/author\/jonmurphy\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"An Introduction to Big Data","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data","og_locale":"en_US","og_type":"article","og_title":"An Introduction to Big Data","og_url":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data","og_site_name":"ArcGIS Blog","article_publisher":"https:\/\/www.facebook.com\/esrigis\/","article_modified_time":"2021-08-03T07:45:52+00:00","twitter_card":"summary_large_image","twitter_site":"@ESRI","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":["Article","BlogPosting"],"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#article","isPartOf":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data"},"author":{"name":"Jonathan Murphy","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/dec789ad68db472c6018c1c9068998be"},"headline":"An Introduction to Big Data","datePublished":"2015-04-06T11:48:52+00:00","dateModified":"2021-08-03T07:45:52+00:00","mainEntityOfPage":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data"},"wordCount":5,"commentCount":0,"publisher":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#organization"},"keywords":["Big Data","GitHub","Hadoop"],"articleSection":["Data Management"],"inLanguage":"en-US","potentialAction":[{"@type":"CommentAction","name":"Comment","target":["https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#respond"]}]},{"@type":"WebPage","@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data","url":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data","name":"An Introduction to Big Data","isPartOf":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#website"},"datePublished":"2015-04-06T11:48:52+00:00","dateModified":"2021-08-03T07:45:52+00:00","breadcrumb":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/product\/data-management\/an-introduction-to-big-data#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.esri.com\/arcgis-blog\/"},{"@type":"ListItem","position":2,"name":"An Introduction to Big Data"}]},{"@type":"WebSite","@id":"https:\/\/www.esri.com\/arcgis-blog\/#website","url":"https:\/\/www.esri.com\/arcgis-blog\/","name":"ArcGIS Blog","description":"Get insider info from Esri product teams","publisher":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.esri.com\/arcgis-blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.esri.com\/arcgis-blog\/#organization","name":"Esri","url":"https:\/\/www.esri.com\/arcgis-blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png","contentUrl":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png","width":400,"height":400,"caption":"Esri"},"image":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/esrigis\/","https:\/\/x.com\/ESRI","https:\/\/www.linkedin.com\/company\/5311\/"]},{"@type":"Person","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/dec789ad68db472c6018c1c9068998be","name":"Jonathan Murphy","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/image\/","url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/04\/J_Mu-213x200.png","contentUrl":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/04\/J_Mu-213x200.png","caption":"Jonathan Murphy"},"description":"Product Owner, UX Designer and Content Strategist on the Geodatabase team at Esri. Writer, musician, cockatiel whisperer and prolific world traveler.","url":"https:\/\/www.esri.com\/arcgis-blog\/author\/jonmurphy"}]}},"text_date":"April 6, 2015","author_name":"Jonathan Murphy","author_page":"https:\/\/www.esri.com\/arcgis-blog\/author\/jonmurphy","custom_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2025\/08\/Newsroom-Keyart-Wide-1920-x-1080.jpg","primary_product":false,"tag_data":[{"term_id":25351,"name":"Big Data","slug":"big-data","term_group":0,"term_taxonomy_id":25351,"taxonomy":"post_tag","description":"","parent":0,"count":36,"filter":"raw"},{"term_id":25671,"name":"GitHub","slug":"github","term_group":0,"term_taxonomy_id":25671,"taxonomy":"post_tag","description":"","parent":0,"count":4,"filter":"raw"},{"term_id":25391,"name":"Hadoop","slug":"hadoop","term_group":0,"term_taxonomy_id":25391,"taxonomy":"post_tag","description":"","parent":0,"count":3,"filter":"raw"}],"category_data":[{"term_id":23851,"name":"Data Management","slug":"data-management","term_group":0,"term_taxonomy_id":23851,"taxonomy":"category","description":"","parent":0,"count":920,"filter":"raw"}],"product_data":[],"primary_product_link":"https:\/\/www.esri.com\/arcgis-blog\/","_links":{"self":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/blog\/67181","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/blog"}],"about":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/types\/blog"}],"author":[{"embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/users\/3981"}],"replies":[{"embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/comments?post=67181"}],"version-history":[{"count":0,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/blog\/67181\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/media?parent=67181"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/categories?post=67181"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/tags?post=67181"},{"taxonomy":"industry","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/industry?post=67181"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/product?post=67181"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}