{"id":722342,"date":"2020-02-14T08:33:33","date_gmt":"2020-02-14T16:33:33","guid":{"rendered":"https:\/\/www.esri.com\/arcgis-blog\/?post_type=blog&#038;p=722342"},"modified":"2020-02-23T07:05:45","modified_gmt":"2020-02-23T15:05:45","slug":"data-selection-and-preparation","status":"publish","type":"blog","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation","title":{"rendered":"Data selection and preparation"},"author":6891,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","format":"standard","meta":{"_acf_changed":false,"_searchwp_excluded":""},"categories":[23851],"tags":[173712,42181],"industry":[],"product":[36811,36561],"class_list":["post-722342","blog","type-blog","status-publish","format-standard","hentry","category-data-management","tag-arcgis-hub","tag-arcgis-pro","product-arcgis-hub","product-arcgis-pro"],"acf":{"short_description":"Take a look at a process for data search, selection and cleanup.","flexible_content":[{"acf_fc_layout":"content","content":"<p>Do you perform online searches to look for publicly available datasets that contain the information you need for an analysis? Of course. Now, how often do you find a clean dataset with the exact information that you need, nothing more, nothing less, and in a form that is ready to use for your scenario? Probably very rarely. In this blog post, we will look at a process for data search, selection and cleanup.<\/p>\n<p>Let&#8217;s say you are an analyst at a marketing firm. Your client is a university that wants to boost its enrollment from public schools in Marion County, Indiana. You are responsible for allocating the resources available to you at your firm towards outreach and promotion efforts on behalf of the university. To begin your analysis, you need the locations of all public high schools in Marion County, IN. A search on <a href=\"https:\/\/hub.arcgis.com\/\">ArcGIS Hub<\/a> for \u201cpublic schools united states\u201d returns several results, among which is a dataset of all public schools in the United States, shared by the Oak Ridge National Laboratory, which has been updated recently.<\/p>\n"},{"acf_fc_layout":"image","image":{"ID":722692,"id":722692,"title":"Public Schools dataset on ArcGIS Hub","filename":"Image1.png","filesize":21363,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image1-5","alt":"The 'Last Updated' date tells us that this data was updated recently.","author":"6891","description":"","caption":"","name":"image1-5","status":"inherit","uploaded_to":722342,"date":"2020-02-03 23:52:15","modified":"2020-02-03 23:54:02","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":853,"height":280,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1.png","medium-width":464,"medium-height":152,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1.png","medium_large-width":768,"medium_large-height":252,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1.png","large-width":853,"large-height":280,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1.png","1536x1536-width":853,"1536x1536-height":280,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1.png","2048x2048-width":853,"2048x2048-height":280,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1-826x271.png","card_image-width":826,"card_image-height":271,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image1.png","wide_image-width":853,"wide_image-height":280}},"image_position":"left-center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>Click on the title <strong>Public Schools<\/strong> to open it.<\/p>\n"},{"acf_fc_layout":"image","image":{"ID":722732,"id":722732,"title":"Image2","filename":"Image2.png","filesize":116918,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image2-6","alt":"","author":"6891","description":"","caption":"","name":"image2-6","status":"inherit","uploaded_to":722342,"date":"2020-02-04 00:00:34","modified":"2020-02-04 00:00:34","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":1122,"height":649,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2.png","medium-width":451,"medium-height":261,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2.png","medium_large-width":768,"medium_large-height":444,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2.png","large-width":1122,"large-height":649,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2.png","1536x1536-width":1122,"1536x1536-height":649,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2.png","2048x2048-width":1122,"2048x2048-height":649,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2-804x465.png","card_image-width":804,"card_image-height":465,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image2.png","wide_image-width":1122,"wide_image-height":649}},"image_position":"center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>Click on <strong>View Metadata<\/strong> and look for information on terms of use. Under Constraints, you are able to confirm that the dataset is in the public domain, and it would be permissible to use it in your analysis.<\/p>\n"},{"acf_fc_layout":"image","image":{"ID":722752,"id":722752,"title":"Image3","filename":"Image3.png","filesize":13962,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image3-4","alt":"","author":"6891","description":"","caption":"","name":"image3-4","status":"inherit","uploaded_to":722342,"date":"2020-02-04 00:01:38","modified":"2020-02-04 00:01:38","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":876,"height":210,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3.png","medium-width":464,"medium-height":111,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3.png","medium_large-width":768,"medium_large-height":184,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3.png","large-width":876,"large-height":210,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3.png","1536x1536-width":876,"1536x1536-height":210,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3.png","2048x2048-width":876,"2048x2048-height":210,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3-826x198.png","card_image-width":826,"card_image-height":198,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image3.png","wide_image-width":876,"wide_image-height":210}},"image_position":"center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>Next, on the <strong>Data<\/strong> tab, use the filter buttons in the column headers to filter the dataset to only include schools in Marion County, IN that teach students Grades 9 through 12. This will be the preliminary list of high schools your promotion needs to cover. Download the filtered dataset as a shapefile.<\/p>\n"},{"acf_fc_layout":"image","image":{"ID":722822,"id":722822,"title":"Image4","filename":"Image4.png","filesize":20444,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image4-4","alt":"","author":"6891","description":"","caption":"","name":"image4-4","status":"inherit","uploaded_to":722342,"date":"2020-02-04 13:39:12","modified":"2020-02-04 13:39:12","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":1159,"height":323,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4.png","medium-width":464,"medium-height":129,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4.png","medium_large-width":768,"medium_large-height":214,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4.png","large-width":1159,"large-height":323,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4.png","1536x1536-width":1159,"1536x1536-height":323,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4.png","2048x2048-width":1159,"2048x2048-height":323,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4-826x230.png","card_image-width":826,"card_image-height":230,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image4.png","wide_image-width":1159,"wide_image-height":323}},"image_position":"center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>Next, you will examine the data. In ArcGIS Pro, add the shapefile to a map, and open its attribute table. Sort the schools in descending order of address. Of the total 58 high schools, notice that some of the schools are located extremely close to others. Use the select tool to select one such cluster on the map. You can tell that it has 3 schools, as 3 records get selected in the attribute table.<\/p>\n"},{"acf_fc_layout":"image","image":{"ID":722832,"id":722832,"title":"Image5","filename":"Image5.png","filesize":246664,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image5-5","alt":"","author":"6891","description":"","caption":"The Marion County boundary is also included in the screenshot for reference","name":"image5-5","status":"inherit","uploaded_to":722342,"date":"2020-02-04 13:40:49","modified":"2020-02-04 13:42:01","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":1290,"height":843,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5.png","medium-width":399,"medium-height":261,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5.png","medium_large-width":768,"medium_large-height":502,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5.png","large-width":1290,"large-height":843,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5.png","1536x1536-width":1290,"1536x1536-height":843,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5.png","2048x2048-width":1290,"2048x2048-height":843,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5-712x465.png","card_image-width":712,"card_image-height":465,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image5.png","wide_image-width":1290,"wide_image-height":843}},"image_position":"center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>One of those schools is named \u201cArea 31 Career &amp; Tech Center\u201d and has an enrollment of 0. Clearly, it is not a high school, and apparently a case of the facility being used as a career and tech center (presumably after school hours). Delete this location from the list.<\/p>\n<p>The other two are \u201cBen Davis High School\u201d and \u201cBen Davis Ninth Grade Center\u201d. From the Start Grade and End Grade columns, it is evident that the Ninth Grade Center serves only Ninth Grade students and the High School serves Grades 10 through 12. From the map, it appears that they are part of the same facility or school building.<\/p>\n"},{"acf_fc_layout":"image","image":{"ID":722842,"id":722842,"title":"Image6","filename":"Image6.png","filesize":109833,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image6-4","alt":"","author":"6891","description":"","caption":"","name":"image6-4","status":"inherit","uploaded_to":722342,"date":"2020-02-04 13:43:44","modified":"2020-02-04 13:43:44","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":1292,"height":573,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6.png","medium-width":464,"medium-height":206,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6.png","medium_large-width":768,"medium_large-height":341,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6.png","large-width":1292,"large-height":573,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6.png","1536x1536-width":1292,"1536x1536-height":573,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6.png","2048x2048-width":1292,"2048x2048-height":573,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6-826x366.png","card_image-width":826,"card_image-height":366,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image6.png","wide_image-width":1292,"wide_image-height":573}},"image_position":"center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>Since you need school locations to plan direct outreach and marketing, it makes sense to treat this as one single location high school that needs to be covered, rather than two. Merge the Ninth grade center with the high school using these steps:<\/p>\n<ul>\n<li>Add the Ninth Grade Center enrollment to the High School enrollment figure.<\/li>\n<li>Add an attribute to the table for \u201cNotes\u201d and add a Note to ensure the Ninth Grade Center is not overlooked.<\/li>\n<li>Delete the Ninth Grade Center record.<\/li>\n<\/ul>\n"},{"acf_fc_layout":"image","image":{"ID":722852,"id":722852,"title":"Image7","filename":"Image7.png","filesize":48467,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image7-3","alt":"","author":"6891","description":"","caption":"","name":"image7-3","status":"inherit","uploaded_to":722342,"date":"2020-02-04 13:46:14","modified":"2020-02-04 13:46:14","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":1258,"height":250,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7.png","medium-width":464,"medium-height":92,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7.png","medium_large-width":768,"medium_large-height":153,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7.png","large-width":1258,"large-height":250,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7.png","1536x1536-width":1258,"1536x1536-height":250,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7.png","2048x2048-width":1258,"2048x2048-height":250,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7-826x164.png","card_image-width":826,"card_image-height":164,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image7.png","wide_image-width":1258,"wide_image-height":250}},"image_position":"center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>Continue cleaning up the dataset by following these steps:<\/p>\n<ul>\n<li>Review the other clusters and consolidate separate schools that are located at the same site.<\/li>\n<li>Sort the table in ascending order of enrollment and delete any other sites with an enrollment of 0.<\/li>\n<li>Lastly, delete or hide columns that will not be needed in your analysis \u2013 for example LATITUDE, LONGITUDE, Country, and VAL_DATE.<\/li>\n<\/ul>\n<p>After these data cleanup steps are complete, the dataset has 42 high school sites.<\/p>\n"},{"acf_fc_layout":"image","image":{"ID":722862,"id":722862,"title":"Image8","filename":"Image8.png","filesize":172380,"url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8.png","link":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\/image8-3","alt":"","author":"6891","description":"","caption":"","name":"image8-3","status":"inherit","uploaded_to":722342,"date":"2020-02-04 13:47:34","modified":"2020-02-04 13:47:34","menu_order":0,"mime_type":"image\/png","type":"image","subtype":"png","icon":"https:\/\/www.esri.com\/arcgis-blog\/wp-includes\/images\/media\/default.png","width":1098,"height":666,"sizes":{"thumbnail":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8-213x200.png","thumbnail-width":213,"thumbnail-height":200,"medium":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8.png","medium-width":430,"medium-height":261,"medium_large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8.png","medium_large-width":768,"medium_large-height":466,"large":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8.png","large-width":1098,"large-height":666,"1536x1536":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8.png","1536x1536-width":1098,"1536x1536-height":666,"2048x2048":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8.png","2048x2048-width":1098,"2048x2048-height":666,"card_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8-767x465.png","card_image-width":767,"card_image-height":465,"wide_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2020\/02\/Image8.png","wide_image-width":1098,"wide_image-height":666}},"image_position":"center","orientation":"horizontal","hyperlink":""},{"acf_fc_layout":"content","content":"<p>It&#8217;s rare to find data that&#8217;s already perfectly formatted for your needs, but the work you do to prepare and clean a dataset also gives you better understanding for the analysis you&#8217;re embarking on. In <a href=\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/no-data-no-problem-leverage-living-atlas-spatial-joins-and-data-enrichment\/\">another blog post<\/a>, this prepared schools dataset is used to enrich a boundary feature layer for use in a territory design analysis. You can read about the analysis in the Learn ArcGIS Lesson <a href=\"https:\/\/learn.arcgis.com\/en\/projects\/balance-territories-for-college-recruiters\/\" target=\"_blank\" rel=\"noopener\">Balance Territories for College Recruiters<\/a>.<\/p>\n"}],"authors":[{"ID":6891,"user_firstname":"Debashish","user_lastname":"Ghosh","nickname":"Debashish Ghosh","user_nicename":"dghosh","display_name":"Debashish Ghosh","user_email":"dghosh@esri.com","user_url":"","user_registered":"2018-03-02 00:18:57","user_description":"I am a Product Engineer and Writer at Esri, focused primarily on the Business Analyst applications for web, mobile and desktop.","user_avatar":"<img data-del=\"avatar\" src='https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2019\/10\/InColorado-213x200.png' class='avatar pp-user-avatar avatar-96 photo ' height='96' width='96'\/>"}],"related_articles":"","card_image":false,"wide_image":false},"yoast_head":"<!-- This site is optimized with the Yoast SEO Premium plugin v25.9 (Yoast SEO v25.9) - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Data selection and preparation<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Data selection and preparation\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\" \/>\n<meta property=\"og:site_name\" content=\"ArcGIS Blog\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/esrigis\/\" \/>\n<meta property=\"article:modified_time\" content=\"2020-02-23T15:05:45+00:00\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:site\" content=\"@ESRI\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":[\"Article\",\"BlogPosting\"],\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation#article\",\"isPartOf\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\"},\"author\":{\"name\":\"Debashish Ghosh\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/e1b87363d6712e909b8ab06534d4e65d\"},\"headline\":\"Data selection and preparation\",\"datePublished\":\"2020-02-14T16:33:33+00:00\",\"dateModified\":\"2020-02-23T15:05:45+00:00\",\"mainEntityOfPage\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\"},\"wordCount\":4,\"publisher\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#organization\"},\"keywords\":[\"ArcGIS Hub\",\"ArcGIS Pro\"],\"articleSection\":[\"Data Management\"],\"inLanguage\":\"en-US\"},{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\",\"name\":\"Data selection and preparation\",\"isPartOf\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#website\"},\"datePublished\":\"2020-02-14T16:33:33+00:00\",\"dateModified\":\"2020-02-23T15:05:45+00:00\",\"breadcrumb\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation\"]}]},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.esri.com\/arcgis-blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Data selection and preparation\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#website\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/\",\"name\":\"ArcGIS Blog\",\"description\":\"Get insider info from Esri product teams\",\"publisher\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#organization\"},\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.esri.com\/arcgis-blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Organization\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#organization\",\"name\":\"Esri\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/\",\"logo\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png\",\"contentUrl\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png\",\"width\":400,\"height\":400,\"caption\":\"Esri\"},\"image\":{\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/\"},\"sameAs\":[\"https:\/\/www.facebook.com\/esrigis\/\",\"https:\/\/x.com\/ESRI\",\"https:\/\/www.linkedin.com\/company\/5311\/\"]},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/e1b87363d6712e909b8ab06534d4e65d\",\"name\":\"Debashish Ghosh\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2019\/10\/InColorado-213x200.png\",\"contentUrl\":\"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2019\/10\/InColorado-213x200.png\",\"caption\":\"Debashish Ghosh\"},\"description\":\"I am a Product Engineer and Writer at Esri, focused primarily on the Business Analyst applications for web, mobile and desktop.\",\"url\":\"https:\/\/www.esri.com\/arcgis-blog\/author\/dghosh\"}]}<\/script>\n<!-- \/ Yoast SEO Premium plugin. -->","yoast_head_json":{"title":"Data selection and preparation","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation","og_locale":"en_US","og_type":"article","og_title":"Data selection and preparation","og_url":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation","og_site_name":"ArcGIS Blog","article_publisher":"https:\/\/www.facebook.com\/esrigis\/","article_modified_time":"2020-02-23T15:05:45+00:00","twitter_card":"summary_large_image","twitter_site":"@ESRI","schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":["Article","BlogPosting"],"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation#article","isPartOf":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation"},"author":{"name":"Debashish Ghosh","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/e1b87363d6712e909b8ab06534d4e65d"},"headline":"Data selection and preparation","datePublished":"2020-02-14T16:33:33+00:00","dateModified":"2020-02-23T15:05:45+00:00","mainEntityOfPage":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation"},"wordCount":4,"publisher":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#organization"},"keywords":["ArcGIS Hub","ArcGIS Pro"],"articleSection":["Data Management"],"inLanguage":"en-US"},{"@type":"WebPage","@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation","url":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation","name":"Data selection and preparation","isPartOf":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#website"},"datePublished":"2020-02-14T16:33:33+00:00","dateModified":"2020-02-23T15:05:45+00:00","breadcrumb":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation"]}]},{"@type":"BreadcrumbList","@id":"https:\/\/www.esri.com\/arcgis-blog\/products\/arcgis-pro\/data-management\/data-selection-and-preparation#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.esri.com\/arcgis-blog\/"},{"@type":"ListItem","position":2,"name":"Data selection and preparation"}]},{"@type":"WebSite","@id":"https:\/\/www.esri.com\/arcgis-blog\/#website","url":"https:\/\/www.esri.com\/arcgis-blog\/","name":"ArcGIS Blog","description":"Get insider info from Esri product teams","publisher":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#organization"},"potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.esri.com\/arcgis-blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Organization","@id":"https:\/\/www.esri.com\/arcgis-blog\/#organization","name":"Esri","url":"https:\/\/www.esri.com\/arcgis-blog\/","logo":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/","url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png","contentUrl":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2018\/04\/Esri.png","width":400,"height":400,"caption":"Esri"},"image":{"@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/logo\/image\/"},"sameAs":["https:\/\/www.facebook.com\/esrigis\/","https:\/\/x.com\/ESRI","https:\/\/www.linkedin.com\/company\/5311\/"]},{"@type":"Person","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/e1b87363d6712e909b8ab06534d4e65d","name":"Debashish Ghosh","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.esri.com\/arcgis-blog\/#\/schema\/person\/image\/","url":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2019\/10\/InColorado-213x200.png","contentUrl":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2019\/10\/InColorado-213x200.png","caption":"Debashish Ghosh"},"description":"I am a Product Engineer and Writer at Esri, focused primarily on the Business Analyst applications for web, mobile and desktop.","url":"https:\/\/www.esri.com\/arcgis-blog\/author\/dghosh"}]}},"text_date":"February 14, 2020","author_name":"Debashish Ghosh","author_page":"https:\/\/www.esri.com\/arcgis-blog\/author\/dghosh","custom_image":"https:\/\/www.esri.com\/arcgis-blog\/app\/uploads\/2025\/08\/Newsroom-Keyart-Wide-1920-x-1080.jpg","primary_product":"ArcGIS Pro","tag_data":[{"term_id":173712,"name":"ArcGIS Hub","slug":"arcgis-hub","term_group":0,"term_taxonomy_id":173712,"taxonomy":"post_tag","description":"","parent":0,"count":39,"filter":"raw"},{"term_id":42181,"name":"ArcGIS Pro","slug":"arcgis-pro","term_group":0,"term_taxonomy_id":42181,"taxonomy":"post_tag","description":"","parent":0,"count":323,"filter":"raw"}],"category_data":[{"term_id":23851,"name":"Data Management","slug":"data-management","term_group":0,"term_taxonomy_id":23851,"taxonomy":"category","description":"","parent":0,"count":921,"filter":"raw"}],"product_data":[{"term_id":36811,"name":"ArcGIS Hub","slug":"arcgis-hub","term_group":0,"term_taxonomy_id":36811,"taxonomy":"product","description":"","parent":36591,"count":219,"filter":"raw"},{"term_id":36561,"name":"ArcGIS Pro","slug":"arcgis-pro","term_group":0,"term_taxonomy_id":36561,"taxonomy":"product","description":"","parent":0,"count":2038,"filter":"raw"}],"primary_product_link":"https:\/\/www.esri.com\/arcgis-blog\/?s=#&products=arcgis-pro","_links":{"self":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/blog\/722342","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/blog"}],"about":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/types\/blog"}],"author":[{"embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/users\/6891"}],"replies":[{"embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/comments?post=722342"}],"version-history":[{"count":0,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/blog\/722342\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/media?parent=722342"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/categories?post=722342"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/tags?post=722342"},{"taxonomy":"industry","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/industry?post=722342"},{"taxonomy":"product","embeddable":true,"href":"https:\/\/www.esri.com\/arcgis-blog\/wp-json\/wp\/v2\/product?post=722342"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}