ArcGIS Pro

New! Probabilities in Forest-based and Boosted Classification in ArcGIS Pro 3.3

New in ArcGIS Pro 3.3, probabilities in Forest-based and Boosted Classification in the Spatial Statistics toolbox. In addition to providing predicted categories as classification output, these algorithms now provide accompanying probabilities for each of the categories predicted by the model. Probabilities can tell you more about the likelihoods of individual categories, as well as confidence in individual predictions.

Example Demo

The data for this example comes from FEMA’s National Risk Index and the State of California’s historical wildfire records. This example is for illustrative purposes only.

Below is a map of census tracts in California, shaded by whether they historically were subject to at least one wildfire annually.

Map of California Census Tracts showing historical annual wildfires

Suppose you want to create a model to predict the risk of experiencing an annual wildfire, using other factors about the census tract. For example: drought frequency, strong wind frequency, lightning frequency, winter storm frequency, and information on agricultural land.

The Forest-based and Boosted Classification and Regression tool is set up below, with the new parameter Include All Prediction Probabilities enabled.

Forest-based and Boosted Classification and Regression tool in the Spatial Statistics toolbox set up to predict a categorical variable and include all probabilities.

The output predictions of the model are below. The census tracts that the model predicted would have a wildfire are in orange. This is a simple binary yes or no, which was the primary output of this tool prior to ArcGIS Pro 3.3.

Map of California Census Tracts and their wildfire classification prediction.

Now, the new probability output gives more granularity to that prediction. Below is the probability of the wildfire category mapped.

Map of California census tracts and their associated wildfire probability from the model

The areas in the lightest shade have the lowest probability of having at least one wildfire based on the prediction, and the darkest red have the highest probabilities. This provides a lot more power to categorical predictions. As an analyst, you will treat a wildfire probability of 5% vs. 40% vs. 80% or 90% differently.

choropleth map legend showing the scale of probabilties

Below is a map of observed historical wildfires overlaid with the probabilities. The shades of darkest red on the probabilities map line up with the areas of observed wildfires.

Map of California census tracts colored by wildfire probability. Overlaid with historical wildfire polygons.

As drought, weather patterns, and land use continue to change in the future, new probabilities could be calculated and assist allocation of resources.

This new enhancement to the Forest-based and Boosted Classification and Regression tool in ArcGIS Pro 3.3 brings more detail to categorical predictions and allows for more robust evaluation and actionability of your prediction results.

Learn More

Spatial Statistics Resources

How Forest-based and Boosted Classification and Regression works

Forest-based and Boosted Classification and Regression documentation

About the author

Catherine McSorley

Catherine McSorley is a Product Engineer on the Spatial Statistics Team at Esri.

Connect:
Subscribe
Notify of
0 Comments
Inline Feedbacks
View all comments

Next Article

Mahoning County Public Health fights substance use with the Opioid Epidemic Outreach solution

Read this article