May 24, 2024

New! Probabilities in Forest-based and Boosted Classification in ArcGIS Pro 3.3

By Catherine McSorley

New in ArcGIS Pro 3.3, probabilities in Forest-based and Boosted Classification in the Spatial Statistics toolbox. In addition to providing predicted categories as classification output, these algorithms now provide accompanying probabilities for each of the categories predicted by the model. Probabilities can tell you more about the likelihoods of individual categories, as well as confidence in individual predictions.

Example Demo

The data for this example comes from FEMA’s National Risk Index and the State of California’s historical wildfire records. This example is for illustrative purposes only.

Below is a map of census tracts in California, shaded by whether they historically were subject to at least one wildfire annually.

Map of California Census Tracts showing historical annual wildfires

Suppose you want to create a model to predict the risk of experiencing an annual wildfire, using other factors about the census tract. For example: drought frequency, strong wind frequency, lightning frequency, winter storm frequency, and information on agricultural land.

The Forest-based and Boosted Classification and Regression tool is set up below, with the new parameter Include All Prediction Probabilities enabled.

Forest-based and Boosted Classification and Regression tool in the Spatial Statistics toolbox set up to predict a categorical variable and include all probabilities.

The output predictions of the model are below. The census tracts that the model predicted would have a wildfire are in orange. This is a simple binary yes or no, which was the primary output of this tool prior to ArcGIS Pro 3.3.

Map of California Census Tracts and their wildfire classification prediction.

Now, the new probability output gives more granularity to that prediction. Below is the probability of the wildfire category mapped.

Map of California census tracts and their associated wildfire probability from the model

The areas in the lightest shade have the lowest probability of having at least one wildfire based on the prediction, and the darkest red have the highest probabilities. This provides a lot more power to categorical predictions. As an analyst, you will treat a wildfire probability of 5% vs. 40% vs. 80% or 90% differently.

choropleth map legend showing the scale of probabilties

Below is a map of observed historical wildfires overlaid with the probabilities. The shades of darkest red on the probabilities map line up with the areas of observed wildfires.

Map of California census tracts colored by wildfire probability. Overlaid with historical wildfire polygons.

As drought, weather patterns, and land use continue to change in the future, new probabilities could be calculated and assist allocation of resources.

This new enhancement to the Forest-based and Boosted Classification and Regression tool in ArcGIS Pro 3.3 brings more detail to categorical predictions and allows for more robust evaluation and actionability of your prediction results.

Learn More

Spatial Statistics Resources

How Forest-based and Boosted Classification and Regression works

Forest-based and Boosted Classification and Regression documentation

Catherine McSorley

Catherine McSorley is a Product Engineer on the Spatial Statistics Team at Esri.

Article Discussion:

0 Comments

Oldest

Newest

Inline Feedbacks

View all comments

ARCGIS

CAPABILITIES

BUY ARCGIS

INDUSTRIES

Support & Services

SELF-SERVICE

CONTACT US

ESRI STORIES

About Esri

About GIS

Commitment to Innovation

ArcGIS Blog

New! Probabilities in Forest-based and Boosted Classification in ArcGIS Pro 3.3

Example Demo

Learn More

Article Discussion: