- This repo contains code for the paper Deep learning enables image-based tree counting, crown segmentation and height prediction at national scale.
- See a follow-up tutorial for building a tree database with this project here.
- Added Docker image support for direct deployment (Feb 2025)
- Enhanced code structure for large-scale prediction on a local GPU machine (Feb 2025)
You need:
- A GPU machine
- Images to predict (RGB or more bands, around 20 cm spatial resolution)
You don't need:
- To install any dependencies
- To download the models (they will be auto-downloaded)
You get:
- Predictions like those in /example_1km_tile_tif/predictions/
- Flexibility to choose color bands and models for prediction
- More robust predictions via model ensembling
- The Docker image is built to run the large-scale prediction script (originally main4_large_scale_inference_transfer_other_data.py) in a deployment environment.
- Install Docker on your GPU machine and verify the installation with:
docker --version
- Pull the image:
sudo docker pull sizhuoli/tree_expert:latest
- Download the config file from this repo (config/hyperps.yaml) and modify configs as needed
- Run the image with the following command (change paths to your local folders):
sudo docker run --gpus all -it --rm -v /path_to_local_config_folder/hyperps.yaml:/app/config/hyperps.yaml -v /path_to_local_image_folder/:/app/images/ -v /path_to_local_prediction_folder/:/app/predictions/ sizhuoli/tree_expert:latest
- Check the predictions in the local prediction folder (e.g. /path_to_local_prediction_folder/)
- You may need to change file permissions on the prediction folder to access the results, e.g.:
sudo chmod -R 777 /path_to_local_prediction_folder/
- Complete working environment; note that the NVIDIA Container Toolkit is required for GPU support
- Trained models will be auto-downloaded and stored in the Docker image for prediction
- Specify color band order of your input images
- Choose which models to use for prediction (e.g. RGB, RGBNIR, RGBNIRNDVI; models are trained on different color bands)
- Ensemble prediction is supported (e.g. you get combined predictions from RGB and RGBNIR models)
- See instructions in the default config file (config/hyperps.yaml) for more details
- Don't modify file paths in hyperps.yaml, as they are set to the Docker image's file system
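To make the ensemble option above concrete, here is a minimal NumPy sketch of one plausible merging rule, a per-pixel average of each model's score map. This is an illustration only; the repo's actual ensembling scheme may combine predictions differently.

```python
import numpy as np

def ensemble_predict(pred_maps):
    """Toy ensembling: average per-pixel scores from several models
    (e.g. an RGB model and an RGBNIR model) into one, typically more
    robust, prediction map. All maps must share the same shape."""
    return np.mean(np.stack(pred_maps), axis=0)
```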
- To avoid re-downloading the models at every run, you can download them manually and mount the local model path at runtime. Follow these steps:
- Download the models manually and store them locally:
wget -P /path_to_local_model_folder/ https://sid.erda.dk/share_redirect/gS7JX84yvu
- Docker run with MODEL_SOURCE=local and mount the local model path:
sudo docker run -e MODEL_SOURCE=local -v /path_to_local_model_folder/:/app/saved_models/ --gpus all -it --rm -v /path_to_local_config_folder/hyperps.yaml:/app/config/hyperps.yaml -v /path_to_local_image_folder/:/app/images/ -v /path_to_local_prediction_folder/:/app/predictions/ sizhuoli/tree_expert:latest
Please raise an issue if you encounter any problems with the Docker image or need new functionality (e.g. fine-tuning mode, resolution change).
This is an upgrade of the original code for working with a local GPU machine.
- Config files are now more organized, see /config/hyperps_local.yaml
- Model ensembling is supported for large-scale prediction
- Instead of running
python main4_large_scale_inference_transfer_other_data.py
you now run
python main_local.py
- Tree crown delineation data now available here
- Please cite our paper if you find the data useful :)
- Acknowledgements to Styrelsen for Dataforsyning og Infrastruktur and the National Land Survey of Finland for open-sourcing the aerial images used to build the datasets
- P.S. A data corruption issue was spotted and fixed on 27/09/2024; please download the data again if you downloaded it before this date! The data is now stored on our university's data platform, ERDA.
For the Danish dataset:
- We offer image patches preprocessed in two ways: (a) patch normalization to zero mean and unit std (used in the paper); (b) raw patches with original pixel intensities.
- There are several empty patches with no crown delineations (used as negative samples for training), which can be removed
- Coordinates have been removed
https://drive.google.com/file/d/1ZNibrh6pa4-cjXLawua6L96fOKS3uwbn/view?usp=sharing
Model names:
trees - date - time - optimizer - lr - input bands - input shape - no. training tiles - segmentation loss - density loss - boundary weight (for separating individual trees) - model.h5
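As an illustration of the naming convention, the sketch below unpacks a checkpoint name into its fields. The '-' delimiter and the example filename are assumptions for illustration; check them against your actual checkpoint names.

```python
# Fields of the model-name convention, in order (see the README above).
FIELDS = [
    "prefix", "date", "time", "optimizer", "lr", "input_bands",
    "input_shape", "n_training_tiles", "seg_loss", "density_loss",
    "boundary_weight", "model",
]

def parse_model_name(filename: str) -> dict:
    """Map each '-'-separated token of a checkpoint filename to its field.
    Example filename is hypothetical."""
    stem = filename.rsplit(".", 1)[0]  # strip the .h5 extension
    return dict(zip(FIELDS, stem.split("-")))
```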
Depending on your GPU driver, you may try the following environment.yml files to set up the conda environment:
- To set up the same environment:
conda env create -f environment_trees_updated.yml
- If the above does not work, try the following to set up the basic environment and install the missing packages manually:
conda env create -f environment_trees_updated_basic.yml
conda activate tf2151full
Get ready:
- Aerial images of arbitrary size, preferably with a spatial resolution comparable to the training data (20 cm; otherwise downsampling or upsampling may be needed)
- A proper model downloaded from the link above, or fine-tuned on local data; select a model matching the color bands of your images
--- π set configs ---
config/RasterAnalysis.py
python main4_large_scale_inference_transfer_other_data.py
Example prediction can be found in /example_1km_tile_tif/, where a model trained on RGB bands was applied to a 1 km x 1 km aerial image tile captured in Denmark. You can reproduce the results by running main4_large_scale_inference_transfer_other_data.py with the provided configs (config/RasterAnalysis.py).
- Standardize input image patches channel-wise: (image - mean) / std
- For large images, predict with overlap (stride < patch width) and take the maximum prediction where patches overlap
- Upsample/downsample to match the training input resolution (20 cm)
- Fine-tune with a small local annotation dataset
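The first three points above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the repo's implementation: `predict_fn` stands in for the trained model and is assumed to return a (patch, patch) score map.

```python
import numpy as np

def standardize(patch, eps=1e-8):
    # Channel-wise (image - mean) / std over an (H, W, C) patch.
    mean = patch.mean(axis=(0, 1), keepdims=True)
    std = patch.std(axis=(0, 1), keepdims=True)
    return (patch - mean) / (std + eps)

def predict_large_image(image, predict_fn, patch=256, stride=128):
    """Slide an overlapping window (stride < patch) over an (H, W, C)
    image, standardize each tile, and keep the per-pixel maximum where
    windows overlap. Edge tiles are clamped so the borders are covered."""
    h, w, _ = image.shape
    out = np.full((h, w), -np.inf, dtype=np.float32)
    ys = sorted({min(y, h - patch) for y in range(0, h, stride)})
    xs = sorted({min(x, w - patch) for x in range(0, w, stride)})
    for y in ys:
        for x in xs:
            tile = standardize(image[y:y + patch, x:x + patch])
            pred = predict_fn(tile)  # assumed (patch, patch) score map
            np.maximum(out[y:y + patch, x:x + patch], pred,
                       out=out[y:y + patch, x:x + patch])
    return out
```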
Figure 0. Preparing your own tree crown annotation dataset. Delineate all tree crowns within each selected annotation area.
Get ready:
- A proper model downloaded from the link above, matching the color bands of your local data
- Your small local annotation dataset, prepared in the same structure as the Denmark/Finland dataset (downloadable with the link above). To convert from shapefile/geopackage to raster images, check main0_preprocessing.py
- Recommended: the pretraining data (Denmark dataset), downloadable with the link above
- A working environment set up from the yml file
--- π set configs ---
config/UNetTrainingFinetune.py
python main1-2_segcount_transfer_learning.py
Convert tree crown delineation data (in shapefile or geopackage format) to raster images for training
python main0_preprocessing.py
--- π set configs ---
config/Preprocessing.py
python main1_multitask_counting_segmentation.py
--- π set configs ---
case 1: same input spatial resolution: use config/UNetTraining.py
case 2: inputs of different spatial resolutions (only 2x coarser resolution is supported for now): use config/UNetTraining_multires.py
--- π example data for demonstration ---
example input data in: example_extracted_data/
--- β¨ major tunable hyperparameters ---
- boundary weight: determines the penalty on crown boundaries, for better separation of individual trees
- task_ratio: weighting ratio between the two losses, balancing the emphasis on both tasks during training; may differ from case to case
- normalize: ratio for augmenting data by randomly normalizing image patches
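To make the roles of the boundary weight and task_ratio concrete, here is an illustrative multitask loss in NumPy. This is a sketch under stated assumptions, not the repo's actual implementation: the segmentation term is a boundary-weighted binary cross-entropy and the density term a plain MSE.

```python
import numpy as np

def multitask_loss(y_true, y_pred, d_true, d_pred, weight_map,
                   task_ratio=1.0, eps=1e-7):
    """Illustrative combined loss: boundary-weighted BCE for crown
    segmentation plus task_ratio * MSE on the density maps.
    weight_map is > 1 on crown boundaries, so mistakes there cost more,
    encouraging separation of touching crowns."""
    y_pred = np.clip(y_pred, eps, 1 - eps)
    bce = -(y_true * np.log(y_pred) + (1 - y_true) * np.log(1 - y_pred))
    seg_loss = float((weight_map * bce).mean())
    density_loss = float(((d_true - d_pred) ** 2).mean())
    return seg_loss + task_ratio * density_loss
```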
python main2_height_prediction.py
--- π set configs ---
config/UNetTraining_CHM.py
python main3_predict_segmentation_counting.py
--- π set configs ---
config/Model_compare_multires.py
--- π Example prediction ---
See /example_extracted_data/
- segmentation result: seg_41.tif
- counting result: density_41.tif
Figure 1. Overview of the framework used to count individual trees and predict their crown area and height. a, Deep learning-based framework for individual tree counting, crown segmentation, and height prediction. Spatial locations of individual trees are incorporated in the tree density maps and the crown segmentation maps. The canopy height map (CHM) derived from LiDAR data provides pixel-wise height information, which, when available for a specific study area, can optionally be used as an additional input band for the individual tree counting and crown segmentation tasks. b, Data preparation and modeling for tree counting and crown segmentation. The manually delineated individual tree crowns are modeled as density maps for the counting task by extracting the polygon centroids. The gaps between adjacent crowns are highlighted for the separation of individual tree crowns during the training phase.
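The density-map construction described in the caption above (crown polygons reduced to centroid points for the counting task) can be sketched as follows; this toy version places unit mass at each centroid pixel, whereas the paper's pipeline may additionally smooth these points.

```python
import numpy as np

def centroid_density_map(centroids, shape):
    """Toy counting target: put unit mass at each crown-centroid pixel
    (row, col), so the map sums to the tree count."""
    dmap = np.zeros(shape, dtype=np.float32)
    for r, c in centroids:
        dmap[int(r), int(c)] += 1.0
    return dmap
```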
Figure 2. Example products from the proposed framework. a, Wall-to-wall tree count prediction for Denmark. b, Detailed examples showing the individual tree counting (second row), crown segmentation (third row), and height prediction (fourth row) for three major landscape types (deciduous forest, coniferous forest, and non-forest). c, Large-scale individual tree crown segmentation results colored by height predictions. Examples in b and c were sampled from the region indicated by the orange box in a.