liushuaicare/satellite-image-deep-learning

Introduction

This document primarily lists resources for performing deep learning (DL) on satellite imagery. To a lesser extent, machine learning (ML, e.g. random forests, stochastic gradient descent) is also discussed, as are classical image processing techniques.

Top links

Table of contents

Datasets

Sentinel

Landsat

Worldview-3

Kaggle

Kaggle hosts several large satellite image datasets (> 1 GB). A list of general image datasets is here. A list of land-use datasets is here. The Kaggle blog is an interesting read.

Kaggle - Deepsat - classification challenge

Each sample image is a patch size-normalized to 28x28 pixels, with 4 bands: red, green, blue and near infrared. The training and test labels are one-hot encoded 1x6 vectors. The data are in MATLAB .mat format. JPEG?

  • Sat4: 500,000 image patches covering four broad land cover classes - barren land, trees, grassland and a class that consists of all land cover classes other than the above three. Example notebook
  • Sat6: 405,000 image patches, each of size 28x28, covering 6 land cover classes - barren land, trees, grassland, roads, buildings and water bodies.
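
As a rough illustration of the patch layout described above, the sketch below builds a synthetic batch shaped like the Sat6 data (28x28 pixels, 4 bands, one-hot 1x6 labels) and decodes it with NumPy. The samples-first array layout, the band order and the class order are assumptions taken from the description above, not from the dataset files. The real .mat files could be read with scipy.io.loadmat, but their variable names are not given here, so no file is loaded.

```python
import numpy as np

# Assumed class order, copied from the Sat6 description above.
SAT6_CLASSES = ["barren land", "trees", "grassland",
                "roads", "buildings", "water bodies"]

def decode_labels(one_hot):
    """Convert one-hot label rows of shape (n, n_classes) to class indices."""
    return np.argmax(one_hot, axis=1)

def split_bands(patches):
    """Split (n, 28, 28, 4) patches into red, green, blue and NIR arrays."""
    red, green, blue, nir = (patches[..., i] for i in range(4))
    return red, green, blue, nir

# Synthetic stand-in for real data: 5 random patches with 8-bit pixel values.
rng = np.random.default_rng(0)
patches = rng.integers(0, 256, size=(5, 28, 28, 4), dtype=np.uint8)
labels = np.eye(6, dtype=np.uint8)[[0, 3, 5, 1, 1]]  # one-hot rows

class_ids = decode_labels(labels)
class_names = [SAT6_CLASSES[i] for i in class_ids]
red, green, blue, nir = split_bands(patches)
```

Keeping the bands as separate arrays makes it straightforward to feed only RGB to a model pretrained on 3-channel images, or to use the near-infrared band for band-ratio features.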

Kaggle - Amazon from space - classification challenge

Kaggle - DSTL - segmentation challenge

Kaggle - Airbus Ship Detection Challenge

Kaggle - Draper - place images in order of time

Kaggle - other

Alternative datasets

There are a variety of datasets suitable for land classification problems.

UC Merced

AWS datasets

Quilt

  • Several people have uploaded datasets to Quilt

Google Earth Engine

Weather Datasets

Online computing resources

Generally a GPU is required for DL. Google's Colab is free, but compute time is limited (12 hours) and sessions are somewhat non-persistent.

Kaggle

  • Free to use
  • GPU Kernels (may run for 1 hour which limits usefulness?)
  • Tensorflow, pytorch & fast.ai available
  • Advantage that many datasets are already available
  • Read

Clouderizer

  • https://clouderizer.com/
  • Clouderizer is a cloud computing management service that takes care of installing the required packages on a cloud computing instance (such as Amazon AWS or Google Colab). It is free for 200 hours per month (Robbie plan) and does not require a credit card to sign up.
  • Run projects locally, on cloud or both.
  • SSH terminal, Jupyter Notebooks and Tensorboard are securely accessible from Clouderizer Web Console.

AWS

Microsoft Azure

Google

  • ML engine - sklearn, tensorflow, keras
  • Colaboratory (notebooks with a free GPU backend for up to 12 hours at a time)
  • Tensorflow available
  • pytorch can be installed, useful articles

Floydhub

  • https://www.floydhub.com/
  • Cloud GPUs
  • Jupyter Notebooks
  • Tensorboard
  • Version Control for DL
  • Deploy Models as REST APIs
  • Public Datasets

Paperspace

Crestle

Salamander

Interesting DL projects

Raster Vision by Azavea

RoboSat

RoboSat.Pink

DeepOSM

DeepNetsForEO - segmentation

Skynet-data

Production

Custom REST API

Tensorflow Serving

TensorFlow Serving makes it easy to deploy new algorithms and experiments while keeping the same server architecture and APIs. Multiple models, or indeed multiple versions of the same model, can be served simultaneously. TensorFlow Serving comes with a scheduler that groups individual inference requests into batches for joint execution on a GPU.
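
To make the idea of addressing one of several served versions concrete, the sketch below builds a request for TensorFlow Serving's REST predict endpoint using only the standard library. The host, the model name `land_cover`, the version number and the input values are hypothetical, and port 8501 is assumed to be the REST port. The request is constructed but not sent, since sending needs a running server.

```python
import json
from urllib import request

def build_predict_request(host, model, instances, version=None, port=8501):
    """Build (but do not send) a POST request for the REST predict API."""
    path = f"/v1/models/{model}"
    if version is not None:
        path += f"/versions/{version}"  # target one specific model version
    url = f"http://{host}:{port}{path}:predict"
    body = json.dumps({"instances": instances}).encode("utf-8")
    return request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )

# Hypothetical model name and a tiny 2x2 single-band "image" as input.
req = build_predict_request(
    "localhost", "land_cover", [[[0.1, 0.2], [0.3, 0.4]]], version=2
)
print(req.full_url)
# With a server running, the request could be sent with request.urlopen(req).
```

Omitting `version` leaves the choice of version to the server, which is usually what a client wants unless it is comparing two versions side by side.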

Floydhub

  • Allows exposing a model via a REST API

modeldepot

Image formats & catalogues

STAC - SpatioTemporal Asset Catalog

State of the art

What are companies doing?

Online platforms for Geo analysis

  • This article discusses some of the available platforms -> TLDR Pangeo rocks
  • Pangeo - open source resources for parallel processing using Dask and Xarray http://pangeo.io/index.html
  • Descartes Labs -> access to EO imagery from a variety of providers via a Python API -> not clear which imagery is available (Airbus + others?) or what the pricing is
  • DigitalGlobe have a cloud hosted Jupyter notebook platform called GBDX. Cloud hosting means they can guarantee the infrastructure supports their algorithms, and they appear to be close/closer to deploying DL. Tutorial notebooks here. Only Sentinel-2 and Landsat data are available on the free tier.
  • Planet have a Jupyter notebook platform which can be deployed locally and requires an API key (14 days free). They have a Python wrapper (2.7..) for their REST API. No price is given for use after the 14 day trial.

Techniques

This section explores the different techniques (DL, ML & classical) people are applying to common problems in satellite imagery analysis. Classification problems are the simplest to address with DL; object detection is harder, and cloud detection is harder still (and a niche interest).

Land classification

Change detection

Image registration

Object detection

Cloud detection

  • A subset of the object detection problem, but surprisingly challenging
  • According to this article on sentinelhub, there are three popular classical algorithms that apply thresholds in multiple bands in order to identify clouds. The same article proposes using semantic segmentation combined with a CNN as a cloud classifier (excellent review paper here), but states that this requires too much compute.
  • This article compares a number of ML algorithms: random forests, stochastic gradient descent, support vector machines and a Bayesian method.
  • DL..
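
The threshold-based approach mentioned above can be sketched in a few lines. This is not one of the algorithms from the sentinelhub article; it is a toy rule, assuming reflectance values scaled to 0-1, that flags pixels which are bright in both the blue and near-infrared bands. The band choice and the threshold values are illustrative assumptions.

```python
import numpy as np

def threshold_cloud_mask(blue, nir, blue_min=0.3, nir_min=0.3):
    """Return a boolean mask that is True where a pixel is bright in both bands.

    blue, nir: arrays of reflectance in [0, 1] with the same shape.
    The default thresholds are illustrative, not tuned values.
    """
    blue = np.asarray(blue, dtype=float)
    nir = np.asarray(nir, dtype=float)
    return (blue > blue_min) & (nir > nir_min)

# Tiny 2x2 scene: top row bright in both bands, bottom row dark or mixed.
blue = np.array([[0.8, 0.6], [0.1, 0.5]])
nir = np.array([[0.7, 0.5], [0.4, 0.1]])

mask = threshold_cloud_mask(blue, nir)
cloud_fraction = mask.mean()  # share of pixels flagged as cloud
```

A rule like this is cheap to run but will also flag other bright surfaces such as snow or sand, which is one reason cloud detection is harder than it first appears.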

Super resolution

Pansharpening

Stereo imaging for terrain mapping & DEMs

NDVI - vegetation index
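
The normalized difference vegetation index (NDVI) is computed per pixel from the red and near-infrared bands as (NIR - Red) / (NIR + Red), giving values between -1 and 1. A minimal NumPy sketch follows, assuming both bands are arrays of non-negative reflectance. Returning 0 where both bands are zero is a choice made here to avoid division by zero, not part of the index definition.

```python
import numpy as np

def ndvi(red, nir):
    """Compute NDVI = (nir - red) / (nir + red) element-wise."""
    red = np.asarray(red, dtype=float)
    nir = np.asarray(nir, dtype=float)
    total = nir + red
    out = np.zeros_like(total)  # pixels with total == 0 stay at 0
    np.divide(nir - red, total, out=out, where=total != 0)
    return out

# Tiny 2x2 example with one all-zero pixel (bottom left).
red = np.array([[0.1, 0.4], [0.0, 0.2]])
nir = np.array([[0.5, 0.4], [0.0, 0.6]])
values = ndvi(red, nir)
```

Healthy vegetation reflects strongly in the near infrared and absorbs red light, so higher values indicate denser or healthier vegetation.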

SAR

For fun

Useful open source software

  • QGIS - Create, edit, visualise, analyse and publish geospatial information. Python scripting and plugins.
  • Orfeo toolbox - remote sensing toolbox with a Python API (just a wrapper to the C code). Supports activities such as pansharpening, ortho-rectification, image registration, image segmentation & classification. Not much documentation.

Useful References
