This repository exposes to Python most COLMAP reconstruction objects, such as Cameras and Points3D, as well as estimators for absolute and relative poses.
Wheels for Linux (Python 3.7 & 3.8) and macOS 10 & 11 (Python 3.8 & 3.9) can be install using pip:
pip install pycolmap
The wheels are automatically built and pushed to pypi at each release.
Alternatively, we explain below how to compile PyCOLMAP from source. COLMAP should first be installed as a library following the instructions. We require the latest commit of the COLMAP dev
branch. Using a previous COLMAP build might not work. Then clone the repository and its submodules:
git clone --recursive [email protected]:mihaidusmanu/pycolmap.git
And finally build PyCOLMAP:
cd pycolmap
pip install pycolmap
On Windows, we recommend to install COLMAP with vcpkg. From your vcpkg directory, run:
.\vcpkg.exe install colmap --triplet=x64-windows --head
Then set the CMAKE_TOOLCHAIN_FILE
environment variable to your vcpkg\scripts\buildsystems\vcpkg.cmake
path. For Example in powershell:
$env:CMAKE_TOOLCHAIN_FILE='C:\Workspace\vcpkg\scripts\buildsystems\vcpkg.cmake'
Finally go to the PyCOLMAP folder and run
py -m pip install ./
We can load and manipulate an existing COLMAP 3D reconstruction:
import pycolmap
reconstruction = pycolmap.Reconstruction("path/to/my/reconstruction/")
print(reconstruction.summary())
for image_id, image in reconstruction.images.items():
print(image_id, image)
for point3D_id, point3D in reconstruction.points3D.items():
print(point3D_id, point3D)
for camera_id, camera in reconstruction.cameras.items():
print(camera_id, camera)
reconstruction.write("path/to/new/reconstruction/")
reconstruction.export_PLY("path/to/new/reconstruction/reconstruction.ply")
The object API mirrors the COLMAP C++ library. The bindings support many other operations, for example:
- projecting a 3D point into an image with arbitrary camera model:
uv = camera.world_to_image(image.project(point3D.xyz))
- aligning two 3D reconstruction by their camera poses:
tfm = reconstruction1.align_poses(reconstruction2) # transforms reconstruction1 in-place
print(tfm.rotation, tfm.translation)
We provide robust RANSAC-based estimators for absolute camera pose (single-camera and multi-camera-rig), essential matrix, fundamental matrix, homography, and two-view relative pose for calibrated cameras.
All RANSAC and estimation parameters are exposed as objects that behave similarly as Python dataclasses. The RANSAC options are described in colmap/src/optim/ransac.h
and their default values are:
ransac_options = pycolmap.RANSACOptions(
max_error=4.0, # reprojection error in pixels
min_inlier_ratio=0.01,
confidence=0.9999,
min_num_trials=1000,
max_num_trials=100000,
)
For instance, to estimate the absolute pose of a query camera given 2D-3D correspondences:
# Parameters:
# - points2D: Nx2 array; pixel coordinates
# - points3D: Nx3 array; world coordinates
# - camera: pycolmap.Camera
# Optional parameters:
# - max_error_px: float; RANSAC inlier threshold in pixels (default=12.0)
# - estimation_options: dict or pycolmap.AbsolutePoseEstimationOptions
# - refinement_options: dict or pycolmap.AbsolutePoseRefinementOptions
answer = pycolmap.absolute_pose_estimation(points2D, points3D, camera, max_error_px=12.0)
# Returns: dictionary of estimation outputs
2D and 3D points are passed as Numpy arrays or lists. The options are defined in estimators/absolute_pose.cc
and can be passed as regular (nested) Python dictionaries:
pycolmap.absolute_pose_estimation(
points2D, points3D, camera,
estimation_options={'ransac': {'max_error': 12.0}},
refinement_options={'refine_focal_length': True},
)
# Parameters:
# - tvec: List of 3 floats, translation component of the pose (world to camera)
# - qvec: List of 4 floats, quaternion component of the pose (world to camera)
# - points2D: Nx2 array; pixel coordinates
# - points3D: Nx3 array; world coordinates
# - inlier_mask: array of N bool; inlier_mask[i] is true if correpondence i is an inlier
# - camera: pycolmap.Camera
# Optional parameters:
# - refinement_options: dict or pycolmap.AbsolutePoseRefinementOptions
answer = pycolmap.pose_refinement(tvec, qvec, points2D, points3D, inlier_mask, camera)
# Returns: dictionary of refinement outputs
# Parameters:
# - points2D1: Nx2 array; pixel coordinates in image 1
# - points2D2: Nx2 array; pixel coordinates in image 2
# - camera1: pycolmap.Camera of image 1
# - camera2: pycolmap.Camera of image 2
# Optional parameters:
# - max_error_px: float; RANSAC inlier threshold in pixels (default=4.0)
# - options: dict or pycolmap.RANSACOptions
answer = pycolmap.essential_matrix_estimation(points2D1, points2D2, camera1, camera2)
# Returns: dictionary of estimation outputs
answer = pycolmap.fundamental_matrix_estimation(
points2D1,
points2D2,
[max_error_px], # optional RANSAC inlier threshold in pixels
[options], # optional dict or pycolmap.RANSACOptions
)
answer = pycolmap.homography_matrix_estimation(
points2D1,
points2D2,
[max_error_px], # optional RANSAC inlier threshold in pixels
[options], # optional dict or pycolmap.RANSACOptions
)
COLMAP can also estimate a relative pose between two calibrated cameras by estimating both E and H and accounting for the degeneracies of each model.
# Parameters:
# - points2D1: Nx2 array; pixel coordinates in image 1
# - points2D2: Nx2 array; pixel coordinates in image 2
# - camera1: pycolmap.Camera of image 1
# - camera2: pycolmap.Camera of image 2
# Optional parameters:
# - max_error_px: float; RANSAC inlier threshold in pixels (default=4.0)
# - options: dict or pycolmap.TwoViewGeometryOptions
answer = pycolmap.homography_matrix_estimation(points2D1, points2D2)
# Returns: dictionary of estimation outputs
The options are defined in estimators/absolute_pose.cc
and control how each model is selected. The return dictionary contains the relative pose, inlier mask, as well as the type of camera configuration, such as degenerate or planar. This type is an instance of the enum pycolmap.TwoViewGeometry
whose values are explained in colmap/src/estimators/two_view_geometry.h
.
All estimators expect a COLMAP camera object, which can be created as follow:
camera = pycolmap.Camera(
COLMAP_CAMERA_MODEL_NAME,
IMAGE_WIDTH,
IMAGE_HEIGHT,
EXTRA_CAMERA_PARAMETERS,
)
The different camera models and their extra parameters are defined in colmap/src/base/camera_models.h
. For example for a pinhole camera:
camera = pycolmap.Camera(
model='SIMPLE_PINHOLE',
width=width,
height=height,
params=[focal_length, cx, cy],
)
Alternatively, we can also pass a camera dictionary:
camera_dict = {
'model': COLMAP_CAMERA_MODEL_NAME,
'width': IMAGE_WIDTH,
'height': IMAGE_HEIGHT,
'params': EXTRA_CAMERA_PARAMETERS_LIST
}
We provide bindings for multiple step of the standard reconstruction pipeline. They are defined in pipeline.cc
and include:
- importing an image folder into a COLMAP database
- inferring the camera parameters from the EXIF metadata of an image file
- running two-view geometric verification of matches on a COLMAP database
- triangulating points into an existing COLMAP model
- running incremental reconstruction from a COLMAP database
For an example of usage, see hloc/reconstruction.py
.
import numpy as np
import pycolmap
from PIL import Image, ImageOps
# Input should be grayscale image with range [0, 1].
with open('image.jpg', 'rb') as f:
img = Image.open(f)
img = img.convert('RGB')
img = ImageOps.grayscale(img)
img = np.array(img).astype(np.float) / 255.
# Parameters:
# - image: HxW float array
# Named parameters:
# - num_octaves: int (4)
# - octave_resolution: int (3)
# - first_octave: int (0)
# - edge_thresh: float (10)
# - peak_thresh: float (0.01)
# - upright: bool (False)
keypoints, scores, descriptors = pycolmap.extract_sift(img)
# Returns:
# - keypoints: Nx4 array; format: x (j), y (i), sigma, angle
# - scores: N array; DoG scores
# - descriptors: Nx128 array; L2-normalized descriptors
- Add documentation
- Add more detailed examples
- Add unit tests for reconstruction bindings