The Apache Pulsar community produces 2 docker images with each official release.
apachepulsar/pulsar
- contains the necessary components for a working Pulsar clusterapachepulsar/pulsar-all
- extends theapachepulsar/pulsar
image by adding many Pulsar connectors and offloaders
Since the 2.10.0 release, these docker images run as an unnamed, non-root user that is also part of the root group, by default. This was done to increase container security. The user is part of the root group to ensure that the container image can easily run on OpenShift and to ensure that the Pulsar process can write to configuration files.
You can build and test these docker images on your own machine by running the ./build.sh
script in this directory.
Note that you first must build the project in order to have the right dependencies in your local environment.
If you find the apachepulsar/pulsar-all
docker image too large, but you want to use a connector or an offloader,
you can easily build an image with a curated list of connectors or offloaders based on the official Apache Pulsar
images. You can use the following sample docker image as a guide:
ARG VERSION
# Load the pulsar-all image as a builder image
FROM apachepulsar/pulsar-all:${VERSION} as pulsar-all
FROM apachepulsar/pulsar:${VERSION}
# Add the cassandra connector
COPY --from=pulsar-all /pulsar/connectors/pulsar-io-cassandra-*.nar /pulsar/connectors
# Add the jcloud offloader
COPY --from=pulsar-all /pulsar/connectors/tiered-storage-jcloud-*.nar /pulsar/offloaders
NOTE: the above example uses a wildcard in the COPY
commands because argument expansion does not work for COPY
.
Assuming that you have the above Dockerfile
in your local directory and are running docker on your local host, you can
run the following command to build a custom image with the cassandra connector and the jcloud offloader.
docker build --build-arg VERSION=2.9.1 -t pulsar-custom:2.9.1 .
For reference, here are the sizes of the official 2.9.1 docker images and the custom image built from the above
Dockerfile
:
REPOSITORY | TAG | SIZE |
---|---|---|
apachepulsar/pulsar | 2.9.1 | 1.59GB |
apachepulsar/pulsar-all | 2.9.1 | 3.44GB |
pulsar-custom | 2.9.1 | 1.6GB |
Troubleshooting is harder because the docker image runs as a non-root user. For example, a non-root user won't be able to download arbitrary utilities. There are several ways to troubleshoot.
One option is to build a custom docker image that includes your preferred debugging tools. Here is an example of adding some tools to an existing docker image.
FROM apachepulsar/pulsar:2.10.0
# Switch to root user to download tools
USER 0
# Install your preferred utilities
RUN apt-get update \
&& apt-get install -y vim net-tools unzip \
&& apt-get clean \
&& rm -rf /var/lib/apt/lists/*
# Assuming you still want to run as a non root user by default
USER 10000
The remaining debug options depend on your environment. For example, if you have access to the host running your
container, you might be able to use the docker exec
command to shell into the container. By using the --user
argument, you can run as the root user.
If you're running your container on kubernetes, you can override the container's default user by setting the pod's
securityContext
.
Bitnami provides a helpful guide here: https://engineering.bitnami.com/articles/running-non-root-containers-on-openshift.html.