Name	Name	Last commit message	Last commit date
Latest commit History 20 Commits
spark	spark
.dockerignore	.dockerignore
.gitignore	.gitignore
README.md	README.md
docker-compose.yml	docker-compose.yml

Name

Last commit message

Last commit date

20 Commits

Spark + Iceberg Quickstart Image

This is a docker compose environment to quickly get up and running with a Spark environment and a local Iceberg catalog. It uses a postgres database as a JDBC catalog.

note: If you don't have docker installed, you can head over to the Get Docker page for installation instructions.

Usage

First, start up the spark-iceberg and postgres container by running:

docker-compose up

Next, run any of the following commands, depending on which shell you prefer to use:

docker exec -it spark-iceberg spark-shell

docker exec -it spark-iceberg spark-sql

docker exec -it spark-iceberg pyspark

docker exec -it spark-iceberg notebook

To stop the service, just run docker-compose down.

Troubleshooting & Maintenance

Resetting Catalog Data

To reset the catalog and data, remove the postgres and warehouse directories.

docker-compose down && docker-compose kill && rm -rf ./postgres && rm -rf ./warehouse

Refreshing Docker Image

The prebuilt spark image is uploaded to Dockerhub. Out of convenience, the image tag defaults to latest.

If you have an older version of the image, you might need to remove it to upgrade.

docker image rm tabulario/spark-iceberg && docker-compose pull

For more information on getting started with using Iceberg, checkout the Getting Started guide in the official docs.

Languages

Jupyter Notebook 89.8%

Dockerfile 3.2%

Java 3.1%

Shell 2.8%

Python 1.1%

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spark + Iceberg Quickstart Image

Usage

Troubleshooting & Maintenance

Resetting Catalog Data

Refreshing Docker Image

About

Releases

Packages

Languages

License

tusharchou/docker-spark-iceberg

Folders and files

Latest commit

History

Repository files navigation

Spark + Iceberg Quickstart Image

Usage

Troubleshooting & Maintenance

Resetting Catalog Data

Refreshing Docker Image

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages