pyiceberg is a Python library for programmatic access to Iceberg table metadata and to table data stored in Iceberg format. It is a Python implementation of the Iceberg table spec. Documentation is available at https://pyiceberg.apache.org/.
pyiceberg is currently in development. For development and testing purposes, the best way to install the library is to perform the following steps:
git clone https://github.com/apache/iceberg.git
cd iceberg/python
pip install -e .
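After the editable install, it can be useful to confirm that the package is visible to the active interpreter. The snippet below is a minimal sketch that uses only the standard library. The helper name `installed_version` is hypothetical, and the distribution name "pyiceberg" is an assumption that may differ from what the project's packaging metadata declares.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Assumption: the distribution is registered as "pyiceberg". If this prints the
# "not installed" message after a successful install, check the name declared
# in the project's packaging metadata.
found = installed_version("pyiceberg")
if found:
    print(f"pyiceberg version: {found}")
else:
    print("pyiceberg is not installed in this environment")
```

Run it with the same interpreter that ran pip, since an editable install is only visible inside the environment where it was performed.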
Development is managed with Poetry, which provides a virtual environment for the project:
poetry shell
make install
make test
For more information, refer to the Manage environments section of the Poetry documentation.
Tests are also run through Poetry. Install the project with the pyarrow extra, then run pytest:
poetry install -E pyarrow
poetry run pytest