Docker Compose and Google Colab demo for building a CDC pipeline with Delta Lake

masfworld/cdc_deltaLake

PostgreSQL (Debezium) - Kafka - Spark Delta Lake

Description

This project is a demo for testing a Change Data Capture (CDC) pipeline. All of the infrastructure runs in Docker.

Features

  • PostgreSQL as the legacy database
  • Debezium as the Change Data Capture connector
  • Kafka to ingest change events from Debezium
  • ksqlDB to transform Avro messages into JSON
  • Spark with Delta Lake to manage change events from the legacy database
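
To make the last step of the list above more concrete, here is a minimal sketch of how a stream of Debezium-style change events can be folded into a current-state table. It is not the repository's code. It is plain Python with no Spark dependency, and it only illustrates the upsert/delete logic that a Delta Lake merge would perform. The function name `apply_change_events`, the `id` key column, and the sample rows are assumptions made for illustration. The `op`, `before`, and `after` fields follow the Debezium change event envelope (`c` = create, `r` = snapshot read, `u` = update, `d` = delete).

```python
from typing import Any, Dict, Iterable, Optional

Row = Dict[str, Any]


def apply_change_events(
    events: Iterable[Dict[str, Any]],
    key: str = "id",  # assumed primary-key column, not taken from the repo
    state: Optional[Dict[Any, Row]] = None,
) -> Dict[Any, Row]:
    """Replay Debezium-style events onto an in-memory table keyed by `key`."""
    table = {} if state is None else dict(state)
    for event in events:
        op = event["op"]
        if op in ("c", "r", "u"):
            # Inserts, snapshot reads, and updates all upsert the "after" image.
            row = event["after"]
            table[row[key]] = row
        elif op == "d":
            # Deletes carry the old row in "before"; drop it if present.
            row = event["before"]
            table.pop(row[key], None)
        else:
            # Other operation types are out of scope for this sketch.
            raise ValueError(f"unsupported op: {op!r}")
    return table


# Hypothetical sample events, already decoded from JSON.
events = [
    {"op": "c", "before": None, "after": {"id": 1, "name": "alice"}},
    {"op": "c", "before": None, "after": {"id": 2, "name": "bob"}},
    {"op": "u", "before": {"id": 1, "name": "alice"},
     "after": {"id": 1, "name": "alicia"}},
    {"op": "d", "before": {"id": 2, "name": "bob"}, "after": None},
]

current = apply_change_events(events)
print(current)  # {1: {'id': 1, 'name': 'alicia'}}
```

In a Spark job the same decision (upsert on `c`/`r`/`u`, delete on `d`) would typically be expressed as a merge against the Delta table rather than a Python dictionary. The events must be applied in the order they were produced for the final state to be correct.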
