Skip to content

How to automate the creation and management of data analytics environments.

License

Apache-2.0, MPL-2.0 licenses found

Licenses found

Apache-2.0
LICENSE
MPL-2.0
LICENSE.txt
Notifications You must be signed in to change notification settings

Chambras/Hashitalks2024

Repository files navigation

Terraform

Mastering Databricks environment creation in the cloud with Terraform

This project is a demo of how to create a Data Analytics environments in the cloud using Terraform. It uses Azure as the cloud provider and Ansible as the configuration management tool. It also uses a Databricks terraform provider to create all the required resources in Databricks.

Project Structure

This project has the following folders which make them easy to reuse, add or remove.

.
├── .github
│   └── workflows
├── Ansible
│   └── KafkaServer
│       └── roles
├── Infrastructure
│   ├── terraform-azure
│   └── terraform-databricks
└── Notebooks
    ├── SWIM
    │   └── python
    └── Songs
        ├── python
        └── sql

Pre-requisites

This project requires the following versions:

  • Terraform =>1.7.2
  • Azure provider 3.89.0
  • Databricks provider 1.34.0
  • Azure CLI 2.57.0

It also uses GitHub Secrets to store all required keys and secrets. The following GitHub Secrets need to be created ahead of time:

  • ARM_SUBSCRIPTION_ID - Your Azure Subscription ID.
  • ARM_CLIENT_ID - Your Azure Client ID.
  • ARM_CLIENT_SECRET - Your Azure Client Secret.
  • ARM_TENANT_ID - Your Azure Tenant ID.
  • PBLC_VM_SSH - Public SSH key of the VM.
  • PRVT_VM_SSH - Private SSH key of the VM.
  • TF_API_TOKEN - Terraform Cloud API Token.

It also needs access to FAA SWIM for the streamming part of the demo. A more complete demo and information about SWIM can be found in my article Ingest FAA SWIM content to analyze flight data

Caution

Be aware that by running this project your account will get billed.

Authors

  • Marcelo Zambrana

About

How to automate the creation and management of data analytics environments.

Resources

License

Apache-2.0, MPL-2.0 licenses found

Licenses found

Apache-2.0
LICENSE
MPL-2.0
LICENSE.txt

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published