Skip to content

tiagotele/airflow-on-kubernetes

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 

Repository files navigation

Airflow + Spark on Kubernetes

Running Airflow with Kubernetes on Kubernetes(Minikube)

Requirements

Steps

Adding Helm repositories

helm repo add apache-airflow https://airflow.apache.org
helm repo add spark-operator https://googlecloudplatform.github.io/spark-on-k8s-operator

Creating Kubernetes on Miniube

minikube start --cpus 7 --memory 10000  --kubernetes-version=v1.22.0

Configuring cluster

Labeling Nodes

kubectl label nodes minikube nodePool=cluster

Setting cluster admin

kubectl create clusterrolebinding permissive-binding --clusterrole=cluster-admin --user=admin --user=kubelet --group=system:serviceaccounts

Installing Spark Operator on cluster

helm install my-release spark-operator/spark-operator --version 1.1.5

Installing Airflow on Kubernetes

kubectl create ns spark-jobs --dry-run=client -o yaml | kubectl apply -f -
helm install airflow apache-airflow/airflow --namespace spark-jobs  --timeout 600s

About

Local environment to play with Airflow and Kubernetes

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published