End-to-end sample for extracting data from Cloud Storage and loading it into BigQuery using Cloud Composer.
- Cloud Storage to BigQuery
- The Airflow operator BigQueryInsertJobOperator loads the data into BigQuery.
- Cloud Composer instance set up
- Cloud Storage bucket created
- BigQuery API enabled
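
As a rough illustration of the BigQueryInsertJobOperator load mentioned above, the sketch below builds the kind of load-job configuration that operator accepts. It is not taken from this repository's `common_layer.py`. The helper names `build_load_config` and `make_load_task`, the CSV defaults, and the schema autodetection are assumptions made for the example. The dictionary keys follow the BigQuery `jobs.insert` load configuration.

```python
from typing import Any, Dict, List


def build_load_config(
    source_uris: List[str],
    project_id: str,
    dataset_id: str,
    table_id: str,
    source_format: str = "CSV",
    write_disposition: str = "WRITE_APPEND",
) -> Dict[str, Any]:
    """Build a BigQuery load-job configuration (hypothetical helper)."""
    load: Dict[str, Any] = {
        "sourceUris": source_uris,
        "destinationTable": {
            "projectId": project_id,
            "datasetId": dataset_id,
            "tableId": table_id,
        },
        "sourceFormat": source_format,
        "writeDisposition": write_disposition,
        # Assumption: let BigQuery infer the schema instead of declaring it.
        "autodetect": True,
    }
    if source_format == "CSV":
        # Assumption: the landing file has a single header row.
        load["skipLeadingRows"] = 1
    return {"load": load}


def make_load_task(config: Dict[str, Any], task_id: str = "load_to_bigquery"):
    """Wrap the configuration in a BigQueryInsertJobOperator task."""
    # Imported lazily so build_load_config can be used without Airflow installed.
    from airflow.providers.google.cloud.operators.bigquery import (
        BigQueryInsertJobOperator,
    )

    return BigQueryInsertJobOperator(task_id=task_id, configuration=config)
```

Keeping the configuration in a plain function makes it easy to check the job settings in a unit test before the DAG is deployed to Composer.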
.
├── README.md
└── data-ingestion
    ├── asset
    │   └── dag_image.png
    ├── cloudbuild.yaml
    ├── dags
    │   └── common_layer.py
    ├── data
    │   └── common_layer
    │       └── config
    │           ├── common_layer_config_defaults.yml
    │           └── common_layer_config_ingestion_demo.yml
    └── plugins
        └── includes
            ├── entities.py
            ├── exceptions.py
            ├── loggers.py
            └── utils.py
- Place a file in the landing bucket in Google Cloud Storage and update the configuration file accordingly.
- Once the DAG (Directed Acyclic Graph) is triggered, the GCS sensor picks up the file and the DAG begins loading the data into the BigQuery table.
- Please refer to the screenshot below to see how the DAG appears during execution:
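
The sensor-then-load flow described in the steps above can be sketched roughly as follows. This is a minimal illustration, not the DAG shipped in `dags/common_layer.py`. The function names `split_gcs_uri` and `build_dag`, the DAG id, the task ids, and the CSV load settings are all assumptions. The `schedule=None` argument assumes Airflow 2.4 or later; older versions use `schedule_interval`.

```python
from datetime import datetime
from typing import Tuple
from urllib.parse import urlparse


def split_gcs_uri(uri: str) -> Tuple[str, str]:
    """Split 'gs://bucket/path/to/file' into (bucket, object_name)."""
    parsed = urlparse(uri)
    if parsed.scheme != "gs" or not parsed.netloc or not parsed.path.strip("/"):
        raise ValueError(f"Not a valid GCS object URI: {uri!r}")
    return parsed.netloc, parsed.path.lstrip("/")


def build_dag(landing_uri: str, project_id: str, dataset_id: str, table_id: str):
    """Build a two-task DAG: wait for the file, then load it (hypothetical)."""
    # Imported lazily so split_gcs_uri can be tested without Airflow installed.
    from airflow import DAG
    from airflow.providers.google.cloud.operators.bigquery import (
        BigQueryInsertJobOperator,
    )
    from airflow.providers.google.cloud.sensors.gcs import GCSObjectExistenceSensor

    bucket, object_name = split_gcs_uri(landing_uri)
    with DAG(
        dag_id="gcs_to_bigquery_sketch",  # hypothetical DAG id
        start_date=datetime(2024, 1, 1),
        schedule=None,  # run only when triggered
        catchup=False,
    ) as dag:
        # Blocks until the landing file exists in the bucket.
        wait_for_file = GCSObjectExistenceSensor(
            task_id="wait_for_file", bucket=bucket, object=object_name
        )
        # Submits a BigQuery load job for the file the sensor found.
        load_file = BigQueryInsertJobOperator(
            task_id="load_file",
            configuration={
                "load": {
                    "sourceUris": [landing_uri],
                    "destinationTable": {
                        "projectId": project_id,
                        "datasetId": dataset_id,
                        "tableId": table_id,
                    },
                    "sourceFormat": "CSV",  # assumption: CSV with a header row
                    "skipLeadingRows": 1,
                    "autodetect": True,
                    "writeDisposition": "WRITE_APPEND",
                }
            },
        )
        wait_for_file >> load_file
    return dag
```

For example, `split_gcs_uri("gs://my-landing-bucket/incoming/demo.csv")` yields the bucket `my-landing-bucket` and the object `incoming/demo.csv`, which are the two values the sensor needs.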