Skip to content

cgliang/quickstart-datalake-47lining

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

quickstart-datalake-47lining

Data Lake Foundation on the AWS Cloud

This Quick Start deploys a data lake foundation that integrates Amazon Web Services (AWS) services such as Amazon Simple Storage Service (Amazon S3), Amazon Redshift, Amazon Kinesis, Amazon Athena, Amazon Elasticsearch Service (Amazon ES), and Amazon QuickSight.

The data lake foundation uses these AWS services to provide data submission, ingest processing, dataset management, data transformation, aggregation, and analysis, search, publishing, and visualization capabilities. Once this foundation is in place, you may choose to augment the data lake with ISV and SaaS tools.

The deployment also includes an optional wizard and a sample dataset that is loaded into Amazon Redshift and Kinesis streams to demonstrate data lake capabilities.

The AWS CloudFormation templates included with the Quick Start automate the following:

  • Deploying the data lake foundation into a new virtual private cloud (VPC)
  • Deploying the data lake foundation into an existing VPC in your AWS account

You can also use the AWS CloudFormation templates as a starting point for your own implementation.

Quick Start architecture for data lake foundation on AWS

For architectural details, best practices, step-by-step instructions, and customization options, see the deployment guide.

To post feedback, submit feature ideas, or report bugs, use the Issues section of this GitHub repo. If you'd like to submit code for this Quick Start, please review the AWS Quick Start Contributor's Kit.

About

AWS Quick Start Team

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • HTML 45.0%
  • Python 37.9%
  • Jupyter Notebook 8.0%
  • JavaScript 5.6%
  • CSS 3.5%