- Atlanta, GA
- in/justin-hc
efficient_data_processing_spark Public
Forked from josephmachado/efficient_data_processing_spark
Code for "Efficient Data Processing in Spark" Course
Python Updated May 8, 2024
deequ Public
Forked from awslabs/deequ
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Scala Apache License 2.0 Updated Mar 5, 2024
AdvancedSQLPuzzles Public
Forked from smpetersgithub/AdvancedSQLPuzzles
Welcome to my GitHub repository. I hope you enjoy solving these puzzles as much as I have enjoyed creating them.
TSQL Updated Mar 4, 2024
public-apis Public
Forked from public-apis/public-apis
A collective list of free APIs
Python MIT License Updated Feb 7, 2024
Bash-Cheat-Sheet Public
Forked from RehanSaeed/Bash-Cheat-Sheet
A cheat sheet for bash commands.
MIT License Updated Dec 12, 2023
data-engineering-practice Public
Forked from danielbeach/data-engineering-practice
Data Engineering Practice Problems
Dockerfile Updated Nov 26, 2023
cli-cheat-sheet Public
Forked from milanaryal/cli-cheat-sheet
The essential tasks on the command line interface for web developers.
Updated Nov 3, 2023
DataEngineeringProject Public
Forked from damklis/DataEngineeringProject
Example end to end data engineering project.
Python MIT License Updated Dec 8, 2022
Sales Analytics Dashboard based on public datasets
Updated Jun 15, 2022
statsmodels Public
Forked from statsmodels/statsmodels
Statsmodels: statistical modeling and econometrics in Python
Python BSD 3-Clause "New" or "Revised" License Updated May 30, 2022
Repository for first Capstone project.
Jupyter Notebook Updated May 24, 2022
Docker-Fundamentals Public
Forked from team-data-science/Docker-Fundamentals
Docker Course
Python Updated May 20, 2022
data-engineer-roadmap Public
Forked from boringPpl/data-engineer-roadmap
Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups
Updated May 8, 2022
data-engineering-zoomcamp Public
Forked from DataTalksClub/data-engineering-zoomcamp
Free Data Engineering course!
Jupyter Notebook Updated Apr 28, 2022
document-streaming Public
Forked from team-data-science/document-streaming
Repository for the Document streaming capstone projects
Jupyter Notebook Updated Feb 24, 2022
aws-data-engineering Public
Forked from team-data-science/aws-data-engineering
Course Material Data Engineering on AWS Course
Python Updated Feb 4, 2022
Data-Engineering-Platform Public
Forked from feifanfiona/Data-Engineering-Platform
Data Engineering Platform Projects with MySQL, MongoDB, Neo4j, Tableau, Excel
HTML Updated Jan 29, 2022
learning-notes Public
Forked from keyvanakbary/learning-notes
Notes on books I read, talks I watch, articles I study, and papers I love
SCSS Updated Jan 15, 2022
sharetribe Public
Forked from sharetribe/sharetribe
Sharetribe Go is a source available marketplace software, also available as a hosted, no-code SaaS product. For a headless, API-first marketplace solution, check out Sharetribe Flex: https://www.sh…
Ruby Other Updated Oct 26, 2021
Impractical_Python_Projects Public
Forked from rlvaugh/Impractical_Python_Projects
Code & supporting files for chapters in book
Python Updated Oct 1, 2021
azure-data-engineering Public
Forked from team-data-science/azure-data-engineering
Python Updated Jul 1, 2021
apis-with-fastapi Public
Forked from team-data-science/apis-with-fastapi
This is the repo for the course Building APIs with FastAPI
Python Updated Jun 9, 2021
AWS-Certified-Cloud-Practitioner-Notes Public
Forked from kennethleungty/AWS-Certified-Cloud-Practitioner-Notes
Notes compiled based on AWS E-Learning lessons and transcripts
Updated Jan 3, 2021
learning-apache-spark Public
Forked from team-data-science/learning-apache-spark
Repository for Apache Spark course at Team Data Science
Jupyter Notebook Updated Oct 23, 2020