Starred repositories
Bash script to get the last update time of all hive tables.
🚀 Utility classes and functions for common data science libraries
Just a small project to practice bash interacting with the AWS cli
A data engineering project (Twitter monitor app)
Code from the mCoding sample videos
Python-Application-Development-Tips-Tricks-and-Techniques [Video]
Bonus materials, exercises, and example projects for our Python tutorials
😎 Awesome lists about all kinds of interesting topics
Free Data Engineering course!
A chrome extension to deny google meet entry with just one click.
Collection of useful data science topics along with articles, videos, and code
Evaluation exercises for a Data Engineering role in the Rockies organization. Includes Python code for accessing MLB data, and SQL code to calculate summary statistics against the organizations dat…
A curated list of projects related to the reMarkable tablet
IPython and Jupyter in-depth Tutorial, first presented at PyCon 2012
JupyterLab computational environment.
A tutorial on how to get started with Presto.
A toolkit providing a uniform interface for connecting to and extracting data from a wide variety of (potentially remote) data stores (including HDFS, Hive, Presto, MySQL, etc).
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
The official home of the Presto distributed SQL query engine for big data
Pandas Cookbook, published by Packt
List of Computer Science courses with video lectures.
📚 Playground and cheatsheet for learning Python. Collection of Python scripts that are split by topics and contain code examples with explanations.
A boilerplate for writing PySpark Jobs