Stars
Pyspark RDD, DataFrame and Dataset Examples in Python language
Create animated bar chart races in Python with matplotlib
Tutorials accompanying the "data science roadmap" picture.
Roadmap to becoming a data engineer in 2021
Open data: more than 50 financial datasets (mainly Taiwan stocks), updated daily. https://finmind.github.io/
The "Python Machine Learning (3rd edition)" book code repository
Python machine learning tutorial
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Command-line program to download videos from YouTube.com and other video sites
Labs and demos for courses for GCP Training (http://cloud.google.com/training).
Apache Beam is a unified programming model for Batch and Streaming data processing.
Transpile curl commands into Python, JavaScript and 27 other languages
Mastering-Python-Design-Patterns-Second-Edition, published by Packt
TaiwanSparkUserGroup / spark-programming-guide-zh-tw
Forked from aiyanbo/spark-programming-guide-zh-cn
Spark Programming Guide, Traditional Chinese edition
A repository storing all slides used in Triton Ho's public presentations and courses.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
CKIP Neural Chinese Word Segmentation, POS Tagging, and NER
PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer
Original Apollo 11 Guidance Computer (AGC) source code for the command and lunar modules.
The easy-to-use open source Business Intelligence and Embedded Analytics tool that lets everyone work with data 📊