-
Pintu
- Indonesia
-
05:40
(UTC +07:00) - https://leetcode.com/user0431U/
Lists (1)
Sort Name ascending (A-Z)
Stars
An Awesome List of Open-Source Data Engineering Projects
Implementing best practices for PySpark ETL jobs and applications.
Data Engineering Practice Problems
Data Engineering Handbook for beginners and everyone
Chronon is a data platform for serving for AI/ML applications.
Repository with code examples of mlflow
Singer.io Tap for MongoDB - PipelineWise compatible
Data pipeline for uploading, preprocessing, and visualising COVID19 data
The Trino (https://trino.io/) adapter plugin for dbt (https://getdbt.com)
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
Demo showing how the dlt load_info can be used to create a data lineage overview.
Free MLOps course from DataTalks.Club
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Google Machine Learning for Solutions Architects, published by Packt
Data Engineering in Bioinformatics, published by Packt
AI for DevOps and Site Reliability Engineering, published by Packt
Mastering Python Design Patterns, Third Edition by Packt Publishing
Polars Cookbook, Published by Packt
Building ETL Pipelines with Python
Best Practices on Recommendation Systems
An open-source project dedicated to constructing robust data pipelines and scalable software infrastructure. We leverage industry-standard tools favored by developers to enhance efficiency and reli…
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
A template repository for deploying DBT to Cloud Run
All-in-one Modern Data Stack (MDS) in a box
Food for thoughts around data contracts
Template for a data contract used in a data mesh.