McKinsey & Company
Stars
A lightweight data processing framework built on DuckDB and 3FS.
Hopsworks - Data-Intensive AI platform with a Feature Store
Automated database platform for PostgreSQL® - Your own DBaaS. The open-source alternative to cloud-managed databases.
⭐️ Companies that don't have a broken hiring process
Python Backtesting library for trading strategies
Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals
🖥️ macOS status monitoring app written in SwiftUI.
Amundsen is a metadata-driven application for improving the productivity of data analysts, data scientists, and engineers when interacting with data.
The Metadata Platform for your Data and AI Stack
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Technical Analysis Indicators - Pandas TA is an easy-to-use Python 3 Pandas Extension with 150+ Indicators
Roadmap to becoming a data engineer in 2021
Worldwide holidays and workdays computational toolkit.
A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
🧠 Laws, Theories, Principles and Patterns for developers and technologists.
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
Rich is a Python library for rich text and beautiful formatting in the terminal.
DuckDB is an analytical in-process SQL database management system
The easiest way to serve AI apps and models - Build Model Inference APIs, Job queues, LLM apps, Multi-model pipelines, and more!
Statistical package in Python based on Pandas
📝 An awesome Data Science repository to learn from and apply to real-world problems.
Prefect is a workflow orchestration framework for building resilient data pipelines in Python.
Questions to ask the company during your interview