Lists (1)
Sort Name ascending (A-Z)
Stars
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Scalable datastore for metrics, events, and real-time analytics
Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of …
An extremely fast Python package and project manager, written in Rust.
OpenCL Miner for Autolykos v2 (Ergo) for AMD GPUs
This is a repo with links to everything you'd ever want to learn about data engineering
A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.
This repo provides a customizable stack for starting new ML projects on Databricks that follow production best-practices out of the box.
Download market data from Yahoo! Finance's API
Jupyter Notebooks and code for Python for Finance (2nd ed., O'Reilly) by Yves Hilpisch.
Tutorials for DataCamp (www.datacamp.com)
Mirror of the official PostgreSQL GIT repository. Note that this is just a *mirror* - we don't work with pull requests on github. To contribute, please see https://wiki.postgresql.org/wiki/Submitti…
A set of homebrew formulae to install virt-manager and virt-viewer on MAC OSX
Always know what to expect from your data.
A collection of examples that show how to use CrewAI framework to automate workflows.
This repository is for active development of the Azure SDK for Python. For consumers of the SDK we recommend visiting our public developer docs at https://learn.microsoft.com/python/azure/ or our v…
Open, Multi-modal Catalog for Data & AI
🦜🔗 Build context-aware reasoning applications
AGI's query engine - Platform for building AI that can learn and answer questions over federated data.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.
Python packaging and dependency management made easy
Update the packages in a requirements.txt file.
Data Structures and Algorithms in Python
Contains all the code samples from the Zero to Mastery : Master the Coding Interview - Data Structures + Algorithms course by Andrei Neagoie, in Python.
DuckDB is an analytical in-process SQL database management system