Stars
Databricks framework to validate Data Quality of pySpark DataFrames
A chatbot/GraphRAG framework that creates multi-llm-agents from social platform user comments and let them debate on specific topics.
Community managed domain list. Generate geosite.dat for V2Ray.
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Documentation for n8n, a fair-code licensed automation tool with a free community edition and powerful enterprise options. Build AI functionality into your workflows.
Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.
This is the github repo for Learning Spark: Lightning-Fast Data Analytics [2nd Edition]
DBSQL SME Repo contains demos, tutorials, blog code, advanced production helper functions and more!
Python training for business analysts and traders
(Legacy) Command Line Interface for Databricks
Databricks’ Dolly, a large language model trained on the Databricks Machine Learning Platform
Repository of helm charts for deploying DataHub on a Kubernetes cluster
The Metadata Platform for your Data and AI Stack
This repo demonstrates the development of a real-time data pipeline designed to ingest, process, and analyze stock market data. Using cutting-edge tools like Apache Kafka, PostgreSQL, and Python, t…
Demonstration of using Files in Repos with Databricks Delta Live Tables
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
First open-source data discovery and observability platform. We make a life for data practitioners easy so you can focus on your business.
An example showing how to apply software engineering best practices to Databricks notebooks.
🧱 Databricks CLI eXtensions - aka dbx is a CLI tool for development and advanced Databricks workflows management.
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.