Stars
Data Engineering Zoomcamp is a free nine-week course that covers the fundamentals of data engineering.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Smoothly Manage Multiple LLMs (OpenAI, Anthropic, Azure) and Image Models (Dall-E, SDXL), Speed Up Responses, and Ensure Non-Stop Reliability.
The world's best login box powered by WorkOS and Radix.
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
Fast, Simple and a cost effective tool to replicate data from Postgres to Data Warehouses, Queues and Storage
This is a template you can use for your next data engineering portfolio project.
SoTA LLM for converting natural language questions to SQL queries
EZQL ask your database questions using natural language.
A dbt package for modelling dbt metadata. https://brooklyn-data.github.io/dbt_artifacts
This extension makes vscode seamlessly work with dbt™: Auto-complete, preview, column lineage, AI docs generation, health checks, cost estimation etc
One-stop-shop for docs and test coverage of dbt projects.
This dbt package contains macros to support unit testing that can be (re)used across dbt projects.
dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)
Dagster Labs' open-source data platform, built with Dagster.
Examples of programs built using Modal
Master programming by recreating your favorite technologies from scratch.
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
📊 Cube — Universal semantic layer platform for AI, BI, spreadsheets, and embedded analytics
Eclipse Theia is a cloud & desktop IDE framework implemented in TypeScript.
Provision remote development environments via Terraform
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
Flowman is an ETL framework powered by Apache Spark. With its declarative approach, Flowman simplifies the development of complex data pipelines.