Data
An open source multi-tool for exploring and publishing data
A terminal spreadsheet multitool for discovering and arranging data
A self-contained dbt project for testing purposes
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
Nessie: Transactional Catalog for Data Lakes with Git-like semantics
ClickHouse® is a real-time analytics database management system
Apache Druid: a high performance real-time analytics database.
1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
Apache Pinot - A realtime distributed OLAP datastore
A desktop application for viewing and analyzing tabular data
A curated list of analytics frameworks, software and other tools.
A curated list of awesome big data frameworks, ressources and other awesomeness.
re_data - fix data issues before your users & CEO would discover them 😊
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more.
Business intelligence as code: build fast, interactive data visualizations in SQL and markdown
Monte Carlo simulation of the NBA season, leveraging dbt, duckdb and evidence.dev
PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…
Self-serve BI to 10x your data team ⚡️
🦆 A curated list of awesome DuckDB resources
A fast viewer for CSV/Parquet files and databases such as DuckDB, SQLite, PostgreSQL, MySQL, Clickhouse, etc., base on Tauri
An intuitive spreadsheet-like interface that lets users of all technical skill levels view, edit, query, and collaborate on Postgres data directly—100% open source and self hosted, with native Post…
Turns Data and AI algorithms into production-ready web applications in no time.