Stars

Data

67 repositories

An open source multi-tool for exploring and publishing data

Python 9,842 713 Updated Mar 3, 2025

A terminal spreadsheet multitool for discovering and arranging data

Python 8,079 288 Updated Mar 4, 2025

A self-contained dbt project for testing purposes

483 960 Updated Sep 12, 2024

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 15,073 3,645 Updated Mar 6, 2025

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java 1,150 145 Updated Mar 6, 2025

Dremio - the missing link in modern data

Java 1,414 446 Updated Oct 25, 2024

ClickHouse® is a real-time analytics database management system

C++ 39,386 7,155 Updated Mar 7, 2025

Apache Druid: a high performance real-time analytics database.

Java 13,627 3,727 Updated Mar 6, 2025

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Python 12,768 1,697 Updated Mar 5, 2025

Apache Pinot - A realtime distributed OLAP datastore

Java 5,656 1,336 Updated Mar 7, 2025

A desktop application for viewing and analyzing tabular data

TypeScript 3,263 121 Updated Mar 5, 2025

A curated list of analytics frameworks, software and other tools.

3,996 438 Updated May 9, 2024

A curated list of awesome big data frameworks, resources and other awesomeness.

13,475 2,565 Updated Feb 14, 2025

re_data - fix data issues before your users & CEO would discover them 😊

HTML 1,565 125 Updated Apr 30, 2024

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 39,054 14,769 Updated Mar 7, 2025

Apache DolphinScheduler is a modern data orchestration platform for agile creation of high-performance workflows with low code

Java 13,304 4,721 Updated Mar 6, 2025

Nowcasting Malaysia GDP

Jupyter Notebook 4 5 Updated Mar 9, 2023

Gel supercharges Postgres with a modern data model, graph queries, Auth & AI solutions, and much more.

Python 13,490 405 Updated Mar 7, 2025

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

JavaScript 4,900 239 Updated Mar 6, 2025

Monte Carlo simulation of the NBA season, leveraging dbt, duckdb and evidence.dev

Python 494 71 Updated Feb 12, 2025

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 10,128 227 Updated Mar 6, 2025

Rust-powered collection of financial functions.

Rust 189 19 Updated Jan 2, 2025

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 6,209 1,167 Updated Mar 6, 2025

Self-serve BI to 10x your data team ⚡️

TypeScript 4,501 499 Updated Mar 6, 2025

🦆 A curated list of awesome DuckDB resources

1,571 120 Updated Mar 5, 2025

A fast viewer for CSV/Parquet files and databases such as DuckDB, SQLite, PostgreSQL, MySQL, ClickHouse, etc., based on Tauri

Rust 295 11 Updated Feb 11, 2025

An intuitive spreadsheet-like interface that lets users of all technical skill levels view, edit, query, and collaborate on Postgres data directly—100% open source and self hosted, with native Post…

Svelte 3,985 367 Updated Mar 6, 2025

Turns Data and AI algorithms into production-ready web applications in no time.

Python 17,871 1,878 Updated Mar 6, 2025