Stars

Data

55 repositories

An open source multi-tool for exploring and publishing data

Python 9,639 695 Updated Nov 29, 2024

A terminal spreadsheet multitool for discovering and arranging data

Python 7,966 283 Updated Dec 9, 2024

A self-contained dbt project for testing purposes

465 944 Updated Sep 12, 2024

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 14,739 3,571 Updated Dec 17, 2024

Nessie: Transactional Catalog for Data Lakes with Git-like semantics

Java 1,066 134 Updated Dec 18, 2024

Dremio - the missing link in modern data

Java 1,388 444 Updated Oct 25, 2024

ClickHouse® is a real-time analytics DBMS

C++ 38,100 6,969 Updated Dec 18, 2024

Apache Druid: a high performance real-time analytics database.

Java 13,552 3,713 Updated Dec 17, 2024

1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

Python 12,602 1,690 Updated Dec 18, 2024

Apache Pinot - A realtime distributed OLAP datastore

Java 5,564 1,307 Updated Dec 17, 2024

A desktop application for viewing and analyzing tabular data

TypeScript 3,214 120 Updated Nov 22, 2024

A curated list of analytics frameworks, software and other tools.

3,953 437 Updated May 9, 2024

A curated list of awesome big data frameworks, resources and other awesomeness.

13,320 2,561 Updated May 7, 2024

re_data - fix data issues before your users & CEO would discover them 😊

HTML 1,560 123 Updated Apr 30, 2024

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 37,626 14,396 Updated Dec 17, 2024

Apache DolphinScheduler is the modern data orchestration platform. Agile at creating high-performance workflows with low code

Java 13,145 4,672 Updated Dec 16, 2024

Nowcasting Malaysia GDP

Jupyter Notebook 4 5 Updated Mar 9, 2023

A graph-relational database with declarative schema, built-in migration system, and a next-generation query language

Python 13,208 405 Updated Dec 17, 2024

Business intelligence as code: build fast, interactive data visualizations in SQL and markdown

JavaScript 4,636 219 Updated Dec 17, 2024

Monte Carlo simulation of the NBA season, leveraging dbt, duckdb and evidence.dev

Python 462 70 Updated Dec 16, 2024

PRQL is a modern language for transforming data — a simple, powerful, pipelined SQL replacement

Rust 10,009 222 Updated Dec 16, 2024

Rust-powered collection of financial functions.

Rust 180 17 Updated Oct 28, 2024

OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team co…

TypeScript 5,746 1,078 Updated Dec 17, 2024

Self-serve BI to 10x your data team ⚡️

TypeScript 4,077 437 Updated Dec 17, 2024

🦆 A curated list of awesome DuckDB resources

1,434 111 Updated Dec 17, 2024

A fast viewer for CSV/Parquet files and databases such as DuckDB, SQLite, PostgreSQL, MySQL, ClickHouse, etc., based on Tauri

Rust 254 10 Updated Nov 1, 2024

Web application providing an intuitive user experience to databases.

Svelte 2,438 340 Updated Dec 17, 2024

Turns Data and AI algorithms into production-ready web applications in no time.

Python 17,270 1,866 Updated Dec 17, 2024