Skip to content
View cboden's full-sized avatar
๐Ÿ“‰
Munging data
๐Ÿ“‰
Munging data

Organizations

@reactphp @ratchetphp

Block or report cboden

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this userโ€™s behavior. Learn more about reporting abuse.

Report abuse
Showing results

An AI-powered Personal Identifiable Information (PII) scanner.

Python 647 56 Updated Nov 24, 2023

๐Ÿ”ฅ Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.

TypeScript 19,630 1,540 Updated Dec 13, 2024

Open-source platform for extracting structured data from documents using AI.

TypeScript 1,126 33 Updated Dec 10, 2024

Replicates any database (CDC events) to Apache Iceberg (To Cloud Storage)

Java 208 38 Updated Dec 3, 2024

There can be more than Notion and Miro. AFFiNE(pronounced [ษ™โ€˜fain]) is a next-gen knowledge base that brings planning, sorting and creating all together. Privacy first, open-source, customizable anโ€ฆ

TypeScript 43,295 2,830 Updated Dec 13, 2024

๐Ÿ“™ Awesome Data Catalogs and Observability Platforms.

737 55 Updated Jul 27, 2024

Entity Relation Diagrams generation tool

Python 1,176 118 Updated Dec 9, 2024

Database diagrams editor that allows you to visualize and design your DB with a single query.

TypeScript 10,847 551 Updated Dec 11, 2024

A curated list of data engineering tools for software developers

6,882 1,237 Updated Oct 24, 2024

Temporal service

Go 12,290 859 Updated Dec 13, 2024

JavaScript library for working with recurrence rules for calendar dates as defined in the iCalendar RFC and more.

TypeScript 3,381 516 Updated Jun 27, 2024

๐——๐—ฎ๐˜๐—ฎ, ๐—”๐—ป๐—ฎ๐—น๐˜†๐˜๐—ถ๐—ฐ๐˜€ & ๐—”๐—œ. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com

Rust 7,969 753 Updated Dec 13, 2024

The data-validation toolkit for enhanced dbt (data build tool) PR review

TypeScript 274 7 Updated Dec 13, 2024

Scan databases and data warehouses for PII data. Tag tables and columns in data catalogs like Amundsen and Datahub

Python 286 97 Updated Jan 5, 2024

Collect, aggregate, and visualize a data ecosystem's metadata

Java 1,798 322 Updated Dec 12, 2024

Generate the ERD as a code from dbt artifacts

Python 222 32 Updated Sep 28, 2024

jq extension for SQLite.

C 93 Updated Jul 8, 2024

ingestr is a CLI tool to copy data between any databases with a single command seamlessly.

Python 2,582 65 Updated Dec 13, 2024

data load tool (dlt) is an open source Python library that makes data loading easy ๐Ÿ› ๏ธ

Python 2,800 183 Updated Dec 13, 2024

Opiniated RAG for integrating GenAI in your apps ๐Ÿง  Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: โ€ฆ

Python 36,883 3,599 Updated Dec 13, 2024

Easily run a modded dedicated Valheim server.

Go 9 Updated Dec 11, 2024

Titan Core - Snowflake infrastructure-as-code. Provision environments, automate deploys, CI/CD. Manage RBAC, users, roles, and data access. Declarative Python Resource API. Change Management tool fโ€ฆ

Python 438 31 Updated Nov 27, 2024

The GitHub/GitLab for database DevSecOps. World's most advanced database DevSecOps solution for Developer, Security, DBA and Platform Engineering teams.

Go 11,604 743 Updated Dec 13, 2024

dbt package to manage snowflake objects

Python 26 6 Updated Oct 24, 2024

Snowflake Native SDK for Connectors

Java 26 13 Updated Dec 11, 2024

Scalar is an open-source API platform:ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€๐ŸŒ Modern Rest API Clientใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€๐Ÿ“– Beautiful API Referencesใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€ใ€€โ€ฆ

TypeScript 7,865 249 Updated Dec 13, 2024

Provides automated YAML management, a dbt server, streamlit workbench, and git-integrated dbt model output diff tools

Python 484 49 Updated Dec 11, 2024

A high-performance observability data pipeline.

Rust 18,379 1,619 Updated Dec 13, 2024
Next