Starred repositories
RAG for Vietnamese Wikipedia corpus.
🧙 Build, run, and manage data pipelines for integrating and transforming data.
A cloud-native vector database, storage for next generation AI applications
A curated list of open source tools used in analytics platforms and data engineering ecosystem
Nyc_Taxi_Data_Pipeline - DE Project
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore, Minio, Postgres)
The Python beginner course given at Code Academy, tailored by me
A machine learning project that includes ML/DL projects, notebooks, ML/DL cheat sheets, useful information on AI/AGI, and code snippets, scripts, and tasks with tips.
Open-source customer engagement. Automate transactional and marketing messages across email, SMS, mobile push, WhatsApp, Slack, and more 📨
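📺 IPTV live TV source update tool 🚀: ✨ CCTV, 📡 satellite TV, ☘️ Guangdong and other provincial local channels, 🌊 Hong Kong, Macau, and Taiwan, 🎬 movies, 🎥 Migu, 🏀 sports, 🪁 animation, 🎮 games, 🎵 music, 🏛 classic theater; supports IPv4/IPv6; supports adding custom channels; supports multicast sources, hotel sources, subscription sources, and keyword search; updates automatically twice a day, with results usable in players such as TVBox; runs via workflow, Docker (amd64/arm64/arm v7), command line, or GUI | IPTV live …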
Python programs, usually short, of considerable difficulty, to perfect particular skills.
1000+ DevOps Bash Scripts - AWS, GCP, Kubernetes, Docker, CI/CD, APIs, SQL, PostgreSQL, MySQL, Hive, Impala, Kafka, Hadoop, Jenkins, GitHub, GitLab, BitBucket, Azure DevOps, TeamCity, Spotify, MP3,…
Free & open-source Laravel CRM solution for SMEs and enterprises for complete customer lifecycle management.
Buildroot external tree for RPi4 based edgemap image with Hyperpixel 4" display
A query builder for PostgreSQL, MySQL, CockroachDB, SQL Server, SQLite3 and Oracle, designed to be flexible, portable, and fun to use.
🚀 10x easier, 🚀 140x lower storage cost, 🚀 high performance, 🚀 petabyte scale - Elasticsearch/Splunk/Datadog alternative for 🚀 (logs, metrics, traces, RUM, Error tracking, Session replay).
A modern JavaScript utility library that's 2-3 times faster and up to 97% smaller—a major upgrade to lodash.
A vector search SQLite extension that runs anywhere!
Next-generation ORM for Node.js & TypeScript | PostgreSQL, MySQL, MariaDB, SQL Server, SQLite, MongoDB and CockroachDB
Simple ClickHouse UI that relies on system tables to help monitor your cluster and provide an overview of it
Add some chaos to your HTTP streams to validate player behaviour
Automatically generate subtitles from an input audio or video file using OpenAI Whisper
VictoriaMetrics: fast, cost-effective monitoring solution and time series database
Code examples for Apache Spark using Python