Skip to content
View guykhazma's full-sized avatar

Highlights

  • Pro

Organizations

@xskipper-io

Block or report guykhazma

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Efficient Triton Kernels for LLM Training

Python 3,828 229 Updated Dec 15, 2024

Create architecture diagrams from code automatically using large language models (LLMs).

TypeScript 24 2 Updated Dec 7, 2024

Python SQL Parser and Transpiler

Python 6,866 731 Updated Dec 13, 2024

Open Control Plane for Tables in Data Lakehouse

Java 315 53 Updated Dec 12, 2024

Apache Amoro (incubating) is a Lakehouse management system built on open data lake formats.

Java 887 298 Updated Dec 13, 2024

DSPy: The framework for programming—not prompting—language models

Python 20,181 1,532 Updated Dec 15, 2024

Fair-code workflow automation platform with native AI capabilities. Combine visual building with custom code, self-host or cloud, 400+ integrations.

TypeScript 51,501 8,609 Updated Dec 15, 2024

Open source project for data preparation of LLM application builders

Python 357 140 Updated Dec 15, 2024

Open-source end-to-end LLM Development Platform

Java 2,576 158 Updated Dec 15, 2024

Generate Parquet Files

Rust 10 3 Updated Dec 11, 2024

Apache Polaris, the interoperable, open source catalog for Apache Iceberg

Python 1,218 147 Updated Dec 13, 2024

Open-source vector similarity search for Postgres

C 13,005 611 Updated Dec 9, 2024

Header-only C++/python library for fast approximate nearest neighbors

C++ 4,435 660 Updated Aug 11, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 31,932 3,672 Updated Dec 14, 2024

A tutorial of building an LSM-Tree storage engine in a week.

Rust 2,937 407 Updated Dec 9, 2024

A RocksDB compatible KV storage engine with better performance

C++ 2,052 204 Updated Jun 7, 2024

🎨 Diagram as Code for prototyping cloud system architectures

Python 39,937 2,553 Updated Dec 11, 2024

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 9,192 941 Updated Nov 25, 2024

Command line (CLI) tool to inspect Apache Parquet files on the go

Python 176 10 Updated Nov 9, 2023

MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising

Python 2,513 268 Updated Jun 28, 2024

💬 An extensive collection of exceptional resources dedicated to the captivating world of talking face synthesis! ⭐ If you find this repo useful, please give it a star! 🤩

878 45 Updated Dec 14, 2024

MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting

Python 3,075 382 Updated Nov 27, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 36,919 4,205 Updated Nov 7, 2024

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translatio…

Python 11,259 1,861 Updated Dec 12, 2024

Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction…

Python 2,144 354 Updated Dec 7, 2024

This is the official repository for OTAvatar: One-shot Talking Face Avatar with Controllable Tri-plane Rendering [CVPR2023].

Python 314 40 Updated Mar 5, 2024

Talking Head (3D): A JavaScript class for real-time lip-sync using Ready Player Me full-body 3D avatars.

JavaScript 374 113 Updated Dec 2, 2024

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

C++ 25,468 3,984 Updated Sep 3, 2024

Includes notes on using Apache Spark in general, notes on using Spark for Physics, how to run TPCDS on PySpark, how to create histograms with Spark, tools for performance testing CPUs, Jupyter note…

Jupyter Notebook 427 149 Updated Sep 12, 2024

Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance Dashboard using containers technology.

Dockerfile 118 22 Updated Nov 19, 2024
Next