- Plaksha University
-
23:43
(UTC +05:30)
Highlights
- Pro
Lists (2)
Sort Name ascending (A-Z)
Stars
Inference and training library for high-quality TTS models.
🔥Highlighting the top ML papers every week.
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
SwarmZero's SDK for building AI agents, swarms of agents and much more.
This repository contains demos I made with the Transformers library by HuggingFace.
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
Official repo for consistency models.
Offical code repository of “BooW-VTON: Boosting In-the-Wild Virtual Try-On via Mask-Free Pseudo Data Training”
Official PyTorch implementation of SegFormer
Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
Implementation of Alphafold 3 from Google Deepmind in Pytorch
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
⌚A curated list of awesome watchOS frameworks, libraries, sample apps, including Objective-C and Swift Projects
Stable Diffusion with Core ML on Apple Silicon
Largest list of models for Core ML (for iOS 11+)
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
High-resolution models for human tasks.
Code for ACL 2024 main conference paper "Can We Achieve High-quality Direct Speech-to-Speech Translation Without Parallel Speech Data?".
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
A playbook for systematically maximizing the performance of deep learning models.
llama3 implementation one matrix multiplication at a time
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
A minimal GPU design in Verilog to learn how GPUs work from the ground up
pix2tex: Using a ViT to convert images of equations into LaTeX code.
[TMM 2024] Implementation of the paper “Temporal Decoupling Graph Convolutional Network for Skeleton-based Gesture Recognition”.
thetushargoyal / roadcam
Forked from bdtinc/maskcamJetson Nano-based smart camera system.
Jetson Nano-based smart camera system that measures crowd face mask usage in real-time.
OpenCV Python Neural Network Autonomous RC Car