- All languages
- C
- C#
- C++
- CSS
- Circom
- Clojure
- Cuda
- Cython
- Dart
- Dockerfile
- Elixir
- G-code
- Go
- Groovy
- HTML
- Java
- JavaScript
- Jupyter Notebook
- Lua
- MATLAB
- MDX
- Markdown
- Mustache
- Nim
- PHP
- Perl
- Python
- Rich Text Format
- Roff
- Ruby
- Rust
- Scala
- Shell
- Solidity
- Svelte
- Swift
- SystemVerilog
- TeX
- TypeScript
- Vim Script
- Vue
- YARA
- Zig
Starred repositories
Official repository for our work on micro-budget training of large-scale diffusion models.
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
Generate any location from the real world in Minecraft Java Edition with a high level of detail.
Make websites accessible for AI agents
This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work.
A framework to enable multimodal models to operate a computer.
Godot Engine – Multi-platform 2D and 3D game engine
Anime Girls Holding Programming Books
Scan for React performance issues and eliminate slow renders in your app
Adaptive Length Image Tokenization via Recurrent Allocation | How many tokens is an image worth ?
Browser automation system that uses AI-driven planning to navigate web pages and perform goals.
CoTracker is a model for tracking any point (pixel) on a video.
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
An open-source, lightweight note-taking solution. The pain-less way to create your meaningful notes. Your Notes, Your Way.
Anthropic's educational courses
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
Codebase for Aria - an Open Multimodal Native MoE
library & platform to build, distribute, monetize ai apps that have the full context (like rewind, granola, etc.), open source, 100% local, developer friendly. 24/7 screen, mic, keyboard recording …
This is a python implementation for stitching images.
🌱 a fast, batteries-included static-site generator that transforms Markdown content into fully functional websites
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340