Stars
Survey of Small Language Models from Penn State, ...
Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
A simple screen parsing tool towards pure vision based GUI agent
[ICML 2024] LLMCompiler: An LLM Compiler for Parallel Function Calling
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML2023)
Create powerful Hydra applications without the yaml files and boilerplate code.
Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents.
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
GPUd automates monitoring, diagnostics, and issue identification for GPUs
Code repo for the paper "SpinQuant: LLM quantization with learned rotations"
Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA
Jobs_Applier_AI_Agent_AIHawk aims to ease the job hunt by automating the job application process. Using artificial intelligence, it enables users to apply for multiple jobs in a tailored way.
A parser, editor and profiler tool for ONNX models.
List of papers related to neural network quantization in recent AI conferences and journals.
PyTorch code for Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
A tool to modify ONNX models visually, based on Netron and Flask.
ONNX-compatible Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data
Official repository for ICLR 2025 paper "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing". Your efficient and high-quality synthetic data generation pipeline!
Universal Monocular Metric Depth Estimation
[NeurIPS 2024] Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
LPIPS metric. pip install lpips
Lets you use your GoPro camera as a webcam on Linux
A natural language interface for computers