-
16:00
(UTC +08:00)
Stars
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
This repository contains examples for customers to get started using the Amazon Bedrock Service. This contains examples for all available foundational models
Solve Visual Understanding with Reinforced VLMs
A Flexible Framework for Comprehensive Multimodal Model Evaluation
The official implementation of "NAS-BNN: Neural Architecture Search for Binary Neural Networks"
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Supercharge Your LLM Application Evaluations 🚀
LVBench: An Extreme Long Video Understanding Benchmark
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
This guide provides instructions for creating and managing a SageMaker Hyperpod cluster, and training the AnimateAnyone algorithm on SageMaker Hyperpod
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
🌀 R^2-Tuning: Efficient Image-to-Video Transfer Learning for Video Temporal Grounding (ECCV 2024)
Official pytorch repository for CG-DETR "Correlation-guided Query-Dependency Calibration in Video Representation Learning for Temporal Grounding"
A modular and comprehensive solution to deploy a Multi-LLM and Multi-RAG powered chatbot (Amazon Bedrock, Anthropic, HuggingFace, OpenAI, Meta, AI21, Cohere, Mistral) using AWS CDK on AWS
Easy, fast and very cheap training and inference on AWS Trainium and Inferentia chips.
Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。