Highlights
- Pro
Stars
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
This is an navigation app which gives route from source to destination. a streamlined navigation app powered by the OpenRouteService API. Get instant directions optimized for speed and accuracy.
An example desktop app built with Tauri version 2. Bundle a Next.js frontend with a Python api server.
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieval Results in RAG Systems (WWW 2025)
Fully open reproduction of DeepSeek-R1
Official code repo for the O'Reilly Book - "Hands-On Large Language Models"
Modeling, training, eval, and inference code for OLMo
[RSS2024] Official implementation of "Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation"
[Embodied-AI-Survey-2024] Paper list and projects for Embodied AI
SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding
[CVPR'23] OpenScene: 3D Scene Understanding with Open Vocabularies
[CVPR 2024] Open3DSG: Open-Vocabulary 3D Scene Graphs from Point Clouds with Queryable Objects and Open-Set Relationships
[ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"
Official repo and evaluation implementation of VSI-Bench
记录cv算法工程师的成长之路,分享计算机视觉和模型压缩部署技术栈笔记。https://harleyszhang.github.io/cv_note/
🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
Transform any arXiv papers into slides using LLMs
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…
A generative and self-guided robotic agent that endlessly propose and master new skills.
The Scene Language: Representing Scenes with Programs, Words, and Embeddings (arXiv preprint)
A Procedural World Generator for Robotics Simulation of Agricultural Tasks
Differentiable Flexible Mechanical Metamaterials
Code and additional information for our paper entitled 'Scene Augmentation Methods for Interactive Embodied AI Tasks'