    Repositories list

    • VLM-R1

      Public
      Solve Visual Understanding with Reinforced VLMs
      Python
      146 · 0 · 0 · 0 · Updated Feb 20, 2025
    • imageflow

      Public
      High-performance image manipulation for web servers. Includes imageflow_server, imageflow_tool, and libimageflow
      Rust
      GNU Affero General Public License v3.0
      141 · 0 · 0 · 0 · Updated Jan 19, 2025
    • jessibuca

      Public
      Jessibuca is an open-source, pure HTML5 live-stream player
      C
      GNU General Public License v3.0
      429 · 0 · 0 · 0 · Updated Jan 2, 2025
    • k

      Public
      k: Kinematics Library for rust-lang
      Rust
      Apache License 2.0
      17 · 0 · 0 · 0 · Updated Dec 26, 2024
    • headway

      Public
      Self-hostable maps stack, powered by OpenStreetMap.
      Rust
      Apache License 2.0
      62 · 0 · 0 · 0 · Updated Dec 20, 2024
    • egui

      Public
      egui: an easy-to-use immediate mode GUI in Rust that runs on both web and native
      Rust
      Apache License 2.0
      1.7k · 0 · 0 · 0 · Updated Dec 20, 2024
    • ➷ A robust JavaScript library for capturing keyboard input. It has no dependencies.
      JavaScript
      MIT License
      413 · 0 · 0 · 0 · Updated Dec 18, 2024
    • Audio waveform player
      TypeScript
      BSD 3-Clause "New" or "Revised" License
      1.7k · 0 · 0 · 0 · Updated Dec 15, 2024
    • The UIOTOS Community Edition, built on JavaScript and ht.js, opens up its pioneering page nesting technology and offers its core source code, inviting developers to learn from and use these resources in their projects.
      JavaScript
      Apache License 2.0
      12 · 0 · 0 · 0 · Updated Dec 15, 2024
    • AudioLDM

      Public
      AudioLDM: Generate speech, sound effects, music and beyond, with text.
      Python
      Other
      230 · 0 · 0 · 0 · Updated Dec 9, 2024
    • screen sharing for developers https://screego.net/
      Go
      GNU General Public License v3.0
      587 · 0 · 0 · 0 · Updated Dec 7, 2024
    • FunASR

      Public
      A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
      Python
      Other
      864 · 0 · 0 · 0 · Updated Dec 5, 2024
    • webvm

      Public
      Virtual Machine for the Web
      Svelte
      Apache License 2.0
      1.9k · 0 · 0 · 0 · Updated Dec 5, 2024
    • A modern ebook manager and reader with sync and backup capacities for Windows, macOS, Linux and Web
      JavaScript
      GNU Affero General Public License v3.0
      1.6k · 0 · 0 · 0 · Updated Nov 17, 2024
    • A recreation of the classic Visual Basic 6 IDE and language in C# with Avalonia
      C#
      MIT License
      81 · 0 · 0 · 0 · Updated Nov 12, 2024
    • openvla

      Public
      OpenVLA: An open-source vision-language-action model for robotic manipulation.
      Python
      MIT License
      345 · 0 · 0 · 0 · Updated Nov 5, 2024
    • A simple screen parsing tool towards pure vision based GUI agent
      Jupyter Notebook
      Creative Commons Attribution 4.0 International
      1.3k · 0 · 0 · 0 · Updated Nov 5, 2024
    • wechaty

      Public
      Conversational RPA SDK for Chatbot Makers. Join our Discord: https://discord.gg/7q8NBZbQzt
      TypeScript
      Apache License 2.0
      2.7k · 0 · 0 · 0 · Updated Oct 29, 2024
    • Generate changelogs and release notes from a project's commit messages and metadata.
      TypeScript
      ISC License
      728 · 0 · 0 · 0 · Updated Oct 7, 2024
    • Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
      Python
      MIT License
      149 · 0 · 0 · 0 · Updated Sep 15, 2024
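    • Hello 算法 (Hello Algo): a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese editions are updated in sync; the English version is in progress.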
      Java
      Other
      14k · 0 · 0 · 0 · Updated Sep 12, 2024
    • Umi-OCR

      Public
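      Free, open-source, offline OCR software. Supports screenshot capture and batch image import, PDF document recognition, exclusion of watermarks and page headers/footers, and QR code scanning and generation. Includes built-in libraries for multiple languages.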
      Python
      MIT License
      3k · 0 · 0 · 0 · Updated Sep 6, 2024
    • MiniCPM-V

      Public
      MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
      Python
      Apache License 2.0
      1.3k · 0 · 0 · 0 · Updated Aug 31, 2024
    • CosyVoice

      Public
      Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
      Python
      Apache License 2.0
      1.1k · 0 · 0 · 0 · Updated Aug 30, 2024
    • ReKep

      Public
      ReKep: Spatio-Temporal Reasoning of Relational Keypoint Constraints for Robotic Manipulation
      Python
      70 · 0 · 0 · 0 · Updated Aug 30, 2024
    • lerobot

      Public
      🤗 LeRobot: End-to-end Learning for Real-World Robotics in Pytorch
      Python
      Apache License 2.0
      994 · 0 · 0 · 0 · Updated Aug 29, 2024
    • 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
      Python
      MIT License
      4.6k · 0 · 0 · 0 · Updated Aug 28, 2024
    • JSON Hero is an open-source, beautiful JSON explorer for the web that lets you browse, search and navigate your JSON files at speed. 🚀. Built with 💜 by the Trigger.dev team.
      TypeScript
      Apache License 2.0
      553 · 0 · 0 · 0 · Updated Aug 22, 2024
    • MessagePack implementation for Rust / msgpack.org[Rust]
      Rust
      MIT License
      140 · 0 · 0 · 0 · Updated Aug 21, 2024
    • EchoMimic

      Public
      Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
      Python
      Apache License 2.0
      398 · 0 · 0 · 0 · Updated Aug 15, 2024