Skip to content
View Nayuta403's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@bytedance @LianjiaTech @cfug @fluttercandies

Block or report Nayuta403

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository for ShowUI: One Vision-Language-Action Model for GUI Visual Agent

Python 680 37 Updated Dec 21, 2024

2d 纯计算高性能刚体物理引擎

TypeScript 76 12 Updated Mar 20, 2022

Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.

Jupyter Notebook 40,202 4,266 Updated Jul 28, 2024

Towards Large Multimodal Models as Visual Foundation Agents

Python 145 4 Updated Dec 19, 2024

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 263 17 Updated Dec 17, 2024

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 5,243 407 Updated Dec 13, 2024

🔥🔥 btrace(AKA RheaTrace) is a high performance Android trace tool which is based on Perfetto, it support to define custom events automatically during building apk and using bhook to provider more n…

Kotlin 1,957 274 Updated Sep 18, 2023

VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automation in a step-by-step manner.

Python 47 7 Updated Oct 17, 2024

💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.

355 19 Updated Dec 21, 2024

AndroidWorld is an environment and benchmark for autonomous agents

Python 153 15 Updated Dec 19, 2024

🔥Android无障碍服务(AccessibilityService)开发框架,Android自动化脚本框架,快速开发复杂自动化任务、远程协助、监听等

Kotlin 324 94 Updated Dec 16, 2024

Vreo (VR Video 缩写) 是基于如视三维渲染引擎 Five 和 用户界面构建库 React 实现的如视 3D 空间剧本播放器。

TypeScript 33 8 Updated May 9, 2024
HTML 4 2 Updated Apr 9, 2024

Android 技术中台,但愿人长久,搬砖不再有

Java 6,508 1,365 Updated Sep 10, 2022

An input-component for controlling your app in natural language using an LLM though LangChain.dart

Dart 12 4 Updated Nov 1, 2024

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,204 419 Updated May 29, 2024

Paper list for Personal LLM Agents

346 16 Updated May 8, 2024

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

Python 287 42 Updated Mar 22, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,904 655 Updated Aug 5, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,263 572 Updated Aug 8, 2024

Modular and customizable Material Design UI components for Android

Java 16,458 3,088 Updated Dec 20, 2024

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,568 661 Updated Dec 22, 2024

Real-Time audio processing library written in Dart.

C 105 12 Updated Jul 18, 2024

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 96,534 15,705 Updated Dec 21, 2024

Noise is an Android wrapper for kissfft, a FFT implementation written in C.

Java 327 43 Updated Nov 8, 2019

✨✨✨这有一包小鱼干,确定不要吃嘛?( 逃

1,792 251 Updated May 15, 2024

🔥 Android Kotlin时代的Adapter, Dsl 的形式使用 RecyclerView.Adapter, 支持折叠展开, 树结构,悬停,情感图状态切换, 加载更多, 多类型Item,侧滑菜单等

Kotlin 706 59 Updated Oct 9, 2024

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Python 5,875 537 Updated Jun 11, 2024

Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.

JavaScript 12,795 641 Updated Oct 30, 2024

Sharp looking Flutter applications with fractional device pixel ratios.

Dart 94 2 Updated Feb 16, 2024
Next