Skip to content
View Nayuta403's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@bytedance @LianjiaTech @cfug @fluttercandies

Block or report Nayuta403

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
12 stars written in Python
Clear filter

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,416 427 Updated May 29, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 5,968 513 Updated Mar 7, 2025

🎉 Repo for LaWGPT, Chinese-Llama tuned with Chinese Legal knowledge. 基于中文法律知识的大语言模型

Python 5,935 548 Updated Jun 11, 2024

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

Python 5,600 618 Updated Mar 6, 2025

Mobile-Agent: The Powerful Mobile Device Operation Assistant Family

Python 3,695 358 Updated Mar 13, 2025

Data manipulation and transformation for audio signal processing, powered by PyTorch

Python 2,622 683 Updated Mar 13, 2025

[CVPR 2025] Open-source, End-to-end, Vision-Language-Action model for GUI Agent & Computer Use.

Python 1,086 66 Updated Mar 13, 2025

Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"

Python 327 49 Updated Mar 22, 2024

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL

Python 323 25 Updated Feb 8, 2025

AndroidWorld is an environment and benchmark for autonomous agents

Python 236 27 Updated Mar 6, 2025

Towards Large Multimodal Models as Visual Foundation Agents

Python 192 7 Updated Feb 5, 2025

VisionTasker introduces a novel two-stage framework combining vision-based UI understanding and LLM task planning for mobile task automation in a step-by-step manner.

Python 63 9 Updated Feb 17, 2025