-
Zhejiang University
- Hangzhou, China
Lists (5)
Sort Name ascending (A-Z)
Stars
This is the official implementation of MusER (AAAI'24).
CogDL: A Comprehensive Library for Graph Deep Learning (WWW 2023)
A lightweight test input generator for Android. Similar to Monkey, but with more intelligence and cool features!
Source code for the paper "Empowering LLM to use Smartphone for Intelligent Task Automation"
Segment Anything in Medical Images
This is an official repo for fine-tuning SAM to customized medical images.
Acceptance rates for the major AI conferences
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
A curated list of awesome scratch projects
Open Overleaf/ShareLaTex projects in vscode, with full collaboration support.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
A collection of composable React components for building interactive data visualizations
A scalable template for PyTorch projects, with examples in Image Segmentation, Object classification, GANs and Reinforcement Learning.
A PyTorch Implementation of Neural IMage Assessment
LLM agent system for HCI research question co-creation, brainstorming and ideation
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Official Repository of "A Fair Comparison of Graph Neural Networks for Graph Classification", ICLR 2020
Turn any webpage into structured data using LLMs
Schedule-Free Optimization in PyTorch
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
official implementation for the paper "Simplifying Graph Convolutional Networks"
A professionally curated list of awesome resources (paper, code, data, etc.) on transformers in time series.
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU su…
[CVPR2024] ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts