  • China University of Mining and Technology
  • Shanghai, China

Real-time, fine-grained reading list on LLM-synthetic-data.🔥

168 15 Updated Jan 2, 2025

Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2

Python 528 66 Updated Dec 16, 2024

Discover the repository for "ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting," a pioneering study that has been accepted for presentation at CVPR 2024.

Python 16 1 Updated Dec 15, 2024

GPT4 & LangChain Chatbot for large PDF docs

TypeScript 15,005 3,024 Updated Jul 29, 2024

Azur Lane bot (CN/EN/JP/TW) | Azur Lane script: seamless commissions and research, fully automated Operation Siren (open world)

Python 7,114 848 Updated Jan 4, 2025

A library for efficient similarity search and clustering of dense vectors.

C++ 32,218 3,691 Updated Dec 28, 2024

Official implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" (MICCAI 2024).

Python 16 1 Updated Aug 27, 2024

Video classification method for endoscopic ultrasound risk prediction of rectal cancer

Python 7 Updated Jul 6, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,828 8,823 Updated Jan 4, 2025

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,117 4,181 Updated Jan 4, 2025

【ACL 2024】 SALAD benchmark & MD-Judge

Python 114 11 Updated Dec 3, 2024
Python 29 1 Updated Dec 20, 2024

One UI for ChatGPT web, Midjourney, GPTs, Suno, Luma, Runway, Viggle, Flux, Ideogram, Realtime, Pika, and Udio; simultaneously supports Web / PWA / Linux / Win / MacOS platforms

JavaScript 5,698 1,431 Updated Jan 5, 2025

Minimal web UI for GeminiPro.

TypeScript 4,378 12,448 Updated Dec 6, 2024

🍒 Cherry Studio is a desktop client that supports multiple LLM providers

TypeScript 3,075 182 Updated Jan 4, 2025

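OpenAI API management & distribution system, supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu ERNIE Bot (Wenxin Yiyan), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage and redistribute keys, ships as a single executable with a prebuilt Docker image, and deploys in one click, ready to use out of the box. OpenAI key management & redistributi…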

JavaScript 20,613 4,481 Updated Dec 27, 2024

Official code of "Towards General Text-guided Universal Image Synthesis Framework for Customized Multimodal Brain MRI"

Python 7 2 Updated Oct 28, 2024

Generate text images for training deep learning OCR models

Python 1,408 386 Updated Jan 17, 2022

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,333 1,282 Updated Aug 31, 2024

Convert PDF to markdown + JSON quickly with high accuracy

Python 19,050 1,119 Updated Jan 5, 2025

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 23,185 1,681 Updated Jan 5, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,233 1,022 Updated Sep 26, 2024

The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)

Python 126 23 Updated Jul 28, 2024

Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks

Python 1,616 230 Updated Jan 3, 2025

Baidu Netdisk AI Competition, Image Processing Challenge: 2nd-place solution for document image moiré removal

Python 38 11 Updated Nov 28, 2023

ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation

Python 96 Updated Jul 15, 2024

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 494 34 Updated Jan 3, 2025

An extremely simple tool for separating vocals and background music, operated entirely locally through a web interface with no internet connection required, using 2stems/4stems/5stems models

Python 1,415 163 Updated Nov 26, 2024

Faster Whisper transcription with CTranslate2

Python 13,270 1,120 Updated Jan 1, 2025