HanXiaoyou

Her current research mainly focuses on pattern recognition and data mining.

4 followers · 33 following

China University of Mining and Technology
Shanghai, China

Lists (24)

Stars

pengr / LLM-Synthetic-Data

Real-time, fine-grained reading list on LLM-synthetic-data.🔥

168 15 Updated Jan 2, 2025

SuperMedIntel / Medical-SAM2

Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2

Python 528 66 Updated Dec 16, 2024

Yankai96 / ZePT

Discover the repository for "ZePT: Zero-Shot Pan-Tumor Segmentation via Query-Disentangling and Self-Prompting," a pioneering study that has been accepted for presentation at CVPR 2024.

Python 16 1 Updated Dec 15, 2024

mayooear / gpt4-pdf-chatbot-langchain

GPT4 & LangChain Chatbot for large PDF docs

TypeScript 15,005 3,024 Updated Jul 29, 2024

LmeSzinc / AzurLaneAutoScript

Azur Lane bot (CN/EN/JP/TW). Azur Lane script | Seamless commissions and research, fully automated open-world mode

Python 7,114 848 Updated Jan 4, 2025

facebookresearch / faiss

A library for efficient similarity search and clustering of dense vectors.

C++ 32,218 3,691 Updated Dec 28, 2024

XYPB / CLEFT

Official Implementation of "CLEFT: Language-Image Contrastive Learning with Efficient Large Language Model and Prompt Fine-Tuning" on MICCAI 2024.

Python 16 1 Updated Aug 27, 2024

JsongZhang / RCVA-Net

Video classification method for endoscopic ultrasound risk prediction of rectal cancer

Python 7 Updated Jul 6, 2024

openai / whisper

Robust Speech Recognition via Large-Scale Weak Supervision

Python 73,828 8,823 Updated Jan 4, 2025

microsoft / DeepSpeed

DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

Python 36,117 4,181 Updated Jan 4, 2025

OpenSafetyLab / SALAD-BENCH

[ACL 2024] SALAD benchmark & MD-Judge

Python 114 11 Updated Dec 3, 2024

Harbinzzy / All-in-One-Image-Restoration-Survey

107 7 Updated Dec 31, 2024

Alibaba-NLP / CoFE-RAG

Python 29 1 Updated Dec 20, 2024

Dooy / chatgpt-web-midjourney-proxy

A single UI covering ChatGPT web, Midjourney, GPTs, Suno, Luma, Runway, Viggle, Flux, Ideogram, Realtime, Pika, and Udio; supports Web / PWA / Linux / Win / MacOS platforms simultaneously

JavaScript 5,698 1,431 Updated Jan 5, 2025

babaohuang / GeminiProChat

Minimal web UI for GeminiPro.

TypeScript 4,378 12,448 Updated Dec 6, 2024

CherryHQ / cherry-studio

🍒 Cherry Studio is a desktop client that supports multiple LLM providers

TypeScript 3,075 182 Updated Jan 4, 2025

songquanpeng / one-api

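OpenAI API management & distribution system supporting Azure, Anthropic Claude, Google PaLM 2 & Gemini, Zhipu ChatGLM, Baidu Wenxin Yiyan (ERNIE Bot), iFlytek Spark, Alibaba Tongyi Qianwen, 360 Zhinao, and Tencent Hunyuan. It can be used to manage keys for redistribution, ships as a single executable with a prebuilt Docker image, and deploys in one click, ready to use out of the box. OpenAI key management & redistributi…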

JavaScript 20,613 4,481 Updated Dec 27, 2024

Wangyulin-user / TUMSyn

Official code of "Towards General Text-guided Universal Image Synthesis Framework for Customized Multimodal Brain MRI"

Python 7 2 Updated Oct 28, 2024

Sanster / text_renderer

Generate text images for training deep learning OCR models

Python 1,408 386 Updated Jan 17, 2022

PaddlePaddle / ERNIE

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Python 6,333 1,282 Updated Aug 31, 2024

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 19,050 1,119 Updated Jan 5, 2025

opendatalab / MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF into Markdown and JSON formats.

Python 23,185 1,681 Updated Jan 5, 2025

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,233 1,022 Updated Sep 26, 2024

cvlab-stonybrook / PaperEdge

The code and the DIW dataset for "Learning From Documents in the Wild to Improve Document Unwarping" (SIGGRAPH 2022)

Python 126 23 Updated Jul 28, 2024

open-compass / VLMEvalKit

Open-source evaluation toolkit of large vision-language models (LVLMs), support 160+ VLMs, 50+ benchmarks

Python 1,616 230 Updated Jan 3, 2025

zdyshine / Baidu-netdisk-AI-Image-processing-Challenge-demoire

Baidu Netdisk AI Competition, Image Processing Challenge: 2nd-place solution for document image moiré removal

Python 38 11 Updated Nov 28, 2023

ChartMimic / ChartMimic

ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation

Python 96 Updated Jul 15, 2024

Yuliang-Liu / MultimodalOCR

On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)

Python 494 34 Updated Jan 3, 2025

jianchang512 / vocal-separate

An extremely simple tool for separating vocals and background music, operated entirely locally through a web page with no internet connection required, using 2stems/4stems/5stems models

Python 1,415 163 Updated Nov 26, 2024

SYSTRAN / faster-whisper

Faster Whisper transcription with CTranslate2

Python 13,270 1,120 Updated Jan 1, 2025

Lists (24)

2023CVPR

3D handpose estimation

ChatGPT

computer vision

Data_and_download

detection/segmentation

ECG

FoundationModelMedical

humanpose

IncSDA

life trick

LLM

LLM Inference Quant

LMM

Lung/Gastric cancer segmentation

Medical

OCR

popular

RAG

spider

stroke

video

VQA

Frontend
