A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files

Python 8,792 1,444 Updated Mar 4, 2025

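🚀 Reverse-engineered API for the DeepSeek-V3 / R1 large models [specialty: a conscientious vendor] (the official API is very cheap, so using it directly is recommended). Supports high-speed streaming output, multi-turn conversation, web search, R1 deep thinking, zero-configuration deployment, and multiple tokens. For testing only; for commercial use, please go to the official open platform.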

TypeScript 2,124 610 Updated Feb 14, 2025

PyTorch implementation of FractalGen https://arxiv.org/abs/2502.17437

Python 806 39 Updated Feb 25, 2025

DeepSeek Coder: Let the Code Write Itself

Python 20,819 2,323 Updated May 21, 2024
Python 455 43 Updated Feb 19, 2025
Python 3,481 324 Updated Feb 24, 2025

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 21,701 2,384 Updated Aug 12, 2024

[CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".

Python 277 1 Updated Mar 5, 2025

[ICLR 2025][arXiv:2406.07548] Image and Video Tokenization with Binary Spherical Quantization

Python 136 Updated Jun 12, 2024

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 8,372 584 Updated Mar 4, 2025

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 15,820 1,099 Updated Feb 28, 2025

Code for the paper: Why Transformers Need Adam: A Hessian Perspective

Jupyter Notebook 50 7 Updated Apr 26, 2024

A simple and efficient tool to parallelize Pandas operations on all available CPUs

Python 3,727 217 Updated Jul 9, 2024

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,691 277 Updated Feb 15, 2023

A Collection of BM25 Algorithms in Python

Python 1,116 93 Updated Oct 8, 2024

Retrieval and Retrieval-augmented LLMs

Python 8,783 641 Updated Mar 3, 2025

A word alignment tool based on famous GIZA++, extended to support multi-threading, resume training and incremental training.

C++ 161 60 Updated May 12, 2021

Giza++

C++ 12 13 Updated May 12, 2015

🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support

Python 8,432 1,045 Updated Mar 5, 2025

Language Models as Semantic Indexers (ICML 2024)

Python 27 1 Updated May 2, 2024

Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"

Python 6,280 762 Updated Feb 27, 2025

K-Means clustering - constrained with minimum and maximum cluster size. Documentation: https://joshlk.github.io/k-means-constrained

Jupyter Notebook 206 44 Updated Feb 6, 2025

My ultra-mini robotic arm project.

C 12,950 2,835 Updated Mar 14, 2024

Learning to Tokenize for Generative Retrieval (NeurIPS 2023)

Python 57 4 Updated Nov 3, 2024

Details on how to get Binance public data

Python 1,743 513 Updated Jan 9, 2025

SIGIR'21: Optimizing DR with hard negatives and achieving SOTA first-stage retrieval performance on TREC DL Track.

Python 131 14 Updated Feb 15, 2022

CIKM'21: JPQ substantially improves the efficiency of Dense Retrieval with 30x compression ratio, 10x CPU speedup and 2x GPU speedup.

Python 52 11 Updated Feb 19, 2022

Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,601 2,926 Updated Sep 2, 2024

Inference code for Llama models

Python 57,806 9,713 Updated Jan 26, 2025

[ACL 2024] Progressive LLaMA with Block Expansion.

Python 499 38 Updated May 20, 2024