Skip to content
View yingfeng's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@izenecloud @deepinsight @deepfabric @infiniflow

Block or report yingfeng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 8 Updated Jan 22, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,365 290 Updated Feb 18, 2025

[NeurIPS'24 Spotlight, ICLR'25] To speed up Long-context LLMs' inference, approximate and dynamic sparse calculate the attention, which reduces inference latency by up to 10x for pre-filling on an …

Python 913 46 Updated Feb 12, 2025

Skyrise is a research project exploring data processing on elastic cloud resources.

C++ 8 Updated Jan 6, 2025

EuroSys '24: "Trinity: A Fast Compressed Multi-attribute Data Store"

C++ 17 5 Updated May 1, 2024

[ICLR2025 Spotlight] MagicPIG: LSH Sampling for Efficient LLM Generation

Python 184 14 Updated Dec 16, 2024

10x Faster Long-Context LLM By Smart KV Cache Optimizations

Python 468 50 Updated Feb 18, 2025
Python 70 6 Updated Nov 25, 2024

OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models

Python 1,637 128 Updated Jan 17, 2025
C++ 5 1 Updated Jan 19, 2025

TAG-Bench: A benchmark for table-augmented generation (TAG)

Python 672 76 Updated Feb 18, 2025

A throughput-oriented high-performance serving framework for LLMs

Cuda 737 29 Updated Sep 21, 2024

Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.

Python 1,533 99 Updated Feb 14, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 9,974 612 Updated Feb 18, 2025

Generative AI extensions for onnxruntime

C++ 614 152 Updated Feb 18, 2025

[EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction

Python 3,740 704 Updated Feb 13, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,482 127 Updated Feb 18, 2025

GNN-RAG: Graph Neural Retrieval for Large Language Modeling Reasoning

Python 284 53 Updated Jun 12, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,674 503 Updated Feb 14, 2025
Python 30 3 Updated Jul 23, 2024

StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation

Python 9 1 Updated Jul 1, 2024

Minimal keyword extraction with BERT

Python 3,702 362 Updated Feb 18, 2025

Faster Learned Sparse Retrieval with Block-Max Pruning. ACM SIGIR 2024.

Rust 20 1 Updated Feb 8, 2025

Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"

Python 198 22 Updated Nov 1, 2024

Using GPT to organize and access information, and generate questions. Long term goal is to make an agent-like research assistant.

Jupyter Notebook 669 53 Updated Dec 20, 2023

[Paper List] Papers integrating knowledge graphs (KGs) and large language models (LLMs)

1,716 129 Updated Jan 3, 2025

Official Implementation of NeurIPS 2024 paper "G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering""

Python 392 66 Updated Nov 15, 2024

State-of-the-Art Text Embeddings

Python 15,995 2,545 Updated Feb 14, 2025

Daichi Amagata, ICDE2024

C++ 1 Updated Jan 22, 2025
Next