hzhwcmhf

Organizations

@thu-coai @QwenLM


Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

Python 65 3 Updated Nov 25, 2024

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 312 29 Updated Apr 20, 2024

ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.

Python 34 Updated Jun 24, 2024

[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

Python 20 Updated May 29, 2024

[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

Python 364 19 Updated Oct 16, 2024

Qwen2.5 is the large language model series developed by the Qwen team, Alibaba Cloud.

Shell 10,888 672 Updated Dec 4, 2024
Python 56 3 Updated Apr 2, 2024

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 363 14 Updated Jul 9, 2024

The official repo of Qwen-VL (通义千问-VL) chat & pretrained large vision language model proposed by Alibaba Cloud.

Python 5,187 392 Updated Aug 7, 2024

LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.

Python 2,687 215 Updated Dec 15, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,730 1,193 Updated Dec 12, 2024

Hide the MIUI clipboard dialog

Kotlin 13 2 Updated Jul 24, 2022

Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

Python 4,030 541 Updated May 23, 2024

Search all Chinese NLP datasets, with commonly used English NLP datasets included

Python 4,192 614 Updated Nov 21, 2022

A record of some datasets I have collected

1,013 132 Updated Jun 16, 2022

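Use ChatGPT to summarize arXiv papers. Accelerates the whole research workflow: uses ChatGPT for full-paper summarization, professional translation, polishing, reviewing, and review responses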

Python 18,574 1,938 Updated Apr 4, 2024

GPT4 & LangChain Chatbot for large PDF docs

TypeScript 14,975 3,019 Updated Jul 29, 2024

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 5,438 333 Updated Jul 21, 2024

Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.

Jupyter Notebook 1,541 94 Updated Feb 16, 2024

a large-scale Chinese parabank via machine translation

1 Updated Oct 30, 2022

Improving Non-autoregressive Generation with Mixup Training

Python 8 1 Updated Sep 5, 2022

Official Implementation for the ICML2022 paper "Directed Acyclic Transformer for Non-Autoregressive Machine Translation"

Python 121 18 Updated Sep 10, 2023

A concise but complete full-attention transformer with a set of promising experimental features from various papers

Python 4,897 424 Updated Dec 10, 2024

A simple experiment with SimCSE on Chinese tasks

Python 593 83 Updated Aug 7, 2023

Tracking the progress in non-autoregressive generation (translation, transcription, etc.)

306 28 Updated Mar 15, 2023

The entmax mapping and its loss, a family of sparse softmax alternatives.

Python 418 44 Updated Jun 22, 2024

CUDA kernels for generalized matrix-multiplication in PyTorch

Jupyter Notebook 79 13 Updated Oct 11, 2021

Development repository for the Triton language and compiler

C++ 13,691 1,683 Updated Dec 15, 2024

A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).

Python 2,680 160 Updated Aug 18, 2024