
The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 14,947 1,207 Updated Dec 12, 2024

Large Concept Models: Language modeling in a sentence representation space

Python 765 56 Updated Dec 16, 2024

A generative world for general-purpose robotics & embodied AI learning.

Python 19,152 1,397 Updated Dec 25, 2024

Weather30K & WeatherNet

5 Updated Dec 8, 2024

Official PyTorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

Python 752 37 Updated Dec 17, 2024

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 507 39 Updated Dec 16, 2024

Beamer template with CUHK colors and logos

TeX 28 15 Updated Aug 22, 2021

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and WebKit with a single API.

TypeScript 68,008 3,747 Updated Dec 24, 2024

Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)

Python 11,605 1,018 Updated Dec 23, 2024

ToolkenGPT: Augmenting Frozen Language Models with Massive Tools via Tool Embeddings - NeurIPS 2023 (oral)

Python 240 23 Updated Apr 18, 2024

Stick-breaking attention

Python 38 1 Updated Dec 23, 2024

Hammer: Robust Function-Calling for On-Device Language Models via Function Masking

Python 42 3 Updated Dec 13, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,352 163 Updated Jun 25, 2024

Educational framework exploring ergonomic, lightweight multi-agent orchestration. Managed by OpenAI Solution team.

Python 17,141 1,741 Updated Oct 15, 2024

Constrained Decoding for LLMs against JSON Schema

Python 322 8 Updated May 16, 2023

Accelerate and optimize performance with streamlined training and serving options with JAX.

Python 212 24 Updated Dec 24, 2024

[ACL 2024] An Easy-to-use Knowledge Editing Framework for LLMs.

Jupyter Notebook 1,996 241 Updated Dec 23, 2024

Python 262 14 Updated Jul 28, 2024

Use RLHF directly on ChatGLM to raise or lower the probability of target outputs | Modify ChatGLM output with only RLHF

Python 190 26 Updated May 23, 2023

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 4,909 445 Updated Dec 25, 2024

OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

Jupyter Notebook 3,229 256 Updated Dec 14, 2024

Notes on the knowledge and interview questions relevant to large language model (LLM) algorithm (application) engineers

HTML 4,347 505 Updated Oct 22, 2024

XuanYuan: Du Xiaoman's Chinese financial dialogue large language model

Python 1,091 101 Updated Sep 26, 2024

Ongoing research training transformer language models at scale, including: BERT & GPT-2

Python 1,346 221 Updated Mar 20, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 11,157 681 Updated Dec 24, 2024

DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financ…

Python 631 73 Updated Nov 1, 2023

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 36,021 4,506 Updated Nov 18, 2024

A curated list of Human Preference Datasets for LLM fine-tuning, RLHF, and eval.

325 14 Updated Oct 4, 2023

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Jupyter Notebook 6,418 427 Updated Dec 22, 2024