-
Purdue University
- https://yancong222.github.io/
- https://orcid.org/0000-0003-4571-9978
Lists (6)
Sort Name ascending (A-Z)
Starred repositories
🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
scikit-learn: machine learning in Python
A high-throughput and memory-efficient inference and serving engine for LLMs
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
💫 Industrial-strength Natural Language Processing (NLP) in Python
The official Python library for the OpenAI API
Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the most common NLP tasks.
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Machine Learning Engineering Open Book
100+ Chinese Word Vectors 上百种预训练中文词向量
Simple, unified interface to multiple Generative AI providers
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
The official GitHub page for the survey paper "A Survey of Large Language Models".
An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"
XLNet: Generalized Autoregressive Pretraining for Language Understanding
📊 A simple command-line utility for querying and monitoring GPU status
Entropy Based Sampling and Parallel CoT Decoding
Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Beautiful visualizations of how language differs among document types.
Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09110). This framework is also used to evaluate text-to-image …
For running psychology and neuroscience experiments