Skip to content
View CZWin32768's full-sized avatar
🙂
?
🙂
?

Highlights

  • Pro

Block or report CZWin32768

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 8,869 635 Updated Jan 5, 2025

Our solution for the arc challenge 2024

Jupyter Notebook 82 5 Updated Dec 7, 2024

PyTorch implementation of normalizing flow models

Python 760 112 Updated Aug 25, 2024

Seamless operability between C++11 and Python

C++ 16,018 2,124 Updated Jan 7, 2025

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,001 5,543 Updated Jan 7, 2025
Jupyter Notebook 169 22 Updated Jan 16, 2024

Annotated Flow Matching paper

Jupyter Notebook 151 6 Updated Sep 14, 2024

The Abstraction and Reasoning Corpus

JavaScript 4,104 626 Updated Aug 4, 2024

PyTorch implementation for Score-Based Generative Modeling through Stochastic Differential Equations (ICLR 2021, Oral)

Jupyter Notebook 1,802 326 Updated Jul 14, 2024

MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models

Python 403 38 Updated Feb 1, 2024

High-Resolution Image Synthesis with Latent Diffusion Models

Jupyter Notebook 12,169 1,559 Updated Feb 29, 2024

Dev and Test Data of LogicGame benchmark

12 Updated Oct 10, 2024

Code for the paper "VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment"

Python 108 10 Updated Nov 11, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,664 597 Updated May 31, 2024

Collecting research materials on EBM/EBL (Energy Based Models, Energy Based Learning)

281 28 Updated Nov 24, 2023

DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models

Python 1,071 56 Updated Jan 16, 2024

A curated list of Large Language Model (LLM) Interpretability resources.

1,198 95 Updated Dec 21, 2024

AI算法岗求职攻略(涵盖准备攻略、刷题指南、内推和AI公司清单等资料)

5,332 642 Updated Apr 24, 2024

The official Meta Llama 3 GitHub site

Python 27,838 3,188 Updated Aug 12, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 967 80 Updated Jul 23, 2024

OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.

Python 4,438 468 Updated Jan 7, 2025

Code and data for the paper "Turning English-centric LLMs Into Polyglots: How Much Multilinguality Is Needed?"

Jupyter Notebook 23 2 Updated Dec 20, 2024

MultilingualSIFT: Multilingual Supervised Instruction Fine-tuning

Python 88 5 Updated Aug 15, 2023

Multilingual Large Language Models Evaluation Benchmark

Python 115 17 Updated Aug 21, 2024
Python 1,238 176 Updated Nov 20, 2024

ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward experts. We released a collection of ModuleFormer-based Languag…

Python 216 11 Updated Apr 10, 2024

Must-read Papers on LLM Agents.

2,014 110 Updated Nov 12, 2024

🌟 The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

Python 46,484 5,530 Updated Dec 18, 2024
Next