-
MSR
- Redmond, WA
Highlights
- Pro
-
-
Perplexica Public
Forked from ItzCrazyKns/PerplexicaPerplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
TypeScript MIT License UpdatedFeb 2, 2025 -
open-r1 Public
Forked from huggingface/open-r1Fully open reproduction of DeepSeek-R1
Python Apache License 2.0 UpdatedJan 30, 2025 -
micro_diffusion Public
Forked from SonyResearch/micro_diffusionOfficial repository for our work on micro-budget training of large-scale diffusion models.
Python Apache License 2.0 UpdatedJan 12, 2025 -
OpenRLHF Public
Forked from OpenRLHF/OpenRLHFAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Python Apache License 2.0 UpdatedJan 8, 2025 -
picotron Public
Forked from huggingface/picotronMinimalistic 4D-parallelism distributed training framework for education purpose
Python Apache License 2.0 UpdatedDec 20, 2024 -
streaming Public
Forked from LiyuanLucasLiu/streamingA Data Streaming Library for Efficient Neural Network Training
Python Apache License 2.0 UpdatedNov 28, 2024 -
torchtitan Public
Forked from pytorch/torchtitanA native PyTorch Library for large model training
Python BSD 3-Clause "New" or "Revised" License UpdatedSep 30, 2024 -
torchtune Public
Forked from pytorch/torchtuneA Native-PyTorch Library for LLM Fine-tuning
-
mt-dnn Public
Multi-Task Deep Neural Networks for Natural Language Understanding
-
mistral-src Public
Forked from mistralai/mistral-inferenceReference implementation of Mistral AI 7B v0.1 model.
Jupyter Notebook Apache License 2.0 UpdatedFeb 2, 2024 -
-
Python package built to ease deep learning on graph, on top of existing DL frameworks.
Python Apache License 2.0 UpdatedOct 10, 2023 -
llama.cpp Public
Forked from ggml-org/llama.cppPort of Facebook's LLaMA model in C/C++
C MIT License UpdatedAug 2, 2023 -
boardlaw Public
Forked from andyljones/boardlawScaling scaling laws with board games.
Python MIT License UpdatedJul 17, 2023 -
mlc-llm Public
Forked from mlc-ai/mlc-llmEnable everyone to develop, optimize and deploy AI models natively on everyone's devices.
Python Apache License 2.0 UpdatedMay 9, 2023 -
-
apex Public
Forked from NVIDIA/apexA PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch
Python BSD 3-Clause "New" or "Revised" License UpdatedApr 26, 2023 -
llama Public
Forked from meta-llama/llamaInference code for LLaMA models
Python GNU General Public License v3.0 UpdatedFeb 26, 2023 -
DiT Public
Forked from facebookresearch/DiTOfficial PyTorch Implementation of "Scalable Diffusion Models with Transformers"
Python Other UpdatedDec 22, 2022 -
-
promptsource Public
Forked from bigscience-workshop/promptsourceToolkit for creating, sharing and using natural language prompts.
Python Apache License 2.0 UpdatedAug 3, 2022 -
BIG-bench Public
Forked from google/BIG-benchBeyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models
Python Apache License 2.0 UpdatedJun 10, 2022 -
code_contests Public
Forked from google-deepmind/code_contestsStarlark Apache License 2.0 UpdatedFeb 3, 2022 -
ConvNeXt Public
Forked from facebookresearch/ConvNeXtCode release for ConvNeXt model
Python MIT License UpdatedJan 13, 2022 -
mae Public
Forked from facebookresearch/maePyTorch implementation of MAE https//arxiv.org/abs/2111.06377
Python Other UpdatedJan 6, 2022 -
DeCLUTR Public
Forked from JohnGiorgi/DeCLUTRThe corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!
Python Apache License 2.0 UpdatedOct 18, 2021 -
-
DPR Public
Forked from facebookresearch/DPRDense Passage Retriever - is a set of tools and models for open domain Q&A task.
Python Other UpdatedAug 31, 2021 -
KILT Public
Forked from facebookresearch/KILTLibrary for Knowledge Intensive Language Tasks
Python MIT License UpdatedJun 17, 2021