namisan
  • MSR
  • Redmond, WA

Highlights

  • Pro

  • Python MIT License Updated Feb 14, 2025
  • Perplexica is an AI-powered search engine. It is an open-source alternative to Perplexity AI.

    TypeScript MIT License Updated Feb 2, 2025
  • open-r1 Public

    Forked from huggingface/open-r1

    Fully open reproduction of DeepSeek-R1

    Python Apache License 2.0 Updated Jan 30, 2025
  • Official repository for our work on micro-budget training of large-scale diffusion models.

    Python Apache License 2.0 Updated Jan 12, 2025
  • OpenRLHF Public

    Forked from OpenRLHF/OpenRLHF

    An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

    Python Apache License 2.0 Updated Jan 8, 2025
  • picotron Public

    Forked from huggingface/picotron

    Minimalistic 4D-parallelism distributed training framework for educational purposes

    Python Apache License 2.0 Updated Dec 20, 2024
  • A Data Streaming Library for Efficient Neural Network Training

    Python Apache License 2.0 Updated Nov 28, 2024
  • torchtitan Public

    Forked from pytorch/torchtitan

    A native PyTorch Library for large model training

    Python BSD 3-Clause "New" or "Revised" License Updated Sep 30, 2024
  • torchtune Public

    Forked from pytorch/torchtune

    A Native-PyTorch Library for LLM Fine-tuning

    Python 1 BSD 3-Clause "New" or "Revised" License Updated Apr 21, 2024
  • mt-dnn Public

    Multi-Task Deep Neural Networks for Natural Language Understanding

    Python 2,246 412 MIT License Updated Mar 7, 2024
  • Reference implementation of Mistral AI 7B v0.1 model.

    Jupyter Notebook Apache License 2.0 Updated Feb 2, 2024
  • Apache License 2.0 Updated Jan 30, 2024
  • dgl Public

    Forked from dmlc/dgl

    Python package built to ease deep learning on graphs, on top of existing DL frameworks.

    Python Apache License 2.0 Updated Oct 10, 2023
  • llama.cpp Public

    Forked from ggml-org/llama.cpp

    Port of Facebook's LLaMA model in C/C++

    C MIT License Updated Aug 2, 2023
  • boardlaw Public

    Forked from andyljones/boardlaw

    Scaling scaling laws with board games.

    Python MIT License Updated Jul 17, 2023
  • mlc-llm Public

    Forked from mlc-ai/mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

    Python Apache License 2.0 Updated May 9, 2023
  • Apache License 2.0 Updated May 6, 2023
  • apex Public

    Forked from NVIDIA/apex

    A PyTorch Extension: Tools for easy mixed precision and distributed training in PyTorch

    Python BSD 3-Clause "New" or "Revised" License Updated Apr 26, 2023
  • llama Public

    Forked from meta-llama/llama

    Inference code for LLaMA models

    Python GNU General Public License v3.0 Updated Feb 26, 2023
  • DiT Public

    Forked from facebookresearch/DiT

    Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

    Python Other Updated Dec 22, 2022
  • whisper Public

    Forked from openai/whisper

    Jupyter Notebook MIT License Updated Sep 22, 2022
  • Toolkit for creating, sharing and using natural language prompts.

    Python Apache License 2.0 Updated Aug 3, 2022
  • BIG-bench Public

    Forked from google/BIG-bench

    Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

    Python Apache License 2.0 Updated Jun 10, 2022
  • Starlark Apache License 2.0 Updated Feb 3, 2022
  • Code release for ConvNeXt model

    Python MIT License Updated Jan 13, 2022
  • mae Public

    Forked from facebookresearch/mae

    PyTorch implementation of MAE https://arxiv.org/abs/2111.06377

    Python Other Updated Jan 6, 2022
  • DeCLUTR Public

    Forked from JohnGiorgi/DeCLUTR

    The corresponding code from our paper "DeCLUTR: Deep Contrastive Learning for Unsupervised Textual Representations". Do not hesitate to open an issue if you run into any trouble!

    Python Apache License 2.0 Updated Oct 18, 2021
  • exdeep-nmt Public

    32 Updated Sep 27, 2021
  • DPR Public

    Forked from facebookresearch/DPR

    Dense Passage Retriever is a set of tools and models for open-domain Q&A tasks.

    Python Other Updated Aug 31, 2021
  • KILT Public

    Forked from facebookresearch/KILT

    Library for Knowledge Intensive Language Tasks

    Python MIT License Updated Jun 17, 2021