    Repositories list

    • Python
      1700 Updated Nov 28, 2024
    • Training SAEs for your LLM, and visualize it in one place
      Python
      Apache License 2.0
      0600 Updated Nov 4, 2024
    • MLLM_KE

      Public
      MLLM_KE
      0000 Updated Sep 9, 2024
    • SEATv2

      Public
      Python
      1100 Updated Sep 1, 2024
    • 57310 Updated Aug 22, 2024
    • CoreScheduler: A High-Performance Scheduler for Large Model Training
      C++
      Apache License 2.0
      6200 Updated Aug 21, 2024
    • Find the most efficient way for a specific large language model to learn a specific task
      Apache License 2.0
      1300 Updated Aug 19, 2024
    • FViT

      Public
      Jupyter Notebook
      1300 Updated Aug 18, 2024
    • SEAT

      Public
      Python
      1300 Updated Aug 18, 2024
    • FVLC

      Public
      Jupyter Notebook
      2400 Updated Aug 18, 2024
    • Tiny-DeepSpeed, a minimalistic re-implementation of the DeepSpeed library
      Python
      Apache License 2.0
      1200 Updated Aug 8, 2024
    • Jupyter Notebook
      1500 Updated May 10, 2024
    • adv-ntk

      Public
      [ICLR 2024] Official repository for "Theoretical Analysis of Robust Overfitting for Wide DNNs: An NTK Approach"
      Python
      MIT License
      1200 Updated Feb 4, 2024