Skip to content
View sosofun's full-sized avatar

Block or report sosofun

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
  • SWE-agent Public

    Forked from SWE-agent/SWE-agent

    SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It solves 12.47% of bugs in the SWE-bench evaluation set and takes just 1 minute to run.

    Python MIT License Updated Jul 1, 2024
  • code-act Public

    Forked from xingyaoww/code-act

    Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhang, Yunzhu Li, Hao Peng, Heng Ji.

    Python MIT License Updated May 23, 2024
  • Online-RLHF Public

    Forked from RLHFlow/Online-RLHF
    Python Updated May 18, 2024
  • gemma-sft Public

    Forked from yongzhuo/gemma-sft

    Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)

    Python Apache License 2.0 Updated Feb 27, 2024
  • Finetune Gemma for Chinese

    Jupyter Notebook Apache License 2.0 Updated Feb 26, 2024
  • 最好用的北京联通、北京移动IPTV频道列表。https://bjiptv.gq/

    HTML Creative Commons Zero v1.0 Universal Updated Jan 20, 2024
  • MAmmoTH Public

    Forked from TIGER-AI-Lab/MAmmoTH

    This repo contains the code and data for "MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning"

    Jupyter Notebook Updated Jan 17, 2024
  • Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

    Other Updated Dec 31, 2023
  • Go Updated Dec 10, 2023
  • qmoe Public

    Forked from IST-DASLab/qmoe

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python Apache License 2.0 Updated Oct 28, 2023
  • Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python Other Updated Oct 12, 2023
  • ⭐️ NLP Algorithms with transformers lib. Supporting Text-Classification, Text-Generation, Information-Extraction, Text-Matching, RLHF, SFT etc.

    Jupyter Notebook Updated Sep 29, 2023
  • Langchain-Chatchat(原Langchain-ChatGLM)基于 Langchain 与 ChatGLM 等语言模型的本地知识库问答 | Langchain-Chatchat (formerly langchain-ChatGLM), local knowledge based LLM (like ChatGLM) QA app with langchain

    Python Apache License 2.0 Updated Sep 24, 2023
  • yarn Public

    Forked from jquesnelle/yarn

    YaRN: Efficient Context Window Extension of Large Language Models

    Python MIT License Updated Sep 4, 2023
  • DeepSpeed Public

    Forked from microsoft/DeepSpeed

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    Python Apache License 2.0 Updated Aug 11, 2023
  • loguru Public

    Forked from Delgan/loguru

    Python logging made (stupidly) simple

    Python MIT License Updated Aug 7, 2023
  • rerope Public

    Forked from bojone/rerope

    Rectified Rotary Position Embeddings

    Python Updated Aug 7, 2023
  • 🧑‍🏫 59 Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), gans(cycleg…

    Jupyter Notebook MIT License Updated Jun 30, 2023
  • ColossalAI Public

    Forked from hpcaitech/ColossalAI

    Making big AI models cheaper, easier, and more scalable

    Python 1 Apache License 2.0 Updated May 10, 2023
  • 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch

    Python MIT License Updated Mar 13, 2023
  • rust-course Public

    Forked from sunface/rust-course

    “连续六年成为全世界最受喜爱的语言,无 GC 也无需手动内存管理、极高的性能和安全性、过程/OO/函数式编程、优秀的包管理、JS 未来基石" — 工作之余的第二语言来试试 Rust 吧。<<Rust语言圣经>>拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容,甚至还有JS程序员关注的 WASM 和 Deno 等专题。这可能是目前最用心的 Rust 中文学习教程 / Book

    Rust Updated Oct 18, 2022
  • PERSIA Public

    Forked from PersiaML/PERSIA

    High performance distributed framework for training deep learning recommendation models based on PyTorch.

    Rust MIT License Updated Dec 14, 2021
  • 基于vite+vue3+gin搭建的开发基础平台,集成jwt鉴权,权限管理,动态路由,分页封装,多点登录拦截,资源权限,上传下载,代码生成器,表单生成器等开发必备功能,五分钟一套CURD前后端代码。

    Go Apache License 2.0 Updated Dec 10, 2021
  • A curated list of the latest breakthroughs in AI by release date with a clear video explanation, link to a more in-depth article, and code. [work in progress]

    MIT License Updated Dec 5, 2021
  • a web project template based on fastapi and sqlalchemy2

    Python Updated Nov 28, 2021
  • croc Public

    Forked from schollz/croc

    Easily and securely send things from one computer to another 🐊 📦

    Go MIT License Updated Nov 18, 2021
  • 🚀✨ Help beginners to contribute to open source projects

    MIT License Updated Nov 4, 2021
  • rustlings Public

    Forked from rust-lang/rustlings

    🦀 Small exercises to get you used to reading and writing Rust code!

    Rust MIT License Updated Nov 3, 2021
  • X6 Public

    Forked from antvis/X6

    🚀 JavaScript diagramming library that uses SVG and HTML for rendering.

    TypeScript MIT License Updated Nov 1, 2021
  • joplin Public

    Forked from laurent22/joplin

    Joplin - an open source note taking and to-do application with synchronization capabilities for Windows, macOS, Linux, Android and iOS. Forum: https://discourse.joplinapp.org/

    TypeScript Other Updated Nov 1, 2021