🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Official implementation of Stochastic Taylor Derivative Estimator (STDE), NeurIPS 2024
Paper: Alignment at Pre-training! Towards Native Alignment for Arabic LLMs
Tiny evaluation of leading LLMs on competitive programming problems
BigCodeBench: Benchmarking Code Generation Towards AGI
AnchorAttention: Improved attention for LLMs long-context training
Aioli: A unified optimization framework for language model data mixing
Adversarial attacks on comparative assessment with Large Language Models
Every practical and proposed defense against prompt injection.
Muon optimizer for neural networks: >30% extra sample efficiency, <3% wallclock overhead
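"Hello Algo": a data structures and algorithms tutorial with animated illustrations and one-click runnable code. Supports Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, and Dart. The Simplified and Traditional Chinese versions are updated in sync; English version ongoing
"代码随想录" LeetCode practice guide: a recommended order for 200 classic problems, 600,000 characters of detailed illustrated explanations, video walkthroughs of the hard parts, and more than 50 mind maps. Supports C++, Java, Python, Go, JavaScript, and other languages, so learning algorithms is no longer confusing! 🔥🔥 Take a look, you will wish you had found it sooner! 🚀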
The Open Cookbook for Top-Tier Code Large Language Model
Anthropic's educational courses
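Essential for algorithm interviews: recommended problem-practice sites. A top Peking University student's "LeetCode Problem-Solving Templates"; add on WeChat to receive it: jiuzhangfeifei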
Effective Data Augmentation With Diffusion Models
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
Python Sorted Container Types: Sorted List, Sorted Dict, and Sorted Set
Solving algorithm problems is all about patterns, and labuladong is all you need! English version supported! Crack LeetCode, not only how, but also why.