-
Characterization of Large Language Model Development in the Datacenter:https://arxiv.org/pdf/2403.07648
-
MegaScale: Scaling Large Language Model Training to More Than 10,000 GPUs:https://www.usenix.org/system/files/nsdi24-jiang-ziheng.pdf
llm-train
Folders and files
Name | Name | Last commit date | ||
---|---|---|---|---|
parent directory.. | ||||