Skip to content

Running large language models on a single GPU for throughput-oriented scenarios.

Notifications You must be signed in to change notification settings

electron-shaders/FlexGen

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

About

Running large language models on a single GPU for throughput-oriented scenarios.

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 97.1%
  • Shell 2.9%