Skip to content

lee20/llama_int8

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

使用SmoothQuant高性能实现llama2 openMP、AVX2

使用方法

编译方法

cmake build cd build make

运行方法

./main

About

纯手写高性能实现llama,本地推理

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published