We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
纯手写高性能实现llama,本地推理
使用SmoothQuant高性能实现llama2 openMP、AVX2
使用方法
编译方法
cmake build cd build make
运行方法
./main