Stars
5
results
for source starred repositories
written in C
Clear filter
fastLLaMa: An experimental high-performance framework for running Decoder-only LLMs with 4-bit quantization in Python using a C/C++ backend.
Filesystem (fuse) implemented on Mosso's Cloud Files