Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
justusmattern27 authored Dec 6, 2023
1 parent d4b5c9c commit 5179c4f
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion README.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,5 @@
# Mamba-Chat 🐍

Mamba-Chat is the first chat-based language model that is based on a state-space model architecture, and not a transformer.
Mamba-Chat is the first chat-based language model implemented with a state-space model architecture, and not a transformer.

The model is based on Albert Gu's and Tri Dao's work *Mamba: Linear-Time Sequence Modeling with Selective State Spaces* as well as their [model implementation](https://github.com/state-spaces/mamba). This repository provides training / fine-tuning code for the model based on some modifications of the Huggingface Trainer.

0 comments on commit 5179c4f

Please sign in to comment.