Skip to content

Tags: mdf-git/llama.cpp

Tags

b4567

Toggle b4567's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature.
AMD: parse the architecture as supplied by gcnArchName (ggerganov#11244)

The value provided by minor doesn't include stepping for AMD, parse the value returned by gcnArchName instead to retrieve an accurate ID.

master-3cd8dde

Toggle master-3cd8dde's commit message
Revert "Fix memory allocation issues and seg faults"

This reverts commit 4870e45.

Will provide the correct fix later

master-f7dc43b

Toggle master-f7dc43b's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fix instruct mode broken by PR ggerganov#354 (ggerganov#409)

Co-authored-by: Johnman <tjohnman@github>

master-ea10d3d

Toggle master-ea10d3d's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Command line args bounds checking (ggerganov#424)

* command line args bounds checking

* unknown and invalid param exit codes 0 -> 1

master-e4412b4

Toggle master-e4412b4's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
CI: CMake: Separate build and test steps (ggerganov#376)

* CI: Separate Build and Test steps (CMake)

* CI: Make sure build passes before running tests (CMake)

* CI: Standardise step id names

master-ad072fc

Toggle master-ad072fc's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fix GPTQ converter (ggerganov#423)

* Fix GPTQ converter

* Fix comment

---------

Co-authored-by: Georgi Gerganov <[email protected]>

master-a140219

Toggle master-a140219's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Fix Makefile echo escape codes (by removing them). (ggerganov#418)

master-4870e45

Toggle master-4870e45's commit message

Verified

This commit was signed with the committer’s verified signature.
ggerganov Georgi Gerganov
Fix memory allocation issues and seg faults

master-483bab2

Toggle master-483bab2's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Avoid the transposed X branch in the Z = X * Y matrix multiplication (g…

…gerganov#439)

Should make results reproducible for different number of threads and batch sizes

master-2e17dfd

Toggle master-2e17dfd's commit message

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Replace EOS with newline to prevent context/memory being flushed by E…

…OS in interactive mode (ggerganov#333)

* Improve interactive mode's coherence after EOS

Aims to improve coherence and ability to resume the interactive session when the user is given input back after an end of text token is reached.
Not sure what token 13 is or why it seems to help. See conversation for examples.

* Make newline token a constant

* dynamically determine newline token

* relocate previous newline token const

* cleanup whitespace

* print a new line on end of text in interactive

this may need to be looked into further when not using a reverse prompt

* only print manual newline with reverse prompt

fix formatting of reverse prompts so they don't end up at the end of the current line while not introducing unnecessary new lines otherwise

* alternate approach to replace end of text tokens

* Inject the reverse prompt again after eos in interactive mode

* tokenize reverse prompt when needed

makes this PR compatible with ggerganov#330

* tokenize and inject only first reverse prompt

thanks to tjohnman

* tokenize first reverse prompt once

* add newline token

* add newline token

* tokenize/inject reverse prompt for refactor

this doesn't seem right though

* tokenize nothing for antiprompt if no reverse

* Update main.cpp

* Update main.cpp

* tokenize and inject reverse prompt as needed

this doesn't seem to work if the reverse prompt is tokenized outside earlier on

* not needed

* remove newline token

* remove newline token

* tokenize newline token

* add space to comment

* Update main.cpp

Co-authored-by: Georgi Gerganov <[email protected]>

---------

Co-authored-by: Slaren <[email protected]>
Co-authored-by: Georgi Gerganov <[email protected]>