Skip to content

Commit

Permalink
bump mem allocation a bit
Browse files Browse the repository at this point in the history
  • Loading branch information
apage43 authored and manyoso committed Jul 14, 2023
1 parent 936dcd2 commit 1c4a244
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion gpt4all-backend/bert.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -871,7 +871,7 @@ struct bert_ctx * bert_load_from_file(const char *fname)

// TODO: Max tokens should be a param?
int32_t N = new_bert->model.hparams.n_max_tokens;
new_bert->mem_per_input = 1.9 * (new_bert->mem_per_token * N); // add 10% to account for ggml object overhead
new_bert->mem_per_input = 2.2 * (new_bert->mem_per_token * N); // add 10% to account for ggml object overhead

}
#if defined(DEBUG_BERT)
Expand Down

0 comments on commit 1c4a244

Please sign in to comment.