USER: prompt
ASSISTANT:
```
## Context length with GGML

The base Airoboros GPT4 models have an increased context length of 4096.

However, this GGML conversion appears to still have the default 2048 context.

I have experimented with llama.cpp's `-n 4096` parameter to specify a context of 4096, but so far it always results in gibberish output.

I will investigate this further and will upload a corrected model if that proves necessary.

For now, please assume this GGML model has a context of 2048.
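As a rough sketch (not tested instructions from this repo), a llama.cpp invocation using the default 2048 context might look like the following; the model filename and prompt are placeholders:

```shell
# Hypothetical example: run a GGML model in llama.cpp with the default
# 2048-token context (-c 2048). The model filename is a placeholder.
./main -m airoboros-gpt4.ggmlv3.q4_0.bin \
  -c 2048 \
  -n 512 \
  -p "USER: Write a haiku about llamas. ASSISTANT:"
```

Here `-c` sets the context size, `-n` the number of tokens to generate, and `-p` the prompt in the USER/ASSISTANT format shown above.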
## THE FILES IN MAIN BRANCH REQUIRE THE LATEST LLAMA.CPP (May 19th 2023 - commit 2d5db48)!

llama.cpp recently made another breaking change to its quantisation methods - https://github.com/ggerganov/llama.cpp/pull/1508