Tarek07 committed on
Commit b742cf3 · verified · 1 Parent(s): f041e4e

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ tags:
 - merge
 license: llama3.3
 ---
- My Thalassic series includes my favorite Llama 3 models with Deepseek R1, using various methods I made 4 merges, of which Thalassic Delta, which I made with an SCE merge, was the best of the set. I actually got some advice from Steelskull about how I had my top k parameter way too high. So I decided to lower it for this merge. Now because I feel Delta was still a decent model with the top k on 1, I halved the top k parameter to 0.50 which is basically the max recommended setting.
+ My Thalassic series includes my favorite Llama 3 models with Deepseek R1, using various methods I made 4 merges. Thalassic Delta, which I made with an SCE merge, was the best of the set. I actually got some advice from Steelskull about how I had my top k parameter way too high. So I decided to lower it for this merge. Now because I feel Delta was still a decent model with the top k on 1, I halved the top k parameter to 0.50 which is basically the max recommended setting.
 # merge
 
 This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
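For readers who want to see how the change described above would appear in practice, below is a minimal sketch of a mergekit SCE configuration with the top-k fraction halved to 0.50. This is not the actual Thalassic recipe: the model names, base model, and dtype are placeholders, and it assumes mergekit's `sce` method exposes the top-k fraction as `select_topk`.

```yaml
# Hypothetical mergekit config illustrating an SCE merge with the
# top-k fraction lowered from 1.0 to 0.50, as described in the commit.
# All model names below are placeholders, not the actual merge inputs.
models:
  - model: example-org/llama-3.3-finetune-a        # placeholder constituent model
  - model: example-org/llama-3.3-finetune-b        # placeholder constituent model
  - model: example-org/deepseek-r1-llama-distill   # placeholder R1-based component
merge_method: sce
base_model: example-org/llama-3.3-base             # placeholder base model
parameters:
  select_topk: 0.50   # halved from 1.0 per the description above
dtype: bfloat16
```

A config in this style is typically run with mergekit's `mergekit-yaml` command, pointing it at the config file and an output directory for the merged model.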