soksof committed
Commit 5663382 · verified · 1 Parent(s): f4f6fd1

Update README.md

Files changed (1)
  1. README.md +18 -3
README.md CHANGED
@@ -5,7 +5,7 @@ license: llama3.1
 # Llama-Krikri-8B: A large foundation Language Model for the Greek language
 
 Following the release of [Meltemi-7B](https://huggingface.co/ilsp/Meltemi-7B-v1) on the 26th March 2024 we are happy to welcome Krikri to the family of ILSP open Greek LLMs.
- Krikri is built on top of [Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B), extending its capabilities for Greek through continual pretraining on a large corpus of high-quality and locally relevant Greek texts. We present Llama-Krikri-8B-Base, as well as an instruct version [Llama-Krikri-8b-Instruct](https://huggingface.co/ilsp/Llama-Krikri-8B-instruct).
+ Krikri is built on top of [Llama-3.1-8B](https://huggingface.co/meta-llama/Llama-3.1-8B), extending its capabilities for Greek through continual pretraining on a large corpus of high-quality and locally relevant Greek texts. We present Llama-Krikri-8B-Base, as well as an instruct version [Llama-Krikri-8B-Instruct](https://huggingface.co/ilsp/Llama-Krikri-8B-instruct).
 
 ![image/png](llama-krikri-image.jpg)
 
@@ -31,9 +31,24 @@ Krikri is built on top of [Llama-3.1-8B](https://huggingface.co/meta-llama/Llama
 Chosen subsets of the 89.65 billion corpus were upsampled resulting in a size of 110 billion tokens.
 
 
- # Usage
+ # How to use
 
- Please make sure that the BOS token is always included in the tokenized prompts. This might not be the default setting in all evaluation or fine-tuning frameworks.
+
+ ```python
+ from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ device = "cuda"
+
+ model = AutoModelForCausalLM.from_pretrained("ilsp/Llama-Krikri-8B-Base")
+ tokenizer = AutoTokenizer.from_pretrained("ilsp/Llama-Krikri-8B-Base")
+
+ model.to(device)
+
+ input_text = tokenizer("Ποιες είναι οι διαφορές ανάμεσα σε ένα λάμα και ένα κρικρί", return_tensors='pt').to(device)
+ outputs = model.generate(input_text['input_ids'], max_new_tokens=256, do_sample=True)
+
+ print(tokenizer.batch_decode(outputs)[0])
+ ```
 
 
  # Evaluation
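
Note on the removed "# Usage" section: the old text reminded users that the BOS token must always be included in tokenized prompts and that some evaluation or fine-tuning frameworks do not do this by default; the commit replaces it with the runnable snippet above, whose example prompt asks, in Greek, "What are the differences between a llama and a kri-kri?". For anyone who still wants the old sanity check, here is a minimal sketch, assuming the stock Hugging Face `transformers` tokenizer API; the assertion and the example prompt are illustrative additions, not part of the model card.

```python
from transformers import AutoTokenizer

# Load the Krikri base tokenizer (same checkpoint as in the snippet above).
tokenizer = AutoTokenizer.from_pretrained("ilsp/Llama-Krikri-8B-Base")

# Tokenize a prompt; Llama-3.1-style tokenizers normally prepend the BOS token
# (<|begin_of_text|>) when add_special_tokens=True (the default), but some
# frameworks override this behaviour.
enc = tokenizer("Καλημέρα", return_tensors="pt")  # Greek for "Good morning"
first_id = enc["input_ids"][0, 0].item()

# Fail loudly if the BOS token is missing, as the removed README note warned.
assert first_id == tokenizer.bos_token_id, "BOS token missing from the tokenized prompt"
print("BOS token present:", tokenizer.convert_ids_to_tokens(first_id))
```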