Pinkstack committed
Commit 8081bac · verified · 1 Parent(s): a6ad3d8

Update README.md

Files changed (1): README.md (+3 -1)
README.md CHANGED
```diff
@@ -16,13 +16,15 @@ pipeline_tag: text-generation
 base_model:
 - allenai/OLMo-2-0425-1B
 ---
-Note: this is not a chat model, the chat model is coming soon but this is the base model for further fine-tuning.
+Note: this is not a chat model, the chat model is coming soon but this is the base model for further fine-tuning, stay tuned for the chat model release! This page will be updated once that model is out. (The chat model will be under a different repo)
 ![Thumbnail](https://cdn-uploads.huggingface.co/production/uploads/6710ba6af1279fe0dfe33afe/GeIinCTOzBfsgiqwQlKUY.png)
 
 # print("Before we start")
 We are not related to Roblox in any way, any mention of Roblox is purely to help people understand what the model is about.
 As per the [Roblox website](https://create.roblox.com/docs/assistant/guide), they use Meta's Llama 3 (we assume 70B) for their AI assistant. This model, while powerful, cannot come close to the performance of a 70B model.
 
+But unlike Llama 3, this model (luau-coder-v2-3b-32k) aka luaucoder for short is under an open apache 2.0 license.
+
 # print("Stages of pre-training")
 
 This model was continually pre-trained in 3 stages. (Note, allenai states that olmo 2 1B, which is the model this is based on was pre-trained on 4 trillion or so tokens.)
```
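Since the updated README describes this as a base (non-chat) model intended for further fine-tuning, a minimal usage sketch with Hugging Face `transformers` is shown below. The repo id `Pinkstack/luau-coder-v2-3b-32k` is an assumption inferred from the model name and committer on this page, not something confirmed by the commit; substitute the actual repository id.

```python
# Minimal sketch: load the base model and run plain text completion.
# Assumption: the repo id below is inferred from the model name and the
# committer shown above; replace it with the real repository id.
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "Pinkstack/luau-coder-v2-3b-32k"  # assumed repo id
tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(repo_id)

# Base model, not a chat model: prompt with raw text (no chat template).
prompt = "-- Luau: print a player's name when they join the game\n"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=64)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```

The same loading path is the starting point for further fine-tuning; for instruction or chat behavior, wait for the separate chat-model release mentioned in the diff.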