Unbabel
/

TowerInstruct-7B-v0.1

@@ -21,7 +21,7 @@ pipeline_tag: translation
 ### Model Description
-TowerInstruct is a language model that results from fine-tuning TowerBase on the TowerBlocks supervised fine-tuning dataset. TowerInstruct v0.1 is the first model in the series.
 The model is trained to handle several translation-related tasks, such as general machine translation (e.g., sentence- and document-level translation, terminology-aware translation, context-aware translation), automatic post edition, named-entity recognition, gramatical error correction, and paraphrase generation.
 We will release more details in the upcoming technical report.
@@ -29,7 +29,7 @@ We will release more details in the upcoming technical report.
 - **Model type:** A 7B parameter model fine-tuned on a mix of publicly available, synthetic datasets on translation-related tasks, as well as conversational datasets and code instructions.
 - **Language(s) (NLP):** English, Portuguese, Spanish, French, German, Dutch, Italian, Korean, Chinese, Russian
 - **License:** CC-BY-NC-4.0
-- **Finetuned from model:** TowerBase
 ## Intended uses & limitations
@@ -45,7 +45,7 @@ The model was initially fine-tuned on a filtered and preprocessed supervised fin
 - Synthetic Chat data
 - Code instructions
-You can find the dataset and all data sources of TowerBlocks here.
 Here's how you can run the model using the `pipeline()` function from 🤗 Transformers:
@@ -95,37 +95,15 @@ TowerInstruct-v0.1 was trained using the ChatML prompt templates without any sys
 ### Supervised tasks
-- Translation
-```
-Translate the following text from $SRC_LANG into $TGT_LANG.
-$SRC_LANG: $SRC_TEXT
-$TGT_LANG: # make sure to add a white space the target placeholder "$TGT_LANG:" for best results
-```
-- Automatic Post Edition
-```
-Translate the following text from $SRC_LANG into $TGT_LANG.
-$SRC_LANG: $SRC_TEXT
-$TGT_LANG:
-```
-- Machine Translation Evaluation
-- Context-aware Translation
-- Terminology-aware Translation
-- Multi-reference Translation
-- Named-entity Recognition
-- Paraphrase Generation
-- Synthetic Chat data
-- Code instructions
 [More Information Needed]
 ## Training Details
 ### Training Data
-Link to TowerBlocks.
-### Training Procedure
-Write sth about Axolotl.
 #### Training Hyperparameters

 ### Model Description
+TowerInstruct-7B is a language model that results from fine-tuning TowerBase on the TowerBlocks supervised fine-tuning dataset. TowerInstruct-7B-v0.1 is the first model in the series.
 The model is trained to handle several translation-related tasks, such as general machine translation (e.g., sentence- and document-level translation, terminology-aware translation, context-aware translation), automatic post edition, named-entity recognition, gramatical error correction, and paraphrase generation.
 We will release more details in the upcoming technical report.
 - **Model type:** A 7B parameter model fine-tuned on a mix of publicly available, synthetic datasets on translation-related tasks, as well as conversational datasets and code instructions.
 - **Language(s) (NLP):** English, Portuguese, Spanish, French, German, Dutch, Italian, Korean, Chinese, Russian
 - **License:** CC-BY-NC-4.0
+- **Finetuned from model:** TowerBase [ADD LINK]
 ## Intended uses & limitations
 - Synthetic Chat data
 - Code instructions
+You can find the dataset and all data sources of TowerBlocks [ADD LINK] here.
 Here's how you can run the model using the `pipeline()` function from 🤗 Transformers:
 ### Supervised tasks
+The prompts for all supervised tasks can be found in TowerBlocks [ADD LINK]. We have used multiple prompt templates for each task. While different prompts may offer different outputs, the difference in downstream performance should be very minimal.
 [More Information Needed]
 ## Training Details
 ### Training Data
+Link to TowerBlocks [ADD LINK].
 #### Training Hyperparameters