uzabase
/

LLM2Vec-Swallow-7b-hf-wikipedia-jp-mntp-unsup-simcse

Model card Files Files and versions Community

h-iida commited on Sep 12, 2024

Commit

b629e33

·

verified ·

1 Parent(s): d6d2684

Update README.md

Files changed (1) hide show

README.md +13 -7

README.md CHANGED Viewed

@@ -20,18 +20,24 @@ For the MNTP Adapter, please refer to [this link](https://huggingface.co/uzabase
 - **License:** Apache2.0
 - **Finetuned from model:** [Swallow-7b-hf](https://huggingface.co/tokyotech-llm/Swallow-7b-hf)
-### Model Sources [optional]
 - **Repository:**  https://github.com/McGill-NLP/llm2vec
 - **Paper:** https://arxiv.org/abs/2404.05961
-## Usage
 - Please see [original LLM2Vec repo](https://huggingface.co/McGill-NLP/LLM2Vec-Llama-2-7b-chat-hf-mntp-unsup-simcse#usage)
-## Training Details
-### Training Data
 - Make Corpus from SimCSE from [Wikipedia](https://huggingface.co/datasets/wikimedia/wikipedia)
 - Script for making SimCSE Corpus
@@ -74,7 +80,7 @@ if __name__ == "__main__":
-#### Training Hyperparameter
 - simcse_dropout: 0.3
 - bidirectional: true
 - pooling_mode: "mean"
@@ -92,7 +98,7 @@ if __name__ == "__main__":
 - gradient_checkpointing: true
-#### Accelerator Settings
 - deepspeed_config:
   - gradient_accumulation_steps: 1
   - gradient_clipping: 1.0
@@ -118,7 +124,7 @@ if __name__ == "__main__":
 - quse_cpu: false
-### Framework versions
 - Python: 3.12.3
 - PEFT 0.11.1

 - **License:** Apache2.0
 - **Finetuned from model:** [Swallow-7b-hf](https://huggingface.co/tokyotech-llm/Swallow-7b-hf)
+### Model Sources
 - **Repository:**  https://github.com/McGill-NLP/llm2vec
 - **Paper:** https://arxiv.org/abs/2404.05961
+# Usage
 - Please see [original LLM2Vec repo](https://huggingface.co/McGill-NLP/LLM2Vec-Llama-2-7b-chat-hf-mntp-unsup-simcse#usage)
+# Benchmark
+= Followings are summaries. Details are [here]()
+## MTEB(Japanese)
+## MTEB(English)
+# Training Details
+## Training Data
 - Make Corpus from SimCSE from [Wikipedia](https://huggingface.co/datasets/wikimedia/wikipedia)
 - Script for making SimCSE Corpus
+## Training Hyperparameter
 - simcse_dropout: 0.3
 - bidirectional: true
 - pooling_mode: "mean"
 - gradient_checkpointing: true
+## Accelerator Settings
 - deepspeed_config:
   - gradient_accumulation_steps: 1
   - gradient_clipping: 1.0
 - quse_cpu: false
+## Framework versions
 - Python: 3.12.3
 - PEFT 0.11.1