AhmedZaky1
/

DIMI-embedding-v4

@@ -15,28 +15,22 @@ widget:
 - source_sentence: الرجل يركب حصاناً
   sentences:
   - رجل يُبث الجبن الممزق على البيتزا
-  - ar-ar
-  - رجل يركب حصاناً
 - source_sentence: المرأة تقلي لحم خنزير مشوي
   sentences:
-  - ar-ar
   - امرأة تطبخ لحم خنزير مخبوز
   - طائرة طيران تقلع
 - source_sentence: امرأة تحمل في ذراعها طفل كنغر
   sentences:
   - امرأة تعزف على الغيتار
-  - ar-ar
   - امرأة تحمل و تحمل طفل كنغر
 - source_sentence: رجل يعزف على الناي
   sentences:
   - طائرة ستقلع
-  - ar-ar
   - رجل يعزف على فرقة الخيزران
 - source_sentence: ثلاثة رجال يلعبون الشطرنج.
   sentences:
   - رجلين يلعبان الشطرنج
   - بعض الرجال يقاتلون
-  - ar-ar
 datasets:
 - silma-ai/silma-arabic-english-sts-dataset-v1.0
 pipeline_tag: sentence-similarity
@@ -44,583 +38,260 @@ library_name: sentence-transformers
 metrics:
 - pearson_cosine
 - spearman_cosine
-model-index:
-- name: SentenceTransformer based on AhmedZaky1/DIMI-embedding-v2
-  results:
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: silma sts dev 768
-      type: silma-sts-dev-768
-    metrics:
-    - type: pearson_cosine
-      value: 0.8894298077237747
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8357984695231979
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: silma sts dev 512
-      type: silma-sts-dev-512
-    metrics:
-    - type: pearson_cosine
-      value: 0.8958835653694187
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8394578198917563
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: silma sts dev 256
-      type: silma-sts-dev-256
-    metrics:
-    - type: pearson_cosine
-      value: 0.9078743376141943
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8470163055535588
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: silma sts dev 128
-      type: silma-sts-dev-128
-    metrics:
-    - type: pearson_cosine
-      value: 0.9181556833949818
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.856188415278301
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: silma sts dev 64
-      type: silma-sts-dev-64
-    metrics:
-    - type: pearson_cosine
-      value: 0.9066219844975816
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8434430083292863
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: sts17 ar test 768
-      type: sts17-ar-test-768
-    metrics:
-    - type: pearson_cosine
-      value: 0.8205269118955641
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8258003312254673
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: sts17 ar test 512
-      type: sts17-ar-test-512
-    metrics:
-    - type: pearson_cosine
-      value: 0.8193403796404517
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8226611918447921
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: sts17 ar test 256
-      type: sts17-ar-test-256
-    metrics:
-    - type: pearson_cosine
-      value: 0.8190666923783347
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8245760514866052
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: sts17 ar test 128
-      type: sts17-ar-test-128
-    metrics:
-    - type: pearson_cosine
-      value: 0.8114629254813825
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8183273799928091
-      name: Spearman Cosine
-  - task:
-      type: semantic-similarity
-      name: Semantic Similarity
-    dataset:
-      name: sts17 ar test 64
-      type: sts17-ar-test-64
-    metrics:
-    - type: pearson_cosine
-      value: 0.796172574267003
-      name: Pearson Cosine
-    - type: spearman_cosine
-      value: 0.8077141358495715
-      name: Spearman Cosine
 ---
-# SentenceTransformer based on AhmedZaky1/DIMI-embedding-v2
-This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [AhmedZaky1/DIMI-embedding-v2](https://huggingface.co/AhmedZaky1/DIMI-embedding-v2) on the [silma-arabic-english-sts-dataset-v1.0](https://huggingface.co/datasets/silma-ai/silma-arabic-english-sts-dataset-v1.0) dataset. It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
-## Model Details
-### Model Description
-- **Model Type:** Sentence Transformer
-- **Base model:** [AhmedZaky1/DIMI-embedding-v2](https://huggingface.co/AhmedZaky1/DIMI-embedding-v2) <!-- at revision d4a6e4faaea9d9a2ad374fea48b093946166e753 -->
-- **Maximum Sequence Length:** 8192 tokens
-- **Output Dimensionality:** 768 dimensions
-- **Similarity Function:** Cosine Similarity
-- **Training Dataset:**
-    - [silma-arabic-english-sts-dataset-v1.0](https://huggingface.co/datasets/silma-ai/silma-arabic-english-sts-dataset-v1.0)
-- **Languages:** ar, en
-<!-- - **License:** Unknown -->
-### Model Sources
-- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
-- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
-- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
-### Full Model Architecture
-```
-SentenceTransformer(
-  (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: NewModel
-  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
-  (2): Normalize()
-)
-```
-## Usage
-### Direct Usage (Sentence Transformers)
-First install the Sentence Transformers library:
-```bash
-pip install -U sentence-transformers
-```
-Then you can load this model and run inference.
 ```python
 from sentence_transformers import SentenceTransformer
-# Download from the 🤗 Hub
-model = SentenceTransformer("AhmedZaky1/DIMI-embedding-v2-silma-sts-matryoshka")
-# Run inference
 sentences = [
-    'ثلاثة رجال يلعبون الشطرنج.',
-    'رجلين يلعبان الشطرنج',
-    'ar-ar',
 ]
 embeddings = model.encode(sentences)
-print(embeddings.shape)
-# [3, 768]
-# Get the similarity scores for the embeddings
-similarities = model.similarity(embeddings, embeddings)
-print(similarities.shape)
-# [3, 3]
 ```
-<!--
-### Direct Usage (Transformers)
-<details><summary>Click to see the direct usage in Transformers</summary>
-</details>
--->
-<!--
-### Downstream Usage (Sentence Transformers)
-You can finetune this model on your own dataset.
-<details><summary>Click to expand</summary>
-</details>
--->
-<!--
-### Out-of-Scope Use
-*List how the model may foreseeably be misused and address what users ought not to do with the model.*
--->
-## Evaluation
-### Metrics
-#### Semantic Similarity
-* Datasets: `silma-sts-dev-768`, `silma-sts-dev-512`, `silma-sts-dev-256`, `silma-sts-dev-128`, `silma-sts-dev-64`, `sts17-ar-test-768`, `sts17-ar-test-512`, `sts17-ar-test-256`, `sts17-ar-test-128` and `sts17-ar-test-64`
-* Evaluated with [<code>EmbeddingSimilarityEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.EmbeddingSimilarityEvaluator)
-| Metric              | silma-sts-dev-768 | silma-sts-dev-512 | silma-sts-dev-256 | silma-sts-dev-128 | silma-sts-dev-64 | sts17-ar-test-768 | sts17-ar-test-512 | sts17-ar-test-256 | sts17-ar-test-128 | sts17-ar-test-64 |
-|:--------------------|:------------------|:------------------|:------------------|:------------------|:-----------------|:------------------|:------------------|:------------------|:------------------|:-----------------|
-| pearson_cosine      | 0.8894            | 0.8959            | 0.9079            | 0.9182            | 0.9066           | 0.8205            | 0.8193            | 0.8191            | 0.8115            | 0.7962           |
-| **spearman_cosine** | **0.8358**        | **0.8395**        | **0.847**         | **0.8562**        | **0.8434**       | **0.8258**        | **0.8227**        | **0.8246**        | **0.8183**        | **0.8077**       |
-<!--
-## Bias, Risks and Limitations
-*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
--->
-<!--
-### Recommendations
-*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
--->
-## Training Details
-### Training Dataset
-#### silma-arabic-english-sts-dataset-v1.0
-* Dataset: [silma-arabic-english-sts-dataset-v1.0](https://huggingface.co/datasets/silma-ai/silma-arabic-english-sts-dataset-v1.0) at [1885690](https://huggingface.co/datasets/silma-ai/silma-arabic-english-sts-dataset-v1.0/tree/18856908c58bc3779ad089ec327093c8761d2523)
-* Size: 34,436 training samples
-* Columns: <code>sentence1</code>, <code>sentence2</code>, <code>score</code>, and <code>langs</code>
-* Approximate statistics based on the first 1000 samples:
-  |         | sentence1                                                                        | sentence2                                                                        | score                                                          | langs                                                                          |
-  |:--------|:---------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:---------------------------------------------------------------|:-------------------------------------------------------------------------------|
-  | type    | string                                                                           | string                                                                           | float                                                          | string                                                                         |
-  | details | <ul><li>min: 4 tokens</li><li>mean: 9.68 tokens</li><li>max: 26 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 9.68 tokens</li><li>max: 26 tokens</li></ul> | <ul><li>min: 0.0</li><li>mean: 0.47</li><li>max: 1.0</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 5.0 tokens</li><li>max: 5 tokens</li></ul> |
-* Samples:
-  | sentence1                          | sentence2                          | score            | langs              |
-  |:-----------------------------------|:-----------------------------------|:-----------------|:-------------------|
-  | <code>رجل يعزف على البيانو</code>  | <code>امرأة تعزف على الكمان</code> | <code>0.2</code> | <code>ar-ar</code> |
-  | <code>امرأة تعزف على الكمان</code> | <code>رجل يعزف على البيانو</code>  | <code>0.2</code> | <code>ar-ar</code> |
-  | <code>امرأة تعزف على الناي.</code> | <code>رجل يعزف على الغيتار</code>  | <code>0.2</code> | <code>ar-ar</code> |
-* Loss: [<code>MatryoshkaLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#matryoshkaloss) with these parameters:
-  ```json
-  {
-      "loss": "CoSENTLoss",
-      "matryoshka_dims": [
-          768,
-          512,
-          256,
-          128,
-          64
-      ],
-      "matryoshka_weights": [
-          1,
-          1,
-          1,
-          1,
-          1
-      ],
-      "n_dims_per_step": -1
-  }
-  ```
-### Evaluation Dataset
-#### silma-arabic-english-sts-dataset-v1.0
-* Dataset: [silma-arabic-english-sts-dataset-v1.0](https://huggingface.co/datasets/silma-ai/silma-arabic-english-sts-dataset-v1.0) at [1885690](https://huggingface.co/datasets/silma-ai/silma-arabic-english-sts-dataset-v1.0/tree/18856908c58bc3779ad089ec327093c8761d2523)
-* Size: 100 evaluation samples
-* Columns: <code>sentence1</code>, <code>sentence2</code>, <code>score</code>, and <code>langs</code>
-* Approximate statistics based on the first 100 samples:
-  |         | sentence1                                                                        | sentence2                                                                        | score                                                          | langs                                                                          |
-  |:--------|:---------------------------------------------------------------------------------|:---------------------------------------------------------------------------------|:---------------------------------------------------------------|:-------------------------------------------------------------------------------|
-  | type    | string                                                                           | string                                                                           | float                                                          | string                                                                         |
-  | details | <ul><li>min: 5 tokens</li><li>mean: 9.49 tokens</li><li>max: 19 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 9.49 tokens</li><li>max: 19 tokens</li></ul> | <ul><li>min: 0.1</li><li>mean: 0.74</li><li>max: 1.0</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 5.0 tokens</li><li>max: 5 tokens</li></ul> |
-* Samples:
-  | sentence1                          | sentence2                       | score             | langs              |
-  |:-----------------------------------|:--------------------------------|:------------------|:-------------------|
-  | <code>طائرة ستقلع</code>           | <code>طائرة طيران تقلع</code>   | <code>1.0</code>  | <code>ar-ar</code> |
-  | <code>طائرة طيران تقلع</code>      | <code>طائرة ستقلع</code>        | <code>1.0</code>  | <code>ar-ar</code> |
-  | <code>رجل يعزف على ناي كبير</code> | <code>رجل يعزف على الناي</code> | <code>0.76</code> | <code>ar-ar</code> |
-* Loss: [<code>MatryoshkaLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#matryoshkaloss) with these parameters:
-  ```json
-  {
-      "loss": "CoSENTLoss",
-      "matryoshka_dims": [
-          768,
-          512,
-          256,
-          128,
-          64
-      ],
-      "matryoshka_weights": [
-          1,
-          1,
-          1,
-          1,
-          1
-      ],
-      "n_dims_per_step": -1
-  }
-  ```
-### Training Hyperparameters
-#### Non-Default Hyperparameters
-- `eval_strategy`: steps
-- `per_device_train_batch_size`: 16
-- `per_device_eval_batch_size`: 16
-- `num_train_epochs`: 4
-- `warmup_ratio`: 0.1
-- `save_only_model`: True
-- `fp16`: True
-- `load_best_model_at_end`: True
-#### All Hyperparameters
-<details><summary>Click to expand</summary>
-- `overwrite_output_dir`: False
-- `do_predict`: False
-- `eval_strategy`: steps
-- `prediction_loss_only`: True
-- `per_device_train_batch_size`: 16
-- `per_device_eval_batch_size`: 16
-- `per_gpu_train_batch_size`: None
-- `per_gpu_eval_batch_size`: None
-- `gradient_accumulation_steps`: 1
-- `eval_accumulation_steps`: None
-- `torch_empty_cache_steps`: None
-- `learning_rate`: 5e-05
-- `weight_decay`: 0.0
-- `adam_beta1`: 0.9
-- `adam_beta2`: 0.999
-- `adam_epsilon`: 1e-08
-- `max_grad_norm`: 1.0
-- `num_train_epochs`: 4
-- `max_steps`: -1
-- `lr_scheduler_type`: linear
-- `lr_scheduler_kwargs`: {}
-- `warmup_ratio`: 0.1
-- `warmup_steps`: 0
-- `log_level`: passive
-- `log_level_replica`: warning
-- `log_on_each_node`: True
-- `logging_nan_inf_filter`: True
-- `save_safetensors`: True
-- `save_on_each_node`: False
-- `save_only_model`: True
-- `restore_callback_states_from_checkpoint`: False
-- `no_cuda`: False
-- `use_cpu`: False
-- `use_mps_device`: False
-- `seed`: 42
-- `data_seed`: None
-- `jit_mode_eval`: False
-- `use_ipex`: False
-- `bf16`: False
-- `fp16`: True
-- `fp16_opt_level`: O1
-- `half_precision_backend`: auto
-- `bf16_full_eval`: False
-- `fp16_full_eval`: False
-- `tf32`: None
-- `local_rank`: 0
-- `ddp_backend`: None
-- `tpu_num_cores`: None
-- `tpu_metrics_debug`: False
-- `debug`: []
-- `dataloader_drop_last`: False
-- `dataloader_num_workers`: 0
-- `dataloader_prefetch_factor`: None
-- `past_index`: -1
-- `disable_tqdm`: False
-- `remove_unused_columns`: True
-- `label_names`: None
-- `load_best_model_at_end`: True
-- `ignore_data_skip`: False
-- `fsdp`: []
-- `fsdp_min_num_params`: 0
-- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
-- `tp_size`: 0
-- `fsdp_transformer_layer_cls_to_wrap`: None
-- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
-- `deepspeed`: None
-- `label_smoothing_factor`: 0.0
-- `optim`: adamw_torch
-- `optim_args`: None
-- `adafactor`: False
-- `group_by_length`: False
-- `length_column_name`: length
-- `ddp_find_unused_parameters`: None
-- `ddp_bucket_cap_mb`: None
-- `ddp_broadcast_buffers`: False
-- `dataloader_pin_memory`: True
-- `dataloader_persistent_workers`: False
-- `skip_memory_metrics`: True
-- `use_legacy_prediction_loop`: False
-- `push_to_hub`: False
-- `resume_from_checkpoint`: None
-- `hub_model_id`: None
-- `hub_strategy`: every_save
-- `hub_private_repo`: None
-- `hub_always_push`: False
-- `gradient_checkpointing`: False
-- `gradient_checkpointing_kwargs`: None
-- `include_inputs_for_metrics`: False
-- `include_for_metrics`: []
-- `eval_do_concat_batches`: True
-- `fp16_backend`: auto
-- `push_to_hub_model_id`: None
-- `push_to_hub_organization`: None
-- `mp_parameters`:
-- `auto_find_batch_size`: False
-- `full_determinism`: False
-- `torchdynamo`: None
-- `ray_scope`: last
-- `ddp_timeout`: 1800
-- `torch_compile`: False
-- `torch_compile_backend`: None
-- `torch_compile_mode`: None
-- `include_tokens_per_second`: False
-- `include_num_input_tokens_seen`: False
-- `neftune_noise_alpha`: None
-- `optim_target_modules`: None
-- `batch_eval_metrics`: False
-- `eval_on_start`: False
-- `use_liger_kernel`: False
-- `eval_use_gather_object`: False
-- `average_tokens_across_devices`: False
-- `prompts`: None
-- `batch_sampler`: batch_sampler
-- `multi_dataset_batch_sampler`: proportional
-</details>
-### Training Logs
-| Epoch      | Step     | Training Loss | Validation Loss | silma-sts-dev-768_spearman_cosine | silma-sts-dev-512_spearman_cosine | silma-sts-dev-256_spearman_cosine | silma-sts-dev-128_spearman_cosine | silma-sts-dev-64_spearman_cosine | sts17-ar-test-768_spearman_cosine | sts17-ar-test-512_spearman_cosine | sts17-ar-test-256_spearman_cosine | sts17-ar-test-128_spearman_cosine | sts17-ar-test-64_spearman_cosine |
-|:----------:|:--------:|:-------------:|:---------------:|:---------------------------------:|:---------------------------------:|:---------------------------------:|:---------------------------------:|:--------------------------------:|:---------------------------------:|:---------------------------------:|:---------------------------------:|:---------------------------------:|:--------------------------------:|
-| 0.0929     | 100      | 39.5796       | 45.0982         | 0.7199                            | 0.7173                            | 0.7292                            | 0.7433                            | 0.7196                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.1857     | 200      | 31.3305       | 29.9877         | 0.7233                            | 0.7248                            | 0.7344                            | 0.7337                            | 0.7192                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.2786     | 300      | 27.7756       | 31.4644         | 0.7288                            | 0.7268                            | 0.7331                            | 0.7388                            | 0.7169                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.3714     | 400      | 27.7405       | 33.3315         | 0.7172                            | 0.7168                            | 0.7341                            | 0.7349                            | 0.7219                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.4643     | 500      | 27.1884       | 30.4957         | 0.7469                            | 0.7428                            | 0.7475                            | 0.7547                            | 0.7426                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.5571     | 600      | 27.0428       | 29.5877         | 0.7133                            | 0.7138                            | 0.7380                            | 0.7549                            | 0.7533                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.6500     | 700      | 26.7957       | 30.3813         | 0.7520                            | 0.7430                            | 0.7570                            | 0.7604                            | 0.7647                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.7428     | 800      | 26.2667       | 30.6293         | 0.7323                            | 0.7333                            | 0.7558                            | 0.7609                            | 0.7479                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.8357     | 900      | 25.9412       | 29.8621         | 0.7730                            | 0.7732                            | 0.7913                            | 0.8117                            | 0.7797                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 0.9285     | 1000     | 25.7816       | 31.7315         | 0.7856                            | 0.7918                            | 0.7916                            | 0.8025                            | 0.8048                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.0214     | 1100     | 25.1666       | 31.6311         | 0.7651                            | 0.7668                            | 0.7673                            | 0.7826                            | 0.7846                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.1142     | 1200     | 24.7681       | 32.3005         | 0.7719                            | 0.7892                            | 0.7941                            | 0.8022                            | 0.7939                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.2071     | 1300     | 24.8771       | 32.1761         | 0.7660                            | 0.7744                            | 0.7807                            | 0.7884                            | 0.7841                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.2999     | 1400     | 24.9063       | 33.2694         | 0.7646                            | 0.7644                            | 0.7884                            | 0.7906                            | 0.7886                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.3928     | 1500     | 24.7283       | 32.4350         | 0.7935                            | 0.7974                            | 0.8071                            | 0.8112                            | 0.8062                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.4856     | 1600     | 24.4217       | 34.1219         | 0.7781                            | 0.7754                            | 0.7739                            | 0.7916                            | 0.7889                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.5785     | 1700     | 24.4923       | 33.1239         | 0.7636                            | 0.7709                            | 0.7882                            | 0.7991                            | 0.7913                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.6713     | 1800     | 24.0844       | 33.5233         | 0.7785                            | 0.7832                            | 0.7880                            | 0.7977                            | 0.8014                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.7642     | 1900     | 24.1453       | 35.4602         | 0.7795                            | 0.7816                            | 0.8053                            | 0.8115                            | 0.7944                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.8570     | 2000     | 24.2271       | 36.2812         | 0.8003                            | 0.8009                            | 0.8008                            | 0.8102                            | 0.8009                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 1.9499     | 2100     | 23.7371       | 37.0276         | 0.7769                            | 0.7866                            | 0.7918                            | 0.7926                            | 0.7832                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.0427     | 2200     | 23.3566       | 34.5721         | 0.7931                            | 0.8017                            | 0.8020                            | 0.8159                            | 0.8027                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.1356     | 2300     | 23.2523       | 35.5316         | 0.7931                            | 0.7981                            | 0.7896                            | 0.8157                            | 0.8142                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.2284     | 2400     | 23.0447       | 36.6811         | 0.7973                            | 0.7962                            | 0.7935                            | 0.8030                            | 0.8037                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.3213     | 2500     | 22.9782       | 37.5482         | 0.8121                            | 0.8185                            | 0.8200                            | 0.8293                            | 0.8244                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.4141     | 2600     | 22.9119       | 37.2809         | 0.8077                            | 0.8116                            | 0.8113                            | 0.8333                            | 0.8151                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.5070     | 2700     | 23.1302       | 37.7993         | 0.8255                            | 0.8304                            | 0.8310                            | 0.8376                            | 0.8303                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.5998     | 2800     | 22.9941       | 38.8005         | 0.8182                            | 0.8214                            | 0.8143                            | 0.8193                            | 0.8155                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.6927     | 2900     | 22.8876       | 36.2524         | 0.8201                            | 0.8222                            | 0.8194                            | 0.8347                            | 0.8260                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.7855     | 3000     | 22.5304       | 38.1523         | 0.8195                            | 0.8280                            | 0.8356                            | 0.8545                            | 0.8394                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.8784     | 3100     | 22.446        | 39.4876         | 0.8242                            | 0.8246                            | 0.8319                            | 0.8483                            | 0.8397                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 2.9712     | 3200     | 22.5077       | 39.1910         | 0.8231                            | 0.8249                            | 0.8334                            | 0.8475                            | 0.8372                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| **3.0641** | **3300** | **21.9675**   | **36.4245**     | **0.8408**                        | **0.8425**                        | **0.8456**                        | **0.8619**                        | **0.8577**                       | **-**                             | **-**                             | **-**                             | **-**                             | **-**                            |
-| 3.1569     | 3400     | 21.9361       | 36.7119         | 0.8344                            | 0.8405                            | 0.8460                            | 0.8656                            | 0.8644                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.2498     | 3500     | 21.7747       | 37.7140         | 0.8279                            | 0.8353                            | 0.8414                            | 0.8510                            | 0.8446                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.3426     | 3600     | 21.8649       | 38.9102         | 0.8298                            | 0.8331                            | 0.8456                            | 0.8494                            | 0.8447                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.4355     | 3700     | 21.794        | 37.4385         | 0.8278                            | 0.8328                            | 0.8377                            | 0.8442                            | 0.8373                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.5283     | 3800     | 21.7968       | 37.0225         | 0.8352                            | 0.8501                            | 0.8540                            | 0.8722                            | 0.8553                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.6212     | 3900     | 21.5941       | 37.5736         | 0.8344                            | 0.8515                            | 0.8511                            | 0.8643                            | 0.8587                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.7140     | 4000     | 21.8181       | 37.4984         | 0.8340                            | 0.8440                            | 0.8470                            | 0.8607                            | 0.8484                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.8069     | 4100     | 21.7035       | 37.9701         | 0.8346                            | 0.8394                            | 0.8436                            | 0.8615                            | 0.8479                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.8997     | 4200     | 21.398        | 38.1567         | 0.8349                            | 0.8365                            | 0.8470                            | 0.8572                            | 0.8405                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 3.9926     | 4300     | 21.6518       | 38.3515         | 0.8358                            | 0.8395                            | 0.8470                            | 0.8562                            | 0.8434                           | -                                 | -                                 | -                                 | -                                 | -                                |
-| 4.0        | 4308     | -             | -               | -                                 | -                                 | -                                 | -                                 | -                                | 0.8258                            | 0.8227                            | 0.8246                            | 0.8183                            | 0.8077                           |
-* The bold row denotes the saved checkpoint.
-### Framework Versions
-- Python: 3.12.7
-- Sentence Transformers: 3.3.1
-- Transformers: 4.51.3
-- PyTorch: 2.6.0+cu124
-- Accelerate: 1.4.0
-- Datasets: 3.3.2
-- Tokenizers: 0.21.1
-## Citation
-### BibTeX
-#### Sentence Transformers
-```bibtex
-@inproceedings{reimers-2019-sentence-bert,
-    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
-    author = "Reimers, Nils and Gurevych, Iryna",
-    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
-    month = "11",
-    year = "2019",
-    publisher = "Association for Computational Linguistics",
-    url = "https://arxiv.org/abs/1908.10084",
-}
 ```
-#### MatryoshkaLoss
-```bibtex
-@misc{kusupati2024matryoshka,
-    title={Matryoshka Representation Learning},
-    author={Aditya Kusupati and Gantavya Bhatt and Aniket Rege and Matthew Wallingford and Aditya Sinha and Vivek Ramanujan and William Howard-Snyder and Kaifeng Chen and Sham Kakade and Prateek Jain and Ali Farhadi},
-    year={2024},
-    eprint={2205.13147},
-    archivePrefix={arXiv},
-    primaryClass={cs.LG}
-}
 ```
-#### CoSENTLoss
 ```bibtex
-@online{kexuefm-8847,
-    title={CoSENT: A more efficient sentence vector scheme than Sentence-BERT},
-    author={Su Jianlin},
-    year={2022},
-    month={Jan},
-    url={https://kexue.fm/archives/8847},
 }
 ```
-<!--
-## Glossary
-*Clearly define terms in order to be accessible across audiences.*
--->
-<!--
-## Model Card Authors
-*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
--->
-<!--
-## Model Card Contact
-*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
--->

 - source_sentence: الرجل يركب حصاناً
   sentences:
   - رجل يُبث الجبن الممزق على البيتزا
 - source_sentence: المرأة تقلي لحم خنزير مشوي
   sentences:
   - امرأة تطبخ لحم خنزير مخبوز
   - طائرة طيران تقلع
 - source_sentence: امرأة تحمل في ذراعها طفل كنغر
   sentences:
   - امرأة تعزف على الغيتار
   - امرأة تحمل و تحمل طفل كنغر
 - source_sentence: رجل يعزف على الناي
   sentences:
   - طائرة ستقلع
   - رجل يعزف على فرقة الخيزران
 - source_sentence: ثلاثة رجال يلعبون الشطرنج.
   sentences:
   - رجلين يلعبان الشطرنج
   - بعض الرجال يقاتلون
 datasets:
 - silma-ai/silma-arabic-english-sts-dataset-v1.0
 pipeline_tag: sentence-similarity
 metrics:
 - pearson_cosine
 - spearman_cosine
 ---
+# DIMI Embedding model
+<div align="center">
+![DIMI Logo]
+![image/jpeg](https://cdn-uploads.huggingface.co/production/uploads/65fb3ac20cfe262da2bb0fcc/i8PSS4Q4HufI-DG5hyyQw.jpeg)
+*State-of-the-art Multilingual Sentence Embeddings for Arabic-English Semantic Similarity*
+[![Hugging Face](https://img.shields.io/badge/🤗%20Hugging%20Face-Model-yellow)](https://huggingface.co/AhmedZaky1/DIMI-embedding-v3-silma-sts-matryoshka)
+[![License](https://img.shields.io/badge/License-MIT-blue.svg)](https://opensource.org/licenses/MIT)
+[![Python](https://img.shields.io/badge/Python-3.8+-green.svg)](https://python.org)
+</div>
+## 🚀 Model Description
+**DIMI-embedding-v3-silma-sts-matryoshka** is a cutting-edge multilingual sentence embedding model specifically fine-tuned for Arabic-English semantic textual similarity tasks. Built upon the robust DIMI-embedding-v2 architecture, this model leverages **Matryoshka Representation Learning** combined with **CoSENT Loss** to deliver exceptional performance across multiple embedding dimensions.
+### ✨ Key Features
+- **Multi-dimensional embeddings**: Supports output dimensions of 768, 512, 256, 128, and 64
+- **Bilingual expertise**: Optimized for Arabic and English text processing
+- **Matryoshka architecture**: Efficient embedding computation at multiple granularities
+- **State-of-the-art performance**: Fine-tuned on the comprehensive Silma Arabic-English STS dataset
+- **Cosine similarity optimized**: Perfect for semantic similarity and retrieval tasks
+## 📊 Model Performance
+The model demonstrates exceptional performance across different embedding dimensions:
+### Training Techniques
+This model was trained using advanced techniques for optimal performance:
+- **Matryoshka Representation Learning**: Enables efficient embeddings at multiple dimensions [768, 512, 256, 128, 64] without retraining
+- **CoSENT Loss Function**: Cosine-based sentence embedding loss for superior semantic similarity learning
+- **Multi-dimensional Evaluation**: Simultaneous optimization across all target dimensions during training
+- **Mixed Precision Training (FP16)**: Accelerated training with maintained numerical stability
+- **Warmup Learning Rate Schedule**: Gradual learning rate increase for stable convergence
+- **Best Model Selection**: Automatic selection based on highest Spearman correlation on 768d embeddings
+### Final Model Performance
+#### Development Set Results (Silma STS Dataset)
+Final evaluation on the held-out development set:
+| Dimension | Pearson Correlation | Spearman Correlation |
+|-----------|-------------------|---------------------|
+| 768d | 0.8894 | 0.8358 |
+| 512d | 0.8959 | 0.8395 |
+| 256d | 0.8979 | 0.8470 |
+| 128d | 0.9182 | 0.8562 |
+| 64d | 0.9066 | 0.8434 |
+#### MTEB STS17 Arabic Test Results
+Performance on the standard MTEB STS17 (ar-ar) benchmark:
+| Dimension | Pearson Correlation | Spearman Correlation |
+|-----------|-------------------|---------------------|
+| **768d** | **0.8205** | **0.8258** |
+| **512d** | **0.8193** | **0.8227** |
+| **256d** | **0.8191** | **0.8246** |
+| **128d** | **0.8115** | **0.8183** |
+| **64d** | **0.7962** | **0.8077** |
+**Sequential Score**: 0.8077 (based on 64d performance)
+## 🔧 Usage
+### Basic Usage
 ```python
 from sentence_transformers import SentenceTransformer
+# Load the model
+model = SentenceTransformer('AhmedZaky1/DIMI-embedding-v3-silma-sts-matryoshka')
+# Example sentences in Arabic and English
 sentences = [
+    "هذا مثال جميل للذكاء الاصطناعي",  # Arabic
+    "This is a beautiful example of artificial intelligence",  # English
+    "التعلم الآلي يغير العالم",  # Arabic
+    "Machine learning is changing the world"  # English
 ]
+# Generate embeddings
 embeddings = model.encode(sentences)
+print(f"Embedding shape: {embeddings.shape}")
+# Calculate cosine similarity
+from sklearn.metrics.pairwise import cosine_similarity
+similarity_matrix = cosine_similarity(embeddings)
+print("Similarity matrix:")
+print(similarity_matrix)
 ```
+### Matryoshka Embeddings Usage
+```python
+# Use different embedding dimensions
+dimensions = [768, 512, 256, 128, 64]
+for dim in dimensions:
+    # Truncate embeddings to specific dimension
+    truncated_embeddings = embeddings[:, :dim]
+    print(f"Dimension {dim}: {truncated_embeddings.shape}")
+    # Calculate similarity with truncated embeddings
+    similarity = cosine_similarity(truncated_embeddings)
+    print(f"Average similarity at {dim}d: {similarity.mean():.4f}")
 ```
+### Semantic Search Example
+```python
+import numpy as np
+# Query and corpus
+query = "ما هو الذكاء الاصطناعي؟"  # "What is artificial intelligence?"
+corpus = [
+    "الذكاء الاصطناعي هو محاكاة الذكاء البشري",
+    "Machine learning is a subset of AI",
+    "Deep learning uses neural networks",
+    "التعلم العميق يستخدم الشبكات العصبية"
+]
+# Encode query and corpus
+query_embedding = model.encode([query])
+corpus_embeddings = model.encode(corpus)
+# Find most similar documents
+similarities = cosine_similarity(query_embedding, corpus_embeddings)[0]
+top_indices = np.argsort(similarities)[::-1]
+print(f"Query: {query}")
+print("\nMost similar documents:")
+for i, idx in enumerate(top_indices[:3]):
+    print(f"{i+1}. {corpus[idx]} (similarity: {similarities[idx]:.4f})")
 ```
+## 🏗️ Model Architecture
+- **Base Model**: DIMI-embedding-v2
+- **Training Objective**: CoSENT Loss with Matryoshka Learning
+- **Supported Dimensions**: [768, 512, 256, 128, 64]
+- **Max Sequence Length**: 512 tokens
+- **Pooling Method**: Mean pooling
+- **Similarity Function**: Cosine similarity
+## 📊 Training Details
+### Dataset
+- **Primary Dataset**: silma-ai/silma-arabic-english-sts-dataset-v1.0
+- **Evaluation Dataset**: MTEB STS17 (ar-ar)
+- **Training Samples**: ~24,000+ multilingual sentence pairs
+- **Evaluation Samples**: 100 held-out pairs
+### Training Configuration
+- **Batch Size**: 16
+- **Epochs**: 4
+- **Learning Rate**: Warmup ratio 0.1
+- **Precision**: FP16
+- **Evaluation Strategy**: Every 100 steps
+- **Best Model Selection**: Highest Spearman correlation on 768d embeddings
+### Hardware Requirements
+- **GPU**: CUDA-compatible GPU recommended
+- **Memory**: 16GB+ RAM for training
+- **Storage**: 2GB+ for model weights
+## 🎯 Applications
+This model excels in various NLP tasks:
+- **Semantic Textual Similarity**: Measure similarity between Arabic-English text pairs
+- **Information Retrieval**: Find relevant documents in multilingual corpora
+- **Paraphrase Detection**: Identify semantically equivalent sentences
+- **Cross-lingual Search**: Search Arabic content with English queries and vice versa
+- **Clustering**: Group similar multilingual documents
+- **Recommendation Systems**: Content-based recommendations across languages
+## ⚖️ Limitations and Bias
+- Primarily optimized for Arabic and English; performance on other languages may vary
+- Performance may degrade on domain-specific technical terminology
+- Potential cultural and linguistic biases inherited from training data
+- Best performance achieved with sentence-level inputs rather than single words
+## 📝 Citation
+If you use this model in your research, please cite:
 ```bibtex
+@misc{dimi-embedding-v3-2024,
+  title={DIMI-embedding-v3-silma-sts-matryoshka: Multilingual Sentence Embeddings for Arabic-English Semantic Similarity},
+  author={Ahmed Zaky},
+  year={2024},
+  publisher={Hugging Face},
+  url={https://huggingface.co/AhmedZaky1/DIMI-embedding-v3-silma-sts-matryoshka}
 }
 ```
+## 📧 Contact
+**Author**: Ahmed Zaky
+**Email**: [email protected]
+**GitHub**: [@AhmedZaky1](https://github.com/AhmedZaky1)
+## 📄 License
+This model is released under the **MIT License**.
+```
+MIT License
+Copyright (c) 2024 Ahmed Zaky
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
+```
+## 🙏 Acknowledgments
+- **Silma AI** for providing the high-quality Arabic-English STS dataset
+- **Sentence Transformers** library for the excellent framework
+- **Hugging Face** for model hosting and distribution
+- The **MTEB** benchmark for evaluation standards
+---
+<div align="center">
+**Built with ❤️ by Ahmed Zaky**
+*Advancing Arabic NLP through state-of-the-art embedding models*
+</div>