redis/langcache-embed-v3

@@ -63,49 +63,6 @@ datasets:
 - redis/langcache-sentencepairs-v2
 pipeline_tag: sentence-similarity
 library_name: sentence-transformers
-metrics:
-- cosine_accuracy@1
-- cosine_precision@1
-- cosine_recall@1
-- cosine_ndcg@10
-- cosine_mrr@1
-- cosine_map@100
-- cosine_auc_precision_cache_hit_ratio
-- cosine_auc_similarity_distribution
-model-index:
-- name: Redis fine-tuned BiEncoder model for semantic caching on LangCache
-  results:
-  - task:
-      type: custom-information-retrieval
-      name: Custom Information Retrieval
-    dataset:
-      name: test
-      type: test
-    metrics:
-    - type: cosine_accuracy@1
-      value: 0.5953768980477223
-      name: Cosine Accuracy@1
-    - type: cosine_precision@1
-      value: 0.5953768980477223
-      name: Cosine Precision@1
-    - type: cosine_recall@1
-      value: 0.5778879609728815
-      name: Cosine Recall@1
-    - type: cosine_ndcg@10
-      value: 0.7775436499957671
-      name: Cosine Ndcg@10
-    - type: cosine_mrr@1
-      value: 0.5953768980477223
-      name: Cosine Mrr@1
-    - type: cosine_map@100
-      value: 0.7274666565910912
-      name: Cosine Map@100
-    - type: cosine_auc_precision_cache_hit_ratio
-      value: 0.36387321267916206
-      name: Cosine Auc Precision Cache Hit Ratio
-    - type: cosine_auc_similarity_distribution
-      value: 0.15403918371209657
-      name: Cosine Auc Similarity Distribution
 ---
 # Redis fine-tuned BiEncoder model for semantic caching on LangCache
@@ -137,6 +94,8 @@ This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [A
 SentenceTransformer(
   (0): Transformer({'max_seq_length': 100, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
   (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
 )
 ```
@@ -169,9 +128,9 @@ print(embeddings.shape)
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
-# tensor([[1.0001, 1.0001, 0.8242],
-#         [1.0001, 1.0001, 0.8242],
-#         [0.8242, 0.8242, 1.0000]])
 ```
 <!--
@@ -198,26 +157,6 @@ You can finetune this model on your own dataset.
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
-## Evaluation
-### Metrics
-#### Custom Information Retrieval
-* Dataset: `test`
-* Evaluated with <code>ir_evaluator.CustomInformationRetrievalEvaluator</code>
-| Metric                               | Value      |
-|:-------------------------------------|:-----------|
-| cosine_accuracy@1                    | 0.5954     |
-| cosine_precision@1                   | 0.5954     |
-| cosine_recall@1                      | 0.5779     |
-| **cosine_ndcg@10**                   | **0.7775** |
-| cosine_mrr@1                         | 0.5954     |
-| cosine_map@100                       | 0.7275     |
-| cosine_auc_precision_cache_hit_ratio | 0.3639     |
-| cosine_auc_similarity_distribution   | 0.154      |
 <!--
 ## Bias, Risks and Limitations
@@ -433,12 +372,6 @@ You can finetune this model on your own dataset.
 </details>
-### Training Logs
-| Epoch | Step | test_cosine_ndcg@10 |
-|:-----:|:----:|:-------------------:|
-| -1    | -1   | 0.7775              |
 ### Framework Versions
 - Python: 3.12.3
 - Sentence Transformers: 5.1.0

 - redis/langcache-sentencepairs-v2
 pipeline_tag: sentence-similarity
 library_name: sentence-transformers
 ---
 # Redis fine-tuned BiEncoder model for semantic caching on LangCache
 SentenceTransformer(
   (0): Transformer({'max_seq_length': 100, 'do_lower_case': False, 'architecture': 'ModernBertModel'})
   (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
+  (mlp_hidden): Dense({'in_features': 768, 'out_features': 768, 'bias': True, 'activation_function': 'torch.nn.modules.activation.ReLU'})
+  (mlp_out): Dense({'in_features': 768, 'out_features': 768, 'bias': True, 'activation_function': 'torch.nn.modules.linear.Identity'})
 )
 ```
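As a rough illustration of what the two added `Dense` modules compute on top of the pooled CLS vector, the sketch below applies a linear layer with ReLU (`mlp_hidden`) followed by a linear layer with an identity activation (`mlp_out`). It is a toy under stated assumptions: the width is 3 rather than 768, the weights are hand-picked rather than trained, and the helper names `linear`, `relu`, and `mlp_head` are hypothetical and not part of the model or the library.

```python
# Toy stand-in for the two Dense modules. Assumptions: width 3 instead of 768,
# hand-picked weights instead of the trained ones, hypothetical helper names.

def linear(x, weight, bias):
    """Compute weight @ x + bias for a single vector x."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weight, bias)]

def relu(x):
    """Element-wise max(0, v)."""
    return [max(0.0, v) for v in x]

def mlp_head(pooled, hidden_params, out_params):
    """mlp_hidden (linear + ReLU) followed by mlp_out (linear + identity)."""
    hidden = relu(linear(pooled, *hidden_params))
    return linear(hidden, *out_params)  # identity activation: nothing more to apply

# (weight, bias) pairs for each layer, chosen only to make the arithmetic easy to follow.
hidden_params = ([[1.0, 0.0, 0.0], [0.0, -1.0, 0.0], [0.0, 0.0, 1.0]], [0.0, 0.0, -1.0])
out_params = ([[1.0, 1.0, 0.0], [0.0, 1.0, 1.0], [1.0, 0.0, 1.0]], [0.5, 0.5, 0.5])

pooled = [2.0, 3.0, 0.5]  # stands in for the pooled CLS embedding
embedding = mlp_head(pooled, hidden_params, out_params)
print(embedding)
```

Because both layers map 768 features to 768 features, the head leaves the embedding dimension unchanged; only the values are transformed.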
 # Get the similarity scores for the embeddings
 similarities = model.similarity(embeddings, embeddings)
 print(similarities)
+# tensor([[1.0000, 1.0000, 0.7693],
+#         [1.0000, 1.0000, 0.7693],
+#         [0.7693, 0.7693, 1.0000]])
 ```
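To make the shape of the printed matrix easier to read, here is a minimal pure-Python sketch of a pairwise cosine similarity matrix. It assumes the scores are cosine similarities, which the `cosine_*` metric names in this model card suggest; the function `cosine_similarity_matrix` and the 2-dimensional vectors are hypothetical and chosen only for easy arithmetic.

```python
import math

def cosine_similarity_matrix(vectors):
    """Return the matrix of pairwise cosine similarities between the given vectors."""
    norms = [math.sqrt(sum(v * v for v in vec)) for vec in vectors]
    return [
        [sum(a * b for a, b in zip(u, w)) / (nu * nw) for w, nw in zip(vectors, norms)]
        for u, nu in zip(vectors, norms)
    ]

# Hypothetical embeddings: the first two are identical, the third points elsewhere.
vectors = [[3.0, 4.0], [3.0, 4.0], [4.0, 3.0]]
sims = cosine_similarity_matrix(vectors)
for row in sims:
    print([round(s, 4) for s in row])
```

The result is symmetric with ones on the diagonal, and two identical inputs produce identical rows, which matches the layout of the tensor printed above.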
 <!--
 *List how the model may foreseeably be misused and address what users ought not to do with the model.*
 -->
 <!--
 ## Bias, Risks and Limitations
 </details>
 ### Framework Versions
 - Python: 3.12.3
 - Sentence Transformers: 5.1.0

modules.json CHANGED

@@ -10,5 +10,17 @@
     "name": "1",
     "path": "1_Pooling",
     "type": "sentence_transformers.models.Pooling"
   }
 ]

     "name": "1",
     "path": "1_Pooling",
     "type": "sentence_transformers.models.Pooling"
+  },
+  {
+    "idx": 2,
+    "name": "mlp_hidden",
+    "path": "2_Dense",
+    "type": "sentence_transformers.models.Dense"
+  },
+  {
+    "idx": 3,
+    "name": "mlp_out",
+    "path": "3_Dense",
+    "type": "sentence_transformers.models.Dense"
   }
 ]
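As a small aid for reading the new entries, the sketch below parses only the two objects added in this diff and lists them in `idx` order, which lines up with the `mlp_hidden` and `mlp_out` modules in the updated architecture printout. The earlier entries are left out because they are not fully visible here, and the variable names are hypothetical.

```python
import json

# Only the two entries added in this diff; the earlier entries are omitted
# because the diff does not show them in full.
added_entries_json = """
[
  {"idx": 3, "name": "mlp_out", "path": "3_Dense", "type": "sentence_transformers.models.Dense"},
  {"idx": 2, "name": "mlp_hidden", "path": "2_Dense", "type": "sentence_transformers.models.Dense"}
]
"""

# Sort by idx so the listing follows the module order regardless of file order.
entries = sorted(json.loads(added_entries_json), key=lambda entry: entry["idx"])

# (name, subfolder, class name) for each added module.
chain = [(e["name"], e["path"], e["type"].rsplit(".", 1)[-1]) for e in entries]
for name, path, cls in chain:
    print(f"{name}: {cls} stored under {path}/")
```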