radoslavralev committed
Commit 91abacf · verified · 1 Parent(s): ae9eb84

Add new SentenceTransformer model

2_Dense/model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fe28270e57d9e8e90e8666418d902aa8ecb6254c77adda1949f6cbd4bdddb8c0
+ oid sha256:9997181ec203c76a0e08ecba57c47a10999519c2736241efc55aadbd8d389584
  size 2362528
3_Dense/model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9a621984b96056acc53643d57acdce2f420d7dfe7a155ea8fcfd949064f4ff1f
+ oid sha256:db470fd6a6c46fd748b3e0d97974cb3788a47741d1005aca5aff6ccc250b737c
  size 2362528
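
The `model.safetensors` entries above are Git LFS pointer files rather than the weights themselves: a three-line `version` / `oid` / `size` record, where `oid` is the SHA-256 of the actual blob. A minimal sketch of reading that format (the `parse_lfs_pointer` helper is hypothetical, not part of git-lfs):

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

# The new 2_Dense pointer from the diff above.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:9997181ec203c76a0e08ecba57c47a10999519c2736241efc55aadbd8d389584
size 2362528"""

info = parse_lfs_pointer(pointer)
print(info["oid"])        # sha256:9997181e...
print(int(info["size"]))  # 2362528
```

Comparing `info["oid"]` against `shasum -a 256` of the downloaded file is one way to verify an LFS checkout.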
README.md CHANGED
@@ -12,53 +12,48 @@ tags:
  - retrieval
  - reranking
  - generated_from_trainer
- - dataset_size:13675
+ - dataset_size:1460771
  - loss:ArcFaceInBatchLoss
  base_model: Alibaba-NLP/gte-modernbert-base
  widget:
- - source_sentence: Bathurst Street has been the heart of the Jewish community of Toronto
-     for decades .
+ - source_sentence: '"How much would I need to narrate a ""Let''s Play"" video in order
+     to make money from it on YouTube?"'
    sentences:
- - Baron portrayed actress Violet Carson who played Ena Sharples in the soap .
- - Bathurst Street has been the heart of the Jewish community of Toronto for many
-     decades .
- - It stretches approximately 20 miles from Manasquan Inlet in Point Pleasant Beach
-     in the north to Island Beach State Park in the south .
- - source_sentence: All tracks produced by Zack Shada , Jeremy Shada , Logan Charles
-     , John Spicer and Seth Renken . All tracks are written by Zack Odom and Kenneth
-     Mount .
+ - How much money do people make from YouTube videos with 1 million views?
+ - '"How much would I need to narrate a ""Let''s Play"" video in order to make money
+     from it on YouTube?"'
+ - '"Does the sentence, ""I expect to be disappointed,"" make sense?"'
+ - source_sentence: '"I appreciate that.'
    sentences:
- - All tracks produced by Zack Shada , Jeremy Shada , Logan Charles , John Spicer
-     and Seth Renken . All tracks are written by Zack Odom and Kenneth Mount .
- - All tracks by Zack Shada , Jeremy Shada , John Spicer , Logan Charles and Seth
-     Renken are produced by Zack Odom and Kenneth Mount .
- - Jimmy Connors defeated Eddie Dibbs 7 -- 5 , 7 -- 5
- - source_sentence: Arque Municipality is situated in the eastern part of the province
-     and Tacopaya Municipality is located in the west .
+ - '"How is the Mariner rewarded in ""The Rime of the Ancient Mariner"" by Samuel
+     Taylor Coleridge?"'
+ - '"I appreciate that.'
+ - I can appreciate that.
+ - source_sentence: '"""It is very easy to defeat someone, but too hard to win some
+     one"". What does the previous sentence mean?"'
    sentences:
- - Arque Municipality is situated in the eastern part of the province and Tacopaya
-     Municipality is located in the west .
- - Bangkok International Preparatory and Secondary School , or Bangkok Prep , is
-     an independent international school located on the National Curriculum of England
-     based in Bangkok , Thailand .
- - The municipality of Tacopaya is situated in the eastern part of the province and
-     municipality of Arque located in the west .
- - source_sentence: Browning is identified as married , but no wife or child is captured
-     .
+ - '"How can you use the word ""visceral"" in a sentence?"'
+ - '"""It is very easy to defeat someone, but too hard to win some one"". What does
+     the previous sentence mean?"'
+ - '"What does ""The loudest one in the room is the weakest one in the room."" Mean?"'
+ - source_sentence: '" We condemn this raid which is in our view illegal and morally
+     and politically unjustifiable , " London-based NCRI official Ali Safavi told Reuters
+     by telephone .'
    sentences:
- - Alexander Alexander is the grandson of the Sarawak - leader Tun Jugah Barieng
-     and the son of former politician Tan Sri Datuk Amar Leonard Linggi .
- - Browning is identified as married , but no wife or child is recorded .
- - It was formerly known also as ' Crotto ' .
- - source_sentence: Actor Charlie Chan , who portrayed Warner Oland when `` The Black
-     Camel `` was filmed in Hawaii , he met .
+ - 'London-based NCRI official Ali Safavi told Reuters : " We condemn this raid ,
+     which is in our view illegal and morally and politically unjustifiable . "'
+ - The social awkwardness is complicated by the fact that Marianne is a white girl
+     living with a black family .
+ - art's cause, this in my opinion
+ - source_sentence: '"If you click ""like"" on an old post that someone made on your
+     wall yet you''re no longer Facebook friends, will they still receive a notification?"'
    sentences:
- - Chang met actor Warner Oland , who portrayed Charlie Chan , when `` The Black
-     Camel `` was filmed in Hawaii .
- - As an actor , he joined the Royal Shakespeare Company of Peter Hall , working
-     with Peggy Ashcroft and Dame Edith Evans .
- - Actor Charlie Chan , who portrayed Warner Oland when `` The Black Camel `` was
-     filmed in Hawaii , he met .
+ - '"Is there is any two wheeler having a gear box which has the feature ""automatic
+     neutral"" when the engine is off while it is in gear?"'
+ - '"If you click ""like"" on an old post that someone made on your wall yet you''re
+     no longer Facebook friends, will they still receive a notification?"'
+ - '"If your teenage son posted ""La commedia e finita"" on his Facebook wall, would
+     you be concerned?"'
  datasets:
  - redis/langcache-sentencepairs-v2
  pipeline_tag: sentence-similarity
@@ -160,9 +155,9 @@ from sentence_transformers import SentenceTransformer
  model = SentenceTransformer("redis/langcache-embed-v3")
  # Run inference
  sentences = [
-     'Actor Charlie Chan , who portrayed Warner Oland when `` The Black Camel `` was filmed in Hawaii , he met .',
-     'Actor Charlie Chan , who portrayed Warner Oland when `` The Black Camel `` was filmed in Hawaii , he met .',
-     'Chang met actor Warner Oland , who portrayed Charlie Chan , when `` The Black Camel `` was filmed in Hawaii .',
+     '"If you click ""like"" on an old post that someone made on your wall yet you\'re no longer Facebook friends, will they still receive a notification?"',
+     '"If you click ""like"" on an old post that someone made on your wall yet you\'re no longer Facebook friends, will they still receive a notification?"',
+     '"If your teenage son posted ""La commedia e finita"" on his Facebook wall, would you be concerned?"',
  ]
  embeddings = model.encode(sentences)
  print(embeddings.shape)
@@ -171,9 +166,9 @@ print(embeddings.shape)
  # Get the similarity scores for the embeddings
  similarities = model.similarity(embeddings, embeddings)
  print(similarities)
- # tensor([[0.9998, 0.9998, 0.5864],
- #         [0.9998, 0.9998, 0.5864],
- #         [0.5864, 0.5864, 1.0000]])
+ # tensor([[1.0000, 1.0000, 0.2617],
+ #         [1.0000, 1.0000, 0.2617],
+ #         [0.2617, 0.2617, 1.0000]])
  ```

  <!--
@@ -239,19 +234,19 @@ You can finetune this model on your own dataset.
  #### LangCache Sentence Pairs (all)

  * Dataset: [LangCache Sentence Pairs (all)](https://huggingface.co/datasets/redis/langcache-sentencepairs-v2)
- * Size: 6,786 training samples
+ * Size: 132,354 training samples
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
  * Approximate statistics based on the first 1000 samples:
- | | anchor | positive | negative |
- |:--------|:---------|:---------|:---------|
- | type | string | string | string |
- | details | <ul><li>min: 9 tokens</li><li>mean: 27.96 tokens</li><li>max: 50 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 27.98 tokens</li><li>max: 51 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 27.56 tokens</li><li>max: 49 tokens</li></ul> |
+ | | anchor | positive | negative |
+ |:--------|:---------|:---------|:---------|
+ | type | string | string | string |
+ | details | <ul><li>min: 4 tokens</li><li>mean: 25.33 tokens</li><li>max: 100 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 24.98 tokens</li><li>max: 100 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 19.06 tokens</li><li>max: 68 tokens</li></ul> |
  * Samples:
- | anchor | positive | negative |
- |:---------|:---------|:---------|
- | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers Win 4-0</code> | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers win series 4-0 ``</code> | <code>( 1 ) Los Angeles Lakers vs. ( 2 ) San Antonio Spurs : `` Lakers win series 4-0 ``</code> |
- | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers win series 4-0 ``</code> | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers Win 4-0</code> | <code>The study included 752 universities in Pennsylvania , including public schools , public charter schools and traditional public magnet schools .</code> |
- | <code>( 1 ) Los Angeles Lakers vs. ( 2 ) San Antonio Spurs : `` Lakers win series 4-0 ``</code> | <code>( 1 ) Los Angeles Lakers vs. ( 2 ) San Antonio Spurs : `` Lakers win series 4-0 ``</code> | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers Win 4-0</code> |
+ | anchor | positive | negative |
+ |:---------|:---------|:---------|
+ | <code> What high potential jobs are there other than computer science?</code> | <code> What high potential jobs are there other than computer science?</code> | <code>Why IT or Computer Science jobs are being over rated than other Engineering jobs?</code> |
+ | <code> Would India ever be able to develop a missile system like S300 or S400 missile?</code> | <code> Would India ever be able to develop a missile system like S300 or S400 missile?</code> | <code>Should India buy the Russian S400 air defence missile system?</code> |
+ | <code> water from the faucet is being drunk by a yellow dog</code> | <code>A yellow dog is drinking water from the faucet</code> | <code>Childlessness is low in Eastern European countries.</code> |
  * Loss: <code>losses.ArcFaceInBatchLoss</code> with these parameters:
  ```json
  {
@@ -266,19 +261,19 @@ You can finetune this model on your own dataset.
  #### LangCache Sentence Pairs (all)

  * Dataset: [LangCache Sentence Pairs (all)](https://huggingface.co/datasets/redis/langcache-sentencepairs-v2)
- * Size: 6,786 evaluation samples
+ * Size: 132,354 evaluation samples
  * Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
  * Approximate statistics based on the first 1000 samples:
- | | anchor | positive | negative |
- |:--------|:---------|:---------|:---------|
- | type | string | string | string |
- | details | <ul><li>min: 9 tokens</li><li>mean: 27.96 tokens</li><li>max: 50 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 27.98 tokens</li><li>max: 51 tokens</li></ul> | <ul><li>min: 9 tokens</li><li>mean: 27.56 tokens</li><li>max: 49 tokens</li></ul> |
+ | | anchor | positive | negative |
+ |:--------|:---------|:---------|:---------|
+ | type | string | string | string |
+ | details | <ul><li>min: 4 tokens</li><li>mean: 25.33 tokens</li><li>max: 100 tokens</li></ul> | <ul><li>min: 4 tokens</li><li>mean: 24.98 tokens</li><li>max: 100 tokens</li></ul> | <ul><li>min: 5 tokens</li><li>mean: 19.06 tokens</li><li>max: 68 tokens</li></ul> |
  * Samples:
- | anchor | positive | negative |
- |:---------|:---------|:---------|
- | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers Win 4-0</code> | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers win series 4-0 ``</code> | <code>( 1 ) Los Angeles Lakers vs. ( 2 ) San Antonio Spurs : `` Lakers win series 4-0 ``</code> |
- | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers win series 4-0 ``</code> | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers Win 4-0</code> | <code>The study included 752 universities in Pennsylvania , including public schools , public charter schools and traditional public magnet schools .</code> |
- | <code>( 1 ) Los Angeles Lakers vs. ( 2 ) San Antonio Spurs : `` Lakers win series 4-0 ``</code> | <code>( 1 ) Los Angeles Lakers vs. ( 2 ) San Antonio Spurs : `` Lakers win series 4-0 ``</code> | <code>( 1 ) Lakers vs. ( 2 ) San Antonio Spurs : `` Los Angeles Lakers Win 4-0</code> |
+ | anchor | positive | negative |
+ |:---------|:---------|:---------|
+ | <code> What high potential jobs are there other than computer science?</code> | <code> What high potential jobs are there other than computer science?</code> | <code>Why IT or Computer Science jobs are being over rated than other Engineering jobs?</code> |
+ | <code> Would India ever be able to develop a missile system like S300 or S400 missile?</code> | <code> Would India ever be able to develop a missile system like S300 or S400 missile?</code> | <code>Should India buy the Russian S400 air defence missile system?</code> |
+ | <code> water from the faucet is being drunk by a yellow dog</code> | <code>A yellow dog is drinking water from the faucet</code> | <code>Childlessness is low in Eastern European countries.</code> |
  * Loss: <code>losses.ArcFaceInBatchLoss</code> with these parameters:
  ```json
  {
@@ -292,8 +287,8 @@ You can finetune this model on your own dataset.
  #### Non-Default Hyperparameters

  - `eval_strategy`: steps
- - `per_device_train_batch_size`: 4096
- - `per_device_eval_batch_size`: 4096
+ - `per_device_train_batch_size`: 8192
+ - `per_device_eval_batch_size`: 8192
  - `gradient_accumulation_steps`: 2
  - `weight_decay`: 0.001
  - `adam_beta2`: 0.98
@@ -319,8 +314,8 @@ You can finetune this model on your own dataset.
  - `do_predict`: False
  - `eval_strategy`: steps
  - `prediction_loss_only`: True
- - `per_device_train_batch_size`: 4096
- - `per_device_eval_batch_size`: 4096
+ - `per_device_train_batch_size`: 8192
+ - `per_device_eval_batch_size`: 8192
  - `per_gpu_train_batch_size`: None
  - `per_gpu_eval_batch_size`: None
  - `gradient_accumulation_steps`: 2
@@ -439,7 +434,7 @@ You can finetune this model on your own dataset.
  ### Training Logs
  | Epoch | Step | Validation Loss | test_cosine_ndcg@10 |
  |:-----:|:----:|:---------------:|:-------------------:|
- | 0     | 0    | 1.4689          | 0.7718              |
+ | 0     | 0    | 2.9916          | 0.7718              |


  ### Framework Versions
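
The updated README example reports a similarity matrix where the two duplicate sentences score 1.0000 against each other and 0.2617 against the unrelated one. The structure of that matrix can be sketched with plain cosine similarity on toy vectors (assuming, as is the sentence-transformers default, that `model.similarity` computes cosine similarity), without downloading the model:

```python
import math

def cosine(a, b):
    """Cosine similarity between two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Toy "embeddings": two identical vectors and one orthogonal vector, mirroring
# the README's two duplicate sentences and one unrelated sentence.
emb = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
sim = [[cosine(a, b) for b in emb] for a in emb]
print(sim)  # [[1.0, 1.0, 0.0], [1.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
```

As in the README output, identical inputs score 1.0 both on and off the diagonal, and the matrix is symmetric.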
config.json CHANGED
@@ -12,7 +12,7 @@
  "cls_token_id": 50281,
  "decoder_bias": true,
  "deterministic_flash_attn": false,
- "dtype": "bfloat16",
+ "dtype": "float32",
  "embedding_dropout": 0.0,
  "eos_token_id": 50282,
  "global_attn_every_n_layers": 3,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:95d02211c4cca89113f9f3e93ed91f5176bf50170faa2cb835f7bfea15bb9dd2
- size 298041696
+ oid sha256:04aa7437b7f98ed3f652e300c1d767d07c1864c10b3055ea63831997faefa8d6
+ size 596070136
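
The `config.json` change from `bfloat16` to `float32` is consistent with the checkpoint roughly doubling in size (298,041,696 → 596,070,136 bytes): fp32 stores four bytes per parameter versus two for bf16. A quick sanity check of that arithmetic (the small deviation from an exact 2x is plausibly safetensors header/metadata overhead, an assumption on my part):

```python
# Bytes per parameter for the two dtypes involved in this commit.
BYTES_BF16 = 2
BYTES_FP32 = 4

# Checkpoint sizes from the model.safetensors diff above.
old_size = 298_041_696  # bfloat16 checkpoint
new_size = 596_070_136  # float32 checkpoint

size_ratio = new_size / old_size
dtype_ratio = BYTES_FP32 / BYTES_BF16
print(f"size ratio:  {size_ratio:.5f}")  # very close to 2.0
print(f"dtype ratio: {dtype_ratio}")     # 2.0
```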