---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:174064
- loss:CosineSimilarityLoss
base_model: BAAI/bge-base-en
widget:
- source_sentence: Garlic Ganhiali Pickle. Description :Nutty Yogi Garlic Ganhiali
Pickle is hygienically processed from best quality gandhialo and garlic. This
traditional style pickle is a spicy and tasty complement to any meal. Its sour
and hot taste titillates the taste buds. This mouthwatering pickle is made from
gandhialo and garlic shreds, mixed with other spices and ingredients. It has natural
antioxidants, improving digestive health, providing vitamin C, and helps in generating
healthy gut-flora.!
sentences:
- Microwaveable Plastic Multiutility Bowl - Blue, New Coral, L2271 BL. Description
:It is made of high-quality food grade virgin plastic. These bowls come in beautiful
bright colours. Store fruits, dry fruits, snacks, biscuits etc. in these bowls.
These coral bowls are microwave safe, easy to clean and maintain.!
- Kung Pao Sauce. Description :Bechef Kung Pao Sauce is our take on Classic Sichuan
Chinese Dish of Kung Pao Chicken. This pure vegetarian sauce is a classic example
of Indo Chinese fusion where we have adapted the taste to Indian palate yet be
loyal to the original taste.!
- Popular Aluminium Outer Lid Pressure Cooker (10003). Description :The highly appreciated
Prestige Popular Plus comes with an induction base and host of other top-of-the-line
features. Manufactured from virgin Aluminium to ensure zero contamination, this
cooker boasts of the highest quality of raw materials used. Perfected over a period
of time with trademark Prestige engineering, expected the very best with up-to-date
innovations and features. Add to that, the elegant design and splendid finish,
the cooker is a sheer pleasure to cook with.Prestige Popular Plus base is Stainless
steel perforated (holes) plate, machine pressed to Aluminium cooker base. Suitable
for Induction cooktops and gas cooking, this versatile cooker provides you with
maximum utility.It is the first level of safety feature to release pressure above
1kg/cm2, which makes the cooking safe and time saving for you. It is made up of
brass with steel coating on it for durability.This is the 2nd level of safety
provide in Prestige pressure cooker, in case there is any blockage of vent tube,
the gasket will bulge and steam will release through the hole at the top. Thus,
making it extremely safe for usage.It is the 3rd level of safety feature top fitted
to the lid, to release excess steam when pressure, rises beyond a safe level.
Thus, making it extremely safe to use.Benefits: More Economical.Faster Cooking.!
- source_sentence: Biscuits - Marie. Description :Bisk Farm Marie Biscuits are half
sweet in taste and are arranged using the best normal ingredients. These biscuits
are a wonderful mixture of wheat flour, vegetable oil, sugar. These Marie biscuits
are light and crunchy and are low on calories. Without these biscuits, the tea
feels unfinished.!
sentences:
- Dark Waffy Premium Vanilla Flavoured Choco Wafer. Description :Dukes Dark Waffy
Vanilla Flavoured Wafers is a premium wafer that is layered with a yummy, creamy
vanilla flavour with a whole new mix of a chocolate dark crunchy wafer. It comes
with a delightful aroma and a delicious taste to cherish about. It is a wonderful
snack for a hungry stomach during the day or while going on short journeys.!
- Organic Maai Ka Ladoo - Sugar-Free, Ultra-Low GI, No Preservatives. Description
:Known all over India as an irresistible classic, we gave this traditional sweet
a healthy upgrade! Expertly crafted with only the best organic dry fruits, every
bite of this Maai ka Laddu is generously scrumptious and entirely guilt-free!
Our goal is to bring you an unparalleled experience of a comforting homemade dessert.
Using selective no nasty and all nutritious ingredients, it leaves you feeling
healthy and satisfied! Apt for people on carbohydrate-controlled diets and with
diabetes, D-Alive's Maai ka Laddu is your new one-stop craving, Helps lower and
stabilise blood sugar levels, keeps you fuller for longer and aids weight management.
In spite of being a low Glycemic Index (GI) & Low Carb, our pride lies in the
taste of the product.  Using selective ‘no nasty’ organic, ultra-low GI, nutrient-dense,
slow-releasing ingredients, this superfood leaves you feeling satisfied!!
- 'Organic - Til/Ellu White. Description :Sesame is an important ingredient in cooking.
Sesame seeds give a rich delicate nutty flavour. Almost invisible crunch to your
dishes. Sesame seeds are excellent sources of copper. A very good source of manganese
calcium phosphorus magnesium iron zinc molybdenum vitamin B1 selenium. Although
much of its calorie comes from fats, sesame contains several notable health-benefiting
nutrients, minerals, antioxidants and vitamins.
The seeds are especially rich in mono-unsaturated fatty acid, oleic acid, which
comprises up to 50 percent of fatty acids in them. Oleic acid helps lower LDL
or "bad cholesterol" and increases HDL or "good cholesterol" in the blood. The
seeds are also very valuable sources of dietary protein with fine quality amino
acids that are essential for growth, especially in children.!'
- source_sentence: Amritsari Mutton Curry 130 g + Goan Chicken Cafreal 115 g. Description
:Amritsari Mutton Curry:Full of flavour, free of preservatives, our mutton curry
is a 100% homestyle recipe, wrapped in the flavours and textures of Amritsari
homes. This ready to cook, easy to use curry paste gets a sumptuous meal for 4,
ready in just 20 minutes. Feed your urge to cook something new today with tasty
tales.  Explore. Play. Create.Goan Chicken Cafreal:Full of flavour and free of
preservatives, our Chicken Cafreal is a native homestyle recipe, wrapped in the
flavours & textures of a Goan home. This ready to cook, easy to use curry paste
gets a sumptuous meal for 4, ready in just 20 minutes. Feed your urge to cook
something new today with Tasty Tales - Explore. Play. Create.!
sentences:
- Trendy Stainless Steel Bottle With Sipper Cap - Steel Matt Finish, PXP 1002 DQ.
Description :Now free your environment, and yourself from the unhealthy plastic
bottles and get a healthier one-time product for all your needs. These high-grade
stainless steel bottles are here to enhance your dining and travelling experience,
saving you from the negative effects of plastic. The single-walled steel bottles
are perfect add-ons to your kitchen collection if you are looking for light-weighed,
durable, classy looking product. The bottle comes with sipper & wide mouth steel
cap, catering to double usage. Be it going to the gym, or sending it with the
kids to school, the colourful sipper can always make it a very convenient, handy
and more importantly a style-statement product. You can take it to the office
or just keep on the dinner table. Open the wide mouth lid and use at ease. The
bottles come with the major USP of inter-changeable lid facility. Now you can
make the same steel cap bottle as a sipper bottle by just interchanging the lids.
Hence, get 2 of two-in-one featured bottles of same model and capacities and get
the best of both, with a variety of colours!!
- Round Plastic Container - Black. Description :These storage containers are made
from high-quality plastic for everyday use. It seals the food effectively and
has an easily stackable design for smart storage. The food-grade quality of these
multipurpose airtight storage containers, with attractive design, make them hygienic
for use. They are freezer safe without lid and have a strong and durable body
for longevity.!
- Stainless Steel Lunch Box/Tiffin Set - Blue, BB 575 2. Description :Easily pack
lunch for your loved ones with this blue lunch box by Tedemei. Made of high-quality
stainless steel, the lunch box is sturdy, durable, and easy to clean. The lunch
box is airtight which helps in keeping the food fresh for long. It is a single
layer lunch box which lets you pack solid and liquid food separately. The lunch
box features flap and lock design which makes it easy to open and carry the lunch
box. The modish looking lunch boxes catches the eye.!
- source_sentence: New Extra Large. Description :Pampers baby diaper pants are the
only pants in India with new air channels providing your baby with a new type
of dryness overnight; breathable dryness. Magic Gel that locks the wetness away
for up to 12 hours of dryness. The new and improved product design enables a comfortable
fit, closer to the baby's body.A flexible waistband that adapts to the baby's
movements for a comfortable fit. Baby lotion with aloe vera helps protect your
baby's delicate skin from diaper rash and irritation. A top layer with the cotton-like
soft material, for a comfortable nights sleep. Fun exterior graphics; fun designs
and characters to enjoy with your baby.!
sentences:
- 'Nature''s Super Foods Organic Chana Dal. Description :Aashirvaad introduces an
organic certified range of Nature''s Super Foods, Organic Chana Dal. Chana Dal
is one of the most loved Indian food. It is used in multiple cultural foods across
the nation. It has proteins, carbohydrate, good fibre, Iron and vitamins to enrich
your daily intake of essential nutrients. Chana dal tastes like kernels of small
corns, it is light and easily digestible. 
Aashirvaad organic products are sourced and packed hygienically to ensure you
get the best taste and nutrition with its premium quality products.!'
- Spanish Olives - Pitted Green. Description :Fragata Pitted Green Olives are cured
or pickled and are well-known for their rich and mouth-watering flavoring as well
as for their dietary benefit. Olives include abundant antioxidants identified
as polyphones.!
- Surface Cleaner - Jasmine. Description :Special formulation kills maximum germs,
leaves a pleasant aroma as well as removes toughest stains!
- source_sentence: Extra Virgin Coconut Oil. Description :This cold pressed, pure,
natural, extra virgin coconut oil because of its high saturated fat content, it
is slow to oxidize and, thus, resistant to rancidification, lasting up to six
months at 24 DegreeC without spoiling. This is the purest form of coconut oil,
which retains all of its goodness.!
sentences:
- Wax Candles - Metal, Smokeless, White, CD 05. Description :Enrich the ambience
of the place as you place these captivating looking tealight candles. These are
white in colour and round in shape. They are filled with wax and wick inside.
It is suitable for decorating the house during the festive occasions and parties.
These are smokeless candles that do not leave any soot residue behind. Also, the
tealight candles burn fully without damaging your furniture or floor. Also, it
has 25 pieces.!
- Pearl - Skin Whitening Facial Kit. Description :With Organic Harvests 30 Minute
Makeover Pearl-Skin Whitening Facial Kit, you achieve quick salon-like results
at home, without spending your precious time in the salon. It comes with an assurance
that only certified organic ingredients come in contact with your skin. 30 minutes,
and you will feel that your skin is lily-white.!
- Rubber Gloves - Cotton Lined, Soft & Non Slip, Medium. Description :The Super
Strong Elbow Grease Rubber Gloves in large protect the hands from bacteria and
chemicals during cleaning tasks, ideal for dishwashing, scrubbing task and using
harmful chemicals. These high-quality designs are cotton lined, soft & non-slip
gloves for ease to use. The Elbow Grease Rubber Gloves are the only gloves you
will ever need. Great for domestic or commercial cleaning purpose.!
pipeline_tag: sentence-similarity
library_name: sentence-transformers
---
# SentenceTransformer based on BAAI/bge-base-en
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [BAAI/bge-base-en](https://huggingface.co/BAAI/bge-base-en). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [BAAI/bge-base-en](https://huggingface.co/BAAI/bge-base-en) <!-- at revision b737bf5dcc6ee8bdc530531266b4804a5d77b5d8 -->
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 768 dimensions
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
  (0): Transformer({'max_seq_length': 512, 'do_lower_case': True}) with Transformer model: BertModel
  (1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': True, 'pooling_mode_mean_tokens': False, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
  (2): Normalize()
)
```
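The last two modules can be illustrated without the library. The sketch below is a toy, pure-Python rendering of what CLS pooling followed by `Normalize()` does to the token vectors of one sentence; the function names `cls_pool` and `l2_normalize` and the 4-dimensional toy vectors are illustrative assumptions, not part of the model's code (the real embeddings have 768 dimensions).

```python
import math

def cls_pool(token_embeddings):
    """CLS pooling: use the first token's vector as the sentence embedding."""
    return token_embeddings[0]

def l2_normalize(vector):
    """Scale a vector to unit length (left unchanged if it is all zeros)."""
    norm = math.sqrt(sum(x * x for x in vector))
    return [x / norm for x in vector] if norm > 0 else list(vector)

# Toy input: 3 tokens with 4 dimensions each (hypothetical values).
tokens = [
    [3.0, 4.0, 0.0, 0.0],  # position 0 plays the role of [CLS]
    [1.0, 1.0, 1.0, 1.0],
    [0.0, 2.0, 0.0, 2.0],
]

sentence_embedding = l2_normalize(cls_pool(tokens))
print(sentence_embedding)  # [0.6, 0.8, 0.0, 0.0]
```

Because every output vector has unit length, the cosine similarity of two embeddings equals their dot product.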
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("mavihsrr/bge-final-small-retail-v2")
# Run inference
sentences = [
'Extra Virgin Coconut Oil. Description :This cold pressed, pure, natural, extra virgin coconut oil because of its high saturated fat content, it is slow to oxidize and, thus, resistant to rancidification, lasting up to six months at 24 DegreeC without spoiling. This is the purest form of coconut oil, which retains all of its goodness.!',
'Rubber Gloves - Cotton Lined, Soft & Non Slip, Medium. Description :The Super Strong Elbow Grease Rubber Gloves in large protect the hands from bacteria and chemicals during cleaning tasks, ideal for dishwashing, scrubbing task and using harmful chemicals. These high-quality designs are cotton lined, soft & non-slip gloves for ease to use. The Elbow Grease Rubber Gloves are the only gloves you will ever need. Great for domestic or commercial cleaning purpose.!',
'Wax Candles - Metal, Smokeless, White, CD 05. Description :Enrich the ambience of the place as you place these captivating looking tealight candles. These are white in colour and round in shape. They are filled with wax and wick inside. It is suitable for decorating the house during the festive occasions and parties. These are smokeless candles that do not leave any soot residue behind. Also, the tealight candles burn fully without damaging your furniture or floor. Also, it has 25 pieces.!',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
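One common next step is to read the nearest neighbour of each input off the similarity matrix. The following is a minimal sketch that works on a plain nested list; the helper name `most_similar` and the numbers in `toy_similarities` are made up for illustration and are not output of this model. With the snippet above, something like `similarities.tolist()` would presumably supply the real values.

```python
def most_similar(similarity_matrix):
    """For each row i, return (j, score) for the best-scoring column j != i."""
    results = []
    for i, row in enumerate(similarity_matrix):
        # Skip the diagonal, which is each sentence compared with itself.
        candidates = [(score, j) for j, score in enumerate(row) if j != i]
        score, j = max(candidates)
        results.append((j, score))
    return results

# Hypothetical 3x3 similarity matrix (symmetric, ones on the diagonal).
toy_similarities = [
    [1.00, 0.21, 0.35],
    [0.21, 1.00, 0.48],
    [0.35, 0.48, 1.00],
]

print(most_similar(toy_similarities))  # [(2, 0.35), (2, 0.48), (1, 0.48)]
```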
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 174,064 training samples
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>score</code>
* Approximate statistics based on the first 1000 samples:
| | sentence1 | sentence2 | score |
|:--------|:-------------------------------------------------------------------------------------|:------------------------------------------------------------------------------------|:----------------------------------------------------------------|
| type | string | string | float |
| details | <ul><li>min: 12 tokens</li><li>mean: 116.54 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 8 tokens</li><li>mean: 111.22 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 0.1</li><li>mean: 0.64</li><li>max: 0.97</li></ul> |
* Samples:
| sentence1 | sentence2 | score |
|:-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:--------------------------------|
| <code>Oil Clear Mud Face Pack. Description :Himalaya Oil Clear Mud Face Pack Rejuvenate your dead skin with Himalaya Oil Clear Mud Face Pack. This herbal formulation deep cleanses facial skin and clears clogged pores by absorbing excess oil and removing impurities. It helps maintain the natural pH of the skin and has deep cleansing and detoxifying properties, leaving the skin cleansed and revitalized. Fullers Earth removes deep-seated dirt and pollutants. It absorbs oil, clears clogged pores and blemishes and helps remove dead skin. Fullers Earth also helps lighten tanned skin caused by UV rays.!</code> | <code>Pure White Mineral Clay Anti Pollution Purity Face Wash Foam. Description :Giving your skin an oil overhaul doesn't have to be a drag. Oil stuck in your pores is what makes your skin feel oily again after a wash! POND'S Clay Foam and Mask is the most fun way to say goodbye oil and hello to an all-day matte glow. Made with 100% natural Moroccan clay that has 4x oil absorption power, it sucks out dirt and oil stuck deep within your pores. What's left behind? Skin that's glowing and matte all day long! Pond's Clay Foam is the most enjoyable and effective way to keep your skin oil-free for longer. Revolutionise face washing with the enriching power of Mineral Clay. One of the most efficacious ingredients in deep cleansing. Its enriched with skin-loving minerals to give you a bouncy glow. So, step up your deep cleansing regimen for an oil-free glow. The clay range comes in two exciting formats. The Pond's white beauty mineral clay foam brightens and smoothens your skin for an oil-free glow!...</code> | <code>0.9511584211850151</code> |
| <code>Essence - Butter Scotch. Description :Concentrate Butterscotch Essence For Sauces, Desserts, Baking And Cakes.Butterscotch Adds A Luscious Flavor Note To Mochas, Lattes And Other Hot, Frozen And Chilled Drinks.!</code> | <code>product<br>Icing Sugar Icing Sugar. Description :Icing Sugar is finel...<br>Icing Sugar Icing Sugar. Description :This finely granulat...<br>Name: combined, dtype: object</code> | <code>0.9643093974992689</code> |
| <code>Marie Light Biscuit - Vita Orange. Description :Sunfeast Marie Light orange offer crisp & light biscuits completed with the choicest golden grains of sun-ripened oranges and wheat. It presents the only Marie biscuit in India with a stimulating, delicious orange flavour. Whats more, there is 0% transfat and 0% cholesterol making it an appetisingly vigorous biscuit.!</code> | <code>Premium Wafer Bites - Dark Choco 100 g + Strawberry 100 g + Tiramisu 100 g. Description :Tasties brings you the Delicious Creamy & Crunchy Wafer Bites. Indulge in the taste of 5 wafers and 4 cream layered mini wafer bites with mouth-melting dark chocolate filling.<br>Tasties brings you the Delicious Creamy & Crunchy Wafer Bites. Indulge in the taste of 5 wafers and 4 cream layered mini wafer bites with mouth-melting strawberry filling.<br>Tasties brings you the Delicious Creamy & Crunchy Wafer Bites. Indulge in the taste of 5 wafers and 4 cream layered mini wafer bites with mouth Tiramisu hazelnut filling.<br><br>Munch on this and say bye to your small hunger pangs.!</code> | <code>0.8838966912863657</code> |
* Loss: [<code>CosineSimilarityLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosinesimilarityloss) with these parameters:
```json
{
"loss_fct": "torch.nn.modules.loss.MSELoss"
}
```
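In outline, this objective compares the cosine similarity of each embedded pair with its `score` and averages the squared differences. The sketch below reproduces that arithmetic in pure Python on toy 2-dimensional vectors; the helper names and the example values are assumptions for illustration, and the library's implementation operates on batched tensors rather than lists.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def cosine_similarity_mse(pairs, scores):
    """Mean squared error between cosine(u, v) and the target score."""
    errors = [(cosine(u, v) - s) ** 2 for (u, v), s in zip(pairs, scores)]
    return sum(errors) / len(errors)

# Hypothetical embeddings: one identical pair and one orthogonal pair.
pairs = [
    ([1.0, 0.0], [1.0, 0.0]),  # cosine = 1.0
    ([1.0, 0.0], [0.0, 1.0]),  # cosine = 0.0
]
scores = [0.9, 0.1]  # toy target similarities

loss = cosine_similarity_mse(pairs, scores)
print(loss)  # close to 0.01, since both pairs are off by 0.1
```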
### Evaluation Dataset
#### Unnamed Dataset
* Size: 21,759 evaluation samples
* Columns: <code>sentence1</code>, <code>sentence2</code>, and <code>score</code>
* Approximate statistics based on the first 1000 samples:
| | sentence1 | sentence2 | score |
|:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:----------------------------------------------------------------|
| type | string | string | float |
| details | <ul><li>min: 10 tokens</li><li>mean: 121.44 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 10 tokens</li><li>mean: 112.13 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 0.1</li><li>mean: 0.61</li><li>max: 0.97</li></ul> |
* Samples:
| sentence1 | sentence2 | score |
|:------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------|
| <code>Rose Glycerin Soap For Clean & Refreshed Skin -Cold Processed, 100% Natural & Organic. Description :Feel fresh, clean and refreshed with Rose which will leave your skin delicately scented with an uplifting rose fragrance. This soap does not use any animal product like milk or honey. It is completely vegan. Rose helps to improve the skin's appearance and to perfume the skin. It contains glycerin that softens the skin. It has no added preservatives and SLS and is 100% natural herbs and essential oils. It is a vegan product and is SLS free.!</code> | <code>product<br>Relax Moisturising Hand Wash - Lavender & Ylang-Ylang Relax Moisturising Hand Wash - Lavender & Ylan...<br>Relax Moisturising Hand Wash - Lavender & Ylang-Ylang Relax Moisturising Hand Wash - Lavender & Ylan...<br>Name: combined, dtype: object</code> | <code>0.9641479761938232</code> |
| <code>Dog Food - Focus Starter, Super Premium. Description :The Drools Focus, Super premium all breed formula for Puppies is formulated with the finest natural ingredients to help your dog live a long and healthy life. The result of exhaustive scientific research carried out over the years, by some of the most experienced veterinarians and nutritionists. Just like the rest of the Drools products, this one too is manufactured with a keen eye for detail and utmost care at Asias largest and most modern plant.!</code> | <code>Erina - Coat Cleanser. Description :Action : Dandruff control : Erina prevents the formation of dandruff on your pets skin and hair coat. Antimicrobial : Its antiseptic and antibacterial cleansing eliminates germs and improves overall skin hygiene. Erina protects the body against commonly found pathogens that cause itching and bacterial infections. Deodorant : Erinas deodorizing properties eliminate foul odor. Indications : For controlling dandruff in the hair coat. Prevention and management of pruritus (itching) and pyoder(superficial bacterial infection). Used in routine bathing as a cleanser to maintain a healthy coat.!</code> | <code>0.9112330093194662</code> |
| <code>Fruit & Food Nibbler With Silicone Sack - Green. Description :Introducing new foods to your babys diet can be a fun learning experience as it provides him or her with new varying tastes and flavours. With Mee Mee fruit and food nibbler, your child can safely enjoy fruit and other kinds of whole foods, without the risk of choking or hurting his or her mouth.!</code> | <code>Trendy Stainless Steel Bottle With Sipper Cap - Steel Matt Finish, PXP 1002 DQ. Description :Now free your environment, and yourself from the unhealthy plastic bottles and get a healthier one-time product for all your needs. These high-grade stainless steel bottles are here to enhance your dining and travelling experience, saving you from the negative effects of plastic. The single-walled steel bottles are perfect add-ons to your kitchen collection if you are looking for light-weighed, durable, classy looking product. The bottle comes with sipper & wide mouth steel cap, catering to double usage. Be it going to the gym, or sending it with the kids to school, the colourful sipper can always make it a very convenient, handy and more importantly a style-statement product. You can take it to the office or just keep on the dinner table. Open the wide mouth lid and use at ease. The bottles come with the major USP of inter-changeable lid facility. Now you can make the same steel cap bottle as ...</code> | <code>0.14806349984585232</code> |
* Loss: [<code>CosineSimilarityLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#cosinesimilarityloss) with these parameters:
```json
{
"loss_fct": "torch.nn.modules.loss.MSELoss"
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 64
- `per_device_eval_batch_size`: 64
- `learning_rate`: 2e-05
- `warmup_ratio`: 0.1
- `bf16`: True
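
To see what these settings imply, the sketch below derives the step counts from the values reported in this card (174,064 samples, batch size 64, 3 epochs, `gradient_accumulation_steps` of 1) and evaluates a linear warmup and decay curve. It assumes training on a single device and the usual linear-schedule shape; the function `linear_lr` is a hypothetical helper, not the trainer's code.

```python
import math

# Values reported in this card.
TRAIN_SAMPLES = 174_064
BATCH_SIZE = 64
EPOCHS = 3
BASE_LR = 2e-5
WARMUP_RATIO = 0.1

# Assumption: one device, so each optimizer step consumes one batch of 64.
steps_per_epoch = math.ceil(TRAIN_SAMPLES / BATCH_SIZE)
total_steps = steps_per_epoch * EPOCHS
warmup_steps = math.ceil(total_steps * WARMUP_RATIO)

def linear_lr(step):
    """Linear warmup from 0 to BASE_LR, then linear decay back to 0."""
    if step < warmup_steps:
        return BASE_LR * step / warmup_steps
    remaining = max(0, total_steps - step)
    return BASE_LR * remaining / (total_steps - warmup_steps)

print(steps_per_epoch, total_steps, warmup_steps)  # 2720 8160 816
print(round(2700 / steps_per_epoch, 4))  # 0.9926
```

The second printed value matches the epoch recorded at step 2700 in the training logs, which is consistent with 2,720 steps per epoch.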
#### All Hyperparameters
<details><summary>Click to expand</summary>
- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 64
- `per_device_eval_batch_size`: 64
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 2e-05
- `weight_decay`: 0.0
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 3
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: True
- `fp16`: False
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: False
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: None
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `include_for_metrics`: []
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`:
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `use_liger_kernel`: False
- `eval_use_gather_object`: False
- `average_tokens_across_devices`: False
- `prompts`: None
- `batch_sampler`: batch_sampler
- `multi_dataset_batch_sampler`: proportional
</details>
### Training Logs
| Epoch | Step | Training Loss | Validation Loss |
|:------:|:----:|:-------------:|:---------------:|
| 0.0460 | 500 | 0.1008 | - |
| 0.0092 | 100 | 0.0515 | 0.0453 |
| 0.0184 | 200 | 0.0532 | - |
| 0.0368 | 100 | 0.0491 | 0.0393 |
| 0.0735 | 200 | 0.0427 | 0.0333 |
| 0.1103 | 300 | 0.0373 | 0.0257 |
| 0.1471 | 400 | 0.0294 | 0.0188 |
| 0.1838 | 500 | 0.0212 | 0.0169 |
| 0.2206 | 600 | 0.0174 | 0.0131 |
| 0.2574 | 700 | 0.0145 | 0.0123 |
| 0.2941 | 800 | 0.0125 | 0.0094 |
| 0.3309 | 900 | 0.0109 | 0.0103 |
| 0.3676 | 1000 | 0.0102 | 0.0086 |
| 0.4044 | 1100 | 0.0075 | 0.0088 |
| 0.4412 | 1200 | 0.0077 | 0.0076 |
| 0.4779 | 1300 | 0.0071 | 0.0070 |
| 0.5147 | 1400 | 0.0070 | 0.0072 |
| 0.5515 | 1500 | 0.0065 | 0.0068 |
| 0.5882 | 1600 | 0.0058 | 0.0073 |
| 0.6250 | 1700 | 0.0064 | 0.0075 |
| 0.6618 | 1800 | 0.0057 | 0.0062 |
| 0.6985 | 1900 | 0.0055 | 0.0060 |
| 0.7353 | 2000 | 0.0054 | 0.0071 |
| 0.7721 | 2100 | 0.0055 | 0.0062 |
| 0.8088 | 2200 | 0.0050 | 0.0065 |
| 0.8456 | 2300 | 0.0064 | 0.0061 |
| 0.8824 | 2400 | 0.0046 | 0.0056 |
| 0.9191 | 2500 | 0.0045 | 0.0051 |
| 0.9559 | 2600 | 0.0042 | 0.0051 |
| 0.9926 | 2700 | 0.0046 | 0.0055 |
| 1.0294 | 2800 | 0.0041 | 0.0053 |
| 1.0662 | 2900 | 0.0050 | 0.0057 |
| 1.1029 | 3000 | 0.0033 | 0.0055 |
| 1.1397 | 3100 | 0.0037 | 0.0054 |
| 1.1765 | 3200 | 0.0040 | 0.0052 |
| 1.2132 | 3300 | 0.0038 | 0.0049 |
| 1.2500 | 3400 | 0.0038 | 0.0047 |
| 1.2868 | 3500 | 0.0035 | 0.0052 |
| 1.3235 | 3600 | 0.0034 | 0.0048 |
| 1.3603 | 3700 | 0.0035 | 0.0049 |
| 1.3971 | 3800 | 0.0034 | 0.0045 |
| 1.4338 | 3900 | 0.0037 | 0.0048 |
| 1.4706 | 4000 | 0.0036 | 0.0047 |
| 1.5074 | 4100 | 0.0031 | 0.0046 |
| 1.5441 | 4200 | 0.0039 | 0.0045 |
| 1.5809 | 4300 | 0.0033 | 0.0046 |
| 1.6176 | 4400 | 0.0033 | 0.0047 |
| 1.6544 | 4500 | 0.0035 | 0.0047 |
| 1.6912 | 4600 | 0.0029 | 0.0047 |
| 1.7279 | 4700 | 0.0035 | 0.0046 |
| 1.7647 | 4800 | 0.0033 | 0.0046 |
| 1.8015 | 4900 | 0.0030 | 0.0046 |
| 1.8382 | 5000 | 0.0027 | 0.0045 |
| 1.8750 | 5100 | 0.0030 | 0.0043 |
| 1.9118 | 5200 | 0.0031 | 0.0046 |
| 1.9485 | 5300 | 0.0029 | 0.0045 |
| 1.9853 | 5400 | 0.0030 | 0.0044 |
| 2.0221 | 5500 | 0.0031 | 0.0044 |
| 2.0588 | 5600 | 0.0028 | 0.0044 |
| 2.0956 | 5700 | 0.0032 | 0.0044 |
| 2.1324 | 5800 | 0.0027 | 0.0043 |
| 2.1691 | 5900 | 0.0032 | 0.0043 |
| 2.2059 | 6000 | 0.0029 | 0.0043 |
| 2.2426 | 6100 | 0.0028 | 0.0043 |
| 2.2794 | 6200 | 0.0028 | 0.0045 |
| 2.3162 | 6300 | 0.0032 | 0.0043 |
| 2.3529 | 6400 | 0.0026 | 0.0043 |
| 2.3897 | 6500 | 0.0026 | 0.0043 |
| 2.4265 | 6600 | 0.0024 | 0.0044 |
| 2.4632 | 6700 | 0.0024 | 0.0042 |
| 2.5000 | 6800 | 0.0028 | 0.0043 |
| 2.5368 | 6900 | 0.0026 | 0.0043 |
| 2.5735 | 7000 | 0.0028 | 0.0042 |
| 2.6103 | 7100 | 0.0024 | 0.0043 |
| 2.6471 | 7200 | 0.0023 | 0.0042 |
| 2.6838 | 7300 | 0.0027 | 0.0041 |
| 2.7206 | 7400 | 0.0024 | 0.0041 |
| 2.7574 | 7500 | 0.0030 | 0.0041 |
| 2.7941 | 7600 | 0.0030 | 0.0041 |
| 2.8309 | 7700 | 0.0028 | 0.0041 |
| 2.8676 | 7800 | 0.0029 | 0.0041 |
| 2.9044 | 7900 | 0.0026 | 0.0041 |
| 2.9412 | 8000 | 0.0022 | 0.0041 |
| 2.9779 | 8100 | 0.0023 | 0.0041 |
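
The log above can be inspected programmatically, for example to find the step with the lowest validation loss or to estimate how many optimizer steps make up one epoch. The following is a minimal sketch using only the Python standard library. The helper names (`parse_training_log`, `best_validation_row`, `estimate_steps_per_epoch`) are hypothetical and not part of Sentence Transformers, and the embedded table is a four-row excerpt of the log rather than the full table.

```python
from typing import Optional

# Excerpt of the training log table (assumption: same 4-column layout as above).
LOG_EXCERPT = """
| Epoch | Step | Training Loss | Validation Loss |
|:------:|:----:|:-------------:|:---------------:|
| 0.0460 | 500 | 0.1008 | - |
| 2.6471 | 7200 | 0.0023 | 0.0042 |
| 2.6838 | 7300 | 0.0027 | 0.0041 |
| 2.9412 | 8000 | 0.0022 | 0.0041 |
"""


def parse_training_log(markdown_table: str) -> list:
    """Turn the Markdown rows into dicts, skipping header and separator rows."""
    rows = []
    for line in markdown_table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        if len(cells) != 4:
            continue
        try:
            epoch = float(cells[0])
            step = int(cells[1])
            training_loss = float(cells[2])
        except ValueError:
            continue  # header or alignment row, not data
        # A "-" means no evaluation was run at that step.
        validation_loss: Optional[float] = (
            None if cells[3] == "-" else float(cells[3])
        )
        rows.append(
            {
                "epoch": epoch,
                "step": step,
                "training_loss": training_loss,
                "validation_loss": validation_loss,
            }
        )
    return rows


def best_validation_row(rows: list) -> dict:
    """Return the earliest row with the lowest recorded validation loss."""
    evaluated = [row for row in rows if row["validation_loss"] is not None]
    return min(evaluated, key=lambda row: row["validation_loss"])


def estimate_steps_per_epoch(row: dict) -> int:
    """Estimate steps per epoch from a single (epoch, step) pair."""
    return round(row["step"] / row["epoch"])


rows = parse_training_log(LOG_EXCERPT)
best = best_validation_row(rows)
print(best["step"], best["validation_loss"])
print(estimate_steps_per_epoch(rows[-1]))
```

Applying `estimate_steps_per_epoch` to individual rows is also a quick consistency check: the first three rows of the table give a different steps-per-epoch ratio from the rest, which suggests they were logged under a different configuration.
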
### Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.3.1
- Transformers: 4.47.1
- PyTorch: 2.1.0+cu118
- Accelerate: 1.2.1
- Datasets: 3.2.0
- Tokenizers: 0.21.0
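
To reproduce results, it can help to compare a local environment against the versions listed above. The sketch below uses `importlib.metadata` from the standard library. The PyPI distribution names in `EXPECTED_VERSIONS` (for example `torch` for PyTorch) are assumptions, and `compare_versions` is a hypothetical helper, not part of any of the listed libraries.

```python
from importlib import metadata

# Versions from the list above, keyed by assumed PyPI distribution name.
EXPECTED_VERSIONS = {
    "sentence-transformers": "3.3.1",
    "transformers": "4.47.1",
    "torch": "2.1.0+cu118",
    "accelerate": "1.2.1",
    "datasets": "3.2.0",
    "tokenizers": "0.21.0",
}


def compare_versions(expected: dict) -> dict:
    """Map each package to (expected, installed, matches).

    `installed` is None when the package is not installed.
    """
    report = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None
        report[name] = (wanted, installed, installed == wanted)
    return report


if __name__ == "__main__":
    for name, (wanted, installed, ok) in compare_versions(EXPECTED_VERSIONS).items():
        status = "ok" if ok else "differs"
        print(f"{name}: expected {wanted}, installed {installed} ({status})")
```

A mismatch does not necessarily mean the model will fail to load; it only flags where the environment differs from the one used for training.
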
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}
```