m7n's picture
Add new SentenceTransformer model
479a734 verified
---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:43494
- loss:TripletLoss
base_model: allenai/specter2_aug2023refresh_base
widget:
- source_sentence: As a result of technological progress, environmental aspects and
social change, the automotive industry is undergoing a radical transformation.
The focus is no longer on the product "vehicle" but much more on the mobility
service itself and the users individual experience and well-being during travel
time. In that field of innovation, the study deals with a explorative investigation
of using the travel time for a improvement of the mental health of the passenger.
The vision is to integrate breathwork relaxation in combination with a human centric
lighting scenario as an immersive service within luxury ride-hailing vehicles
to enhance the mental health during automated rides and utilizing the time spent
in cars for personal pleasure. To enable a user-centered and experimental approach,
a test vehicle from the non-profit company bq.Labs was equipped with the bq breath
work app and a spezialized LED-based lighting screen that was developed by Fraunhofer.
The effects were tested on randomly selected and voluntary users in a guerrilla
testing at three different locations in San Diego. The tests explored user acceptance
of the innovative technologies by combining surveys, vital data collection, qualitative
interviews and observations. Initial data analysis provides insights into the
feasibility and potential effects on well-being and user perception. The study
illustrates those innovations in the field of mobility, involve systemic dependencies
and considerations beyond technology, encompassing social and psychological dimensions.
It underscores that successful innovations require a holistic, user-centered approach
that considers technological, social, and psychological dimensions. The findings
lay the groundwork for future research and development of innovation strategies
in the evolving field of mobility and personalized strength.
sentences:
- 'We examined casual decision-making among a group of participants, which frequently
occurs in daily life. In such a situation, participants do not have strong preferences
for the decision. In addition, because the process of decision-making among people
is part of the time they spend together, it is important to feel enjoyment in
the process and satisfaction with the final decision. In this paper, we propose
a game mechanism for generating a sense of enjoyment in the decision-making process
through communication and a sense of acceptance of the final decision. We experimentally
compared two ways to make decisions about beverages: ) majority voting and ) the
proposed game. In the latter case, the participants enjoyed playing the game and
were satisfied with the decision-making process.'
- This paper presents several important factors affecting the resale prices of used
rental cars. In fact, this paper empirically shows and proves several conjectures
regarding the determinants for used rental car resale values through the use of
detailed micro data from one of the biggest rental car companies. Specifically,
the age of a used car has two composite effects on its resale value, even though
overall the two effects work negatively with a concavity, as rental cars ages.
On the other hand, two mileage variables interact with each other and produce
overall decreasing effects on the resale prices with the opposite interactions.
In terms of the effects of brand image, Hyundai and Renault-Samsung have positive
effects on resale values generally. Ssangyong has a positive effect on the resale
values in the SUV category, and Kia and GM-Daewoo are generally inferior to the
other brands in terms of resale values in all categories. In terms of seasonal
effects, we can conclude that this paper confirms the general perception regarding
seasonal effects on resale values. In details, from November to February, resale
values are affected negatively, and March is the recovering month of increasing
demand in the used car market. August seems to be the highest season for the used
car market due to several demand increases. As a result, this paper plays an important
role in providing a substantial amount of information on the factors affecting
the resale prices of rental cars.
- In this paper we present an approach used to enhance students' competency in software
verification. Students were asked to apply software verification techniques to
a complex formal specification system. The complexity of the system stems from
its sophisticated requirements. Selecting such system for this study was intentional
for the following two reasons ) the system is difficult to understand and analyze
because of the domain knowledge required to generate formal specifications in
temporal logic and ) the system is large and complex which lends itself to a wide
range of applicable verification techniques, and thus highlights the differences
in the capabilities of each of the software verification approaches. Students
were assessed using multiple criteria including; examination in applying learned
techniques, students' attitude toward the technique, perceived efficiency of the
techniques in discovering software defects, and the ability of the technique to
locate errors in the code beyond simply indicating their presence. The results
of this work show that the students applied the learned techniques successfully
and their attitudes towards software verification improved.
- source_sentence: EnglishThe literature has argued that, contrary to what claimed
by the rational economic theory, trade unions have progressively moved towards
the representation of atypical workers by adopting more inclusive strategies of
collective bargaining. The strength and modalities of such strategies are affected
by national institutions of labour market and company-level union representation
to which trade unions can draw in workplaces. Within this context, still remain
to be discovered how the aforementioned institutions are enacted and in what subjects
of employment relations can be used by unions in order to protect atypical work.
This paper deals with these issues. It analyzes how unions have used distinctive
institutional factors with regards of both external and interna! flexibility and
in reference to regular and temporary workers to be able to improve the working
conditions of atypical work. Trade unions negotiated a promotion system to permanent
positions and allowed temporary workers to develop the same skills acquired by
regular employees, which were also beneficial for permanent workers' employment
conditions. The defense of regular workers' employrnent conditions was crucial
in order to maintain an inclusive strategy of collective bargaining. italianoIntroduzione.
- Contesto istituzionale e mobilizazione delle risorse. - Flessibilita ed interessi
della forza del lavoro atipica e regolare. - Disegno e metodo della ricerca. -
La strategia di contrattazione colletiva inclusiva fra flessibilita esterna ed
interna. - Analisi e discussione. - Conclusioni.
sentences:
- Much has been invested in big data and artificial intelligence-based solutions
for healthcare. However, few applications have been implemented in clinical practice.
Early economic evaluations can help to improve decision-making by developers of
analytics underlying these solutions aiming to increase the likelihood of successful
implementation, but recommendations about their use are lacking. The aim of this
study was to develop and apply a framework that positions best practice methods
for economic evaluations alongside development of analytics, thereby enabling
developers to identify barriers to success and to select analytics worth further
investments.
- An in situ field test on nine commonly-used soil water sensors was carried out
in a sandy loam soil located in the Potato Research Center, Fredericton, NB (Canada)
using the gravimetric method as a reference. The results showed that among the
tested sensors, regardless of installation depths and soil water regimes, CS000,
Trase, and Troxler performed the best with the factory calibrations, with a relative
root mean square error (RRMSE) of , , and %, and a r( ) of , , and , respectively.
TRIME, Moisture Point (MP000), and Gopher performed slightly worse with the factory
calibrations, with a RRMSE of , , and %, and a r( ) of , , and , respectively,
while the Gypsum, WaterMark, and Netafim showed a frequent need for calibration
in the application in this region.
- 'The article proposes a comparison between British devolution and Italian one,
both have occurred at about the same time (from the end of Nineties years until
now), looking what is common in devolution process inside two cultural and institutional
context deeply different. About the constitutional innovation, British and Italian
political systems know different method to pass a reform: in British system, Westminster
parliament is sovereign not only in ordinary law-making but above all in constitutional
matter (this is the meaning of parliament sovereignty in Dicey''s thought); in
Italian system, constitutional power isn''t on the same degree of ordinary law,
because the parliament makes ordinary law and an ad hoc convention makes the constitution
(or at least its fundamental reforms), as it''s in French tradition. In spite
of so, it''s possible to see a common element in British and Italian devolution,
on the side of its limits: that is the difficult to compatible the post-centralistic
state and its fiscal autonomy with the universalistic principles of welfare state.
This may be one of the mains challenge that Western states will have to face,
looking for a new political balances for the new era that follows the cold war
end.'
- source_sentence: 'Objective Compared with povidone iodine solution to clean,glutaraldehyde
immersion,highpressure steam sterilization of three disinfection methods for dental
handpiece sterilization effect.Methods Select the dental clinical used over phone,were
randomly divided into A group,B group,C group,D group ,A group for the control
group,only cleaning method did not use any disinfection,B group was % Polyvinylpyrrolidone
iodine solution,wipe,C group were soaked in % glutaraldehyde soluton,D group were
treated with high-pressure steam sterilization,after each phone were inoculated
with bacteria sample dish,the more monitoring of four groups of bacterial culture.Results
A group of bacterial culture for the intensive growth of bacteria,B group had
bacterial growth for the(+ ~+ + +),bacterial growth;C group had bacterial culturegrowth
was(+ ~ + +),bacterial growth;D group of bacterial culture Growth of(-),no bacterial
growth.Conclusion High-pressure steam sterilization was the disinfection of dental
handpieces most effective way. Key words: Disinfection;Dental high-speed equipment'
sentences:
- 'Objective To study the self-locking brackets SmartclipTM 0MXTM MBTTM brackets
and traditional pain comparison.Methods patients with non-extraction orthodontic
treatment were randomly divided into two groups,a group treated with self-locking
brackets,the other group treated with traditional care slot.Patients in orthodontic
treatment of pain within a week were inoestigated by way of a questionnaire survey,including
orthodontic pain,soft tissue irritation,and the strength of a normal life for
patients with the impact.Results Questionnaire response rate was %.The level of
pain was similar in self-ligating bracket group and the traditional bracket group.However,time-related,including
pain after orthodontic treatment was 0h,0 d time,the most intense pain and continued
to 0d,back pain relief,0w about pain relief.Conclusion Self-locking brackets and
brackets have noobvious pain intensity differences,but related with orthodontic
force to the clinical use of force should pay attention to light. Key words: Self-ligating
bracket;Traditional brackets;Orthodontic treatment;Pain'
- Simulation of inflorescences is an important part of virtual plant growth. The
past works about simulation of inflorescences focus mainly on how to generate
inflorescences by However, the productions of L system are difficult to understand
and implement since it is described with rule based language, and especially it
needs too many parameters in simulating inflorescence development and flowering
sequences. Dual scale automaton is a plant growth model based on plant growth
mechanisms, which is easy to understand and implement in programming. In this
paper, the method of simulating inflorescence using dual scale automaton model
is discussed. The dual scale automaton model is improved by introducing the rule
of synchronization development, mechanism of reiteration and delay law of plant
growth from the viewpoints of botany,which make it possible to generate almost
all types of the inflorescences defined by botanists, and to simulate acropetal
and basipetal flowering sequences. Several examples of simulation of typical inflorescences
are given for explaining the theory. The improved model is demonstrated a simpler
but more effective method in simulating inflorescences in comparison with L system.
- 'I N late and through the summer of , the York County Court launched a concerted
attack against Quakers in its part of Massachusetts.* York county magistrate Richard
Waldron arrested three visiting Quaker women and had them beaten out of the jurisdiction;
then, apparently in response to his recommendations, the court proceeded to cite
local Quakers living in Kittery for their failure to attend orthodox church services.l
Although Waldron''s actions did not occur until five years after the General Court
had begun its program of suppressing Quakerism, they seemed consistent with the
general pattern of persecution in seventeenth-century Massachusetts Bay: the discovery
of heterodoxy followed by an immediate attempt to produce local conformity.0 Yet
the considerable delay in Kittery in attacking Quakerism and the lack of any subsequent
systematic effort to produce conformity raise a number of questions regarding
the extent to which prejudice against heterodoxy was the sole motive for suppressing
Quakerism in Massachusetts. Between and the York County Court attacked Quaker
heterodoxy only on a limited number of occasions. Each of the incidents suggests
that Kittery Quakers were punished not because their religious beliefs offended
the court but because those beliefs denoted certain positions on secular issues,
and men like Waldron could employ ecclesiastical sanctions to enlist sources'
- source_sentence: Androgen therapy is the mainstay of treatment for the hypogonadotropic
hypogonadal micropenis because it obviously enhances penis growth in prepubescent
microphallic patients. However, the molecular mechanisms of androgen treatment
leading to penis growth are still largely unknown. To clarify this well-known
phenomenon, we successfully generated a castrated male Sprague Dawley rat model
at puberty followed by testosterone administration. Interestingly, compared with
the control group, testosterone treatment stimulated a dose-dependent increase
of penis weight, length, and width in castrated rats accompanied with a dramatic
recovery of the pathological changes of the penis. Mechanistically, testosterone
administration substantially increased the expression of androgen receptor (AR)
protein. Increased AR protein in the penis could subsequently initiate transcription
of its target genes, including keratin 00B (Krt00b). Importantly, we demonstrated
that KRT00B is generally expressed in the rat penis and that most KRT00B expression
is cytoplasmic. Furthermore, AR could directly modulate its expression by binding
to a putative androgen response element sequence of the Krt00b promoter. Overall,
this study reveals a novel mechanism facilitating penis growth after testosterone
treatment in precastrated prepubescent animals, in which androgen enhances the
expression of AR protein as well as its target genes, such as Krt00b.
sentences:
- This study develops statistical learning models to assess the probability of undergraduate
students graduating within a predetermined period, utilizing admission, performance,
and demographic data. The urgency of addressing student attrition is highlighted
by recent data from the National Center for Education Statistics (NCES), indicating
a % completion rate by full-time undergraduates within six years. This research
leverages institutional data from a Saudi University, focusing on freshmen enrolled
in the - and - academic years, to identify students at risk of dropping out, thereby
enabling timely interventions. Ten algorithms, including decision trees, ensemble
models, SVM, and ANN, were built and evaluated on a test set representing % of
the entire dataset using precision, recall, accuracy, and Matthews correlation
coefficient (MCC). The findings show that SVM and Random Forest models were the
most reliable, achieving accuracies of and respectively, and maintaining balance
in precision, recall, and MCC. Conversely, the naive Bayes model recorded the
worst performance. The comparative analysis revealed the superior performance
of ensemble models over decision tree models in predicting student attrition,
emphasizing the importance of model selection in developing effective early intervention
strategies. In addition, our analysis revealed that academic data is a better
predictor of on-time graduation than admission data, emphasizing the need for
institutions to focus on continuous academic assessment data.
- 'Timely and accurate prediction of human movement in urban areas offers instructive
insights into transportation management, public safety, and location-based services,
to name a few. Yet, modeling urban mobility is challenging and complex because
of the spatiotemporal dynamics of movement behavior and the influence of exogenous
factors such as weather, holidays, and local events. In this paper, we use bus
transportation as a proxy to mine spatiotemporal travel patterns. We propose a
deep-learning-based urban mobility prediction model that collectively forecasts
passenger flows between pairs of city regions in an origin-destination (OD) matrix.
We first process OD matrices in a convolutional neural network to capture spatial
correlations. Intermediate results are reconstructed into three multivariate time
series: hourly, daily, and weekly time series. Each time series is aggregated
in a long short-term memory (LSTM) network with a novel attention mechanism to
guide the aggregation. In addition, our model is context-aware by using contextual
embeddings learned from exogenous factors. We dynamically merge results from LSTM
components and context embeddings in a late fusion network to make a final prediction.
The proposed model is implemented and evaluated using a large-scale transportation
data set of more than million bus trips with a suite of Big Data technologies
developed for data processing. Through performance comparison, we show that our
approach achieves sizable accuracy improvements in urban mobility prediction.
Our work has major implications for efficient transportation system design and
performance improvement. The proposed deep neural network structure is generally
applicable for sequential graph data prediction.'
- Various methods are currently under investigation to preserve fertility in males
treated with high-dose chemotherapy and radiation for malignant and nonmalignant
disorders. Human umbilical cord mesenchymal stem cells (HUC-MSCs), which possess
potent immunosuppressive function and secrete various cytokines and growth factors,
have the potential clinical applications. As a potential alternative, we investigate
whether injection of HUC-MSCs into the interstitial compartment of the testes
to promote spermatogenic regeneration efficiently. HUC-MSCs were isolated from
different sources of umbilical cords and injected into the interstitial space
of one testis from busulfan-treated mice (saline and HEK000 cells injections were
performed in a separate set of mice) and the other testis remained uninjected.
Three weeks after MSCs injection, Relative quantitative reverse transcription
polymerase chain reaction was used to identify the expression of of germ cell
associated, which are all related to meiosis, demonstrated higher levels of spermatogenic
gene expression ( fold) in HUC-MSCs injected testes compared to the contralateral
uninjected testes (five mice). Protein levels for germ cell-specific genes, miwi,
vasa and synaptonemal complex protein (Scp0) were also higher in MSC-treated testes
compared to injected controls weeks after treatment. However, no different expression
was detected in saline water and HEK000 cells injection control group. We have
demonstrated HUC-MSCs could affect mouse germ cell-specific genes expression.
The results also provide a possibility that the transplanted HUC-MSCs may promote
the recovery of spermatogenesis. This study provides further evidence for preclinical
therapeutic effects of HUC-MSCs, and explores a new approach to the treatment
of azoospermia.
- source_sentence: Twenty one surviving infants of pregnancies complicated by rupture
of the membranes during the second trimester that lasted at least one week have
been followed up for a median of months. Five infants ( %) had recurrent respiratory
problems (episodes of wheezing and coughing occurring at least once a week) which
related significantly to the use of neonatal ventilation and to very preterm delivery.
Five of the infants who were born preterm and with birth weights of less than
g had recurrent respiratory symptoms ( %). This compares favourably with an incidence
of symptoms of % among surviving low birthweight infants born at this hospital
after pregnancies not complicated by premature rupture of the membranes. Neither
recurrent respiratory symptoms nor admission to hospital for chest related disorders
were associated with the timing of onset or duration of rupture of the membranes.
We conclude that, among survivors of premature rupture of the membranes, chronic
respiratory morbidity would best be prevented by avoiding very preterm delivery,
regardless of the duration of the rupture.
sentences:
- We report a case of prosthetic valve nocardia endocarditis. A year old farmer
underwent aortic valve replacement with a bioprosthetic valve. The immediate post-operative
course was uneventful but weeks later he developed fever. A trans-oesophageal
echocardiogram (TEE) showed a string like structure attached to the prosthetic
valve. Blood cultures grew N. farcinica. He was initially treated with trimethoprim/sulfamethoxazole
(TMP/SMZ), but due to eosinophilia and leucopenia his treatment was changed to
imipenem and amikacin. He developed a rash, presumed to be due to imipenem, which
was then substituted with linezolid. He completed a week course of intravenous
(i.v.) antibiotics. Desensitization with amoxicillin/clavulanic acid was successful
and the patient received oral amoxicillin/clavulanic acid for months. At present,
months from diagnosis, he is afebrile and TEE is normal. To our knowledge, this
case is the fifth reported case of successful treatment of prosthetic valve nocardia
endocarditis treated without surgery.
- 'The effect of different variants of compiling integrated samples for biochemical
oxygen demand (BOD) kinetics was studied in long-term experiments (up to days)
with water samples taken from the central deep-water region of Lake Onego. It
was a series of experiments carried out simultaneously at and in different seasons
of . Five sampling variants were employed with different horizon combinations:
near surface, near bottom, from different depths in the water column, from the
photic and profundal layers. Two experiments were performed with winter water,
three with summer water, four with autumn water, and seven experiments with spring
water. The most representative sample for studying BOD in long-term experiments
is an sample composed of water from different horizons of the photic layer ( m).
For each variant of integrated sample composition, BOD development in the experiments
was modeled by a corresponding kinetic equation whose parameters represented the
oxidation characteristics of components of the organic matter present in the water
and transformed in the long-term BOD experiment. The resultant kinetic parameters
of BOD were analyzed in relation to the factors determining the final oxidation
of the organic matter components. The patterns in which the type of BOD development
is formed depend on the integrated water sample collection/compilation conditions
and are characterized by the average values of the organic matter contained in
the water, estimated either analytically or from empirical equations, as well
as by the temperature of exposure of water samples in the experiment. Synthesis
of the resultant information showed that the values of BOD kinetic parameters
were generally lower in spring water taken from the central part of Lake Onego
as compared with other seasons, since the oxidation potential of organic matter
components in spring water is higher.'
- Doppler ultrasound measurements of pulmonary blood flow in babies with severe
respiratory distress syndrome treated in a randomised controlled trial of surfactant
replacement showed that the immediate improvement of oxygenation was not associated
with a significant increase in pulmonary blood flow. Reduction in ventilator settings
and increases in the extent of chest wall movements measured by a cardiorespiratory
monitor suggested that the improvement after surfactant had been given was a result
of alveolar stabilisation and increased pulmonary compliance. Further simultaneous
studies of pulmonary blood flow and pulmonary compliance are needed to confirm
these findings.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
- cosine_accuracy
model-index:
- name: SentenceTransformer based on allenai/specter2_aug2023refresh_base
results:
- task:
type: triplet
name: Triplet
dataset:
name: discipline tuned specter 2 022
type: discipline-tuned_specter_2_022
metrics:
- type: cosine_accuracy
value: 0.9713793103448276
name: Cosine Accuracy
- task:
type: triplet
name: Triplet
dataset:
name: discipline tuned specter 2 024
type: discipline-tuned_specter_2_024
metrics:
- type: cosine_accuracy
value: 0.9710344827586207
name: Cosine Accuracy
---
# SentenceTransformer based on allenai/specter2_aug2023refresh_base
This is a [sentence-transformers](https://www.SBERT.net) model finetuned from [allenai/specter2_aug2023refresh_base](https://huggingface.co/allenai/specter2_aug2023refresh_base). It maps sentences & paragraphs to a 768-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [allenai/specter2_aug2023refresh_base](https://huggingface.co/allenai/specter2_aug2023refresh_base) <!-- at revision 084e9624d354a1cbc464ef6cc1e3646d236b95d9 -->
- **Maximum Sequence Length:** 512 tokens
- **Output Dimensionality:** 768 dimensions
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
(0): Transformer({'max_seq_length': 512, 'do_lower_case': False}) with Transformer model: BertModel
(1): Pooling({'word_embedding_dimension': 768, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
(2): Normalize()
)
```
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("m7n/discipline-tuned_specter_2_024")
# Run inference
sentences = [
'Twenty one surviving infants of pregnancies complicated by rupture of the membranes during the second trimester that lasted at least one week have been followed up for a median of months. Five infants ( %) had recurrent respiratory problems (episodes of wheezing and coughing occurring at least once a week) which related significantly to the use of neonatal ventilation and to very preterm delivery. Five of the infants who were born preterm and with birth weights of less than g had recurrent respiratory symptoms ( %). This compares favourably with an incidence of symptoms of % among surviving low birthweight infants born at this hospital after pregnancies not complicated by premature rupture of the membranes. Neither recurrent respiratory symptoms nor admission to hospital for chest related disorders were associated with the timing of onset or duration of rupture of the membranes. We conclude that, among survivors of premature rupture of the membranes, chronic respiratory morbidity would best be prevented by avoiding very preterm delivery, regardless of the duration of the rupture.',
'Doppler ultrasound measurements of pulmonary blood flow in babies with severe respiratory distress syndrome treated in a randomised controlled trial of surfactant replacement showed that the immediate improvement of oxygenation was not associated with a significant increase in pulmonary blood flow. Reduction in ventilator settings and increases in the extent of chest wall movements measured by a cardiorespiratory monitor suggested that the improvement after surfactant had been given was a result of alveolar stabilisation and increased pulmonary compliance. Further simultaneous studies of pulmonary blood flow and pulmonary compliance are needed to confirm these findings.',
'The effect of different variants of compiling integrated samples for biochemical oxygen demand (BOD) kinetics was studied in long-term experiments (up to days) with water samples taken from the central deep-water region of Lake Onego. It was a series of experiments carried out simultaneously at and in different seasons of . Five sampling variants were employed with different horizon combinations: near surface, near bottom, from different depths in the water column, from the photic and profundal layers. Two experiments were performed with winter water, three with summer water, four with autumn water, and seven experiments with spring water. The most representative sample for studying BOD in long-term experiments is an sample composed of water from different horizons of the photic layer ( m). For each variant of integrated sample composition, BOD development in the experiments was modeled by a corresponding kinetic equation whose parameters represented the oxidation characteristics of components of the organic matter present in the water and transformed in the long-term BOD experiment. The resultant kinetic parameters of BOD were analyzed in relation to the factors determining the final oxidation of the organic matter components. The patterns in which the type of BOD development is formed depend on the integrated water sample collection/compilation conditions and are characterized by the average values of the organic matter contained in the water, estimated either analytically or from empirical equations, as well as by the temperature of exposure of water samples in the experiment. Synthesis of the resultant information showed that the values of BOD kinetic parameters were generally lower in spring water taken from the central part of Lake Onego as compared with other seasons, since the oxidation potential of organic matter components in spring water is higher.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 768]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
<!--
### Direct Usage (Transformers)
<details><summary>Click to see the direct usage in Transformers</summary>
</details>
-->
<!--
### Downstream Usage (Sentence Transformers)
You can finetune this model on your own dataset.
<details><summary>Click to expand</summary>
</details>
-->
<!--
### Out-of-Scope Use
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->
## Evaluation
### Metrics
#### Triplet
* Datasets: `discipline-tuned_specter_2_022` and `discipline-tuned_specter_2_024`
* Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
| Metric | discipline-tuned_specter_2_022 | discipline-tuned_specter_2_024 |
|:--------------------|:-------------------------------|:-------------------------------|
| **cosine_accuracy** | **0.9714** | **0.971** |
<!--
## Bias, Risks and Limitations
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->
<!--
### Recommendations
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 43,494 training samples
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
* Approximate statistics based on the first 1000 samples:
| | anchor | positive | negative |
|:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
| type | string | string | string |
| details | <ul><li>min: 80 tokens</li><li>mean: 232.53 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 81 tokens</li><li>mean: 230.16 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 86 tokens</li><li>mean: 229.66 tokens</li><li>max: 512 tokens</li></ul> |
* Samples:
| anchor | positive | negative |
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <code>Lupus nephritis (LN) is one of the major risk factors for morbidity and overall mortality in systemic lupus erythematosus (SLE). Its pathogenesis is multifactorial, and a number of risk factors, including serological markers, have been identified in recent years, correlating with clinical course and disease severity. Furthermore, a distinctive autoantibody profile has recently been reported in African- American SLE women with LN. The aim of this study was to characterize the autoantibody profile in African-American SLE patients, with LN and without. Only anti-dsDNA achieved statistical significance between the two groups (P < ). Fourteen ( %) patients with LN and ( %) without it exhibited positive anti-Ro/SS-A, anti-Sm, and anti-nRNP, but without anti-La/SS B (P > ). We conclude that African-American SLE patients with LN do not exhibit a specific or distinctive autoantibody profile. However, our data confirm the value of anti-dsDNA in SLE patients with LN.</code> | <code>TRIM00 is a member of the tripartite motif family proteins and is one of the autoantigens which react with anti-SS-A antibody (Ab) present in sera of patients with systemic lupus erythematosus (SLE) and Sjogren's syndrome. Previous studies have shown that TRIM00 dysfunction promotes aberrant B-cell differentiation and Ab production in SLE, and anti-TRIM00 Ab may be related to the TRIM00 dysfunction in human SLE pathogenesis. Here, we examined the relationship between anti-TRIM00 Ab and clinical and immunological characteristics in SLE patients.Twenty-seven patients with SLE ( women and four men) before immunosuppressive therapies, who fulfilled the revised American College of Rheumatology criteria for SLE, and four healthy controls ( women and one man) were enrolled in the study. SLE patients were divided into two groups according to the seropositivity for anti-TRIM00 Ab. Serum anti-TRIM00 Ab levels were measured using enzyme-linked immunosorbent assays. The serum levels of cytokines a...</code> | <code>We construct a stochastic model of real estate pricing. The method of the pricing construction is based on a sequential comparison of the supply prices. We proof that under standard assumptions imposed upon the comparison coefficients there exists an unique non-degenerated limit in distribution and this limit has the lognormal law of distribution. The accordance of empirical distributions of prices to thetheoretically obtained log-normal distribution we verify by numerous statistical data of real estate prices from Saint-Petersburg (Russia). For establishing this accordance we essentially apply the efficient and sensitive test of fit of Kolmogorov-Smirnov. Basing on "The Russian Federal Estimation Standard N0", we conclude that the most probable price, i.e. mode of distribution, is correctly and uniquely defined under the log-normal approximation. Since the mean value of log-normal distribution exceeds the mode - most probable value, it follows that the prices valued by the mathematica...</code> |
| <code>A laboratory prototype of an enzyme biosensor based on pHsensitive field-effect transistors has been developed to determine the total content of indole alkaloids in Rauwolfia serpentina Benth. Ex Kurz tissue culture. The biosensor was characterized by high sensitivity to th A laboratory prototype of an enzyme biosensor based on pHsensitive field effect transistors has been developed to determine the total content of indole alkaloids in Rauwolfia serpentina Benth. Ex Kurz tissue culture. The biosensor was characterized by high sensitivity to the total content of indole alkaloids (minimum limit of determination g/ml of the total content of indole alkaloids contained in the juice obtained from tissue culture of Rauwolfia serpentina). The linear range of biosensor determination of the analyte was from to g / ml of the total content of indole alkaloids. Analysis of indole alkaloids using a biosensor is simple and fast and does not require expensive equipment and special sample preparation f...</code> | <code>A procedure of separate biosensor analysis of the multicomponent sample with aflatoxins and pesticides has been developed and optimized. Biosensor determination of aflatoxins and pesticides was performed using enzyme inhibition analysis. For creation of bioselective element we used enzyme acetylcholinesterase which is co-immobilized with bovine serum albumin on the surface of potentiometric transducer by glutaraldehyde covalent crosslinking. As transducers were pH-sensitive field effect transistors. The concentration of acetylcholine chloride as a substrate for subsequent inhibition analysis was fit; optimal time of inhibition by toxins solution was determinate together with concentration of reactivator (pyridine- -aldoxymmethyliodyd) and time of enzyme reactivation after inhibition. A synergism between trichlorfon and aflatoxin B0 in inhibition of immobilized on a surface pH-sensitive field-effect transistors acetylcholinesterase was investigated. The proposed procedure allows selecti...</code> | <code>Objective: To observe the effect of modified Zhenwu decoction on blood glucose and blood lipid of experimental diabetic rats.Methods: Diabetic model rats randomly were divided into normal control group,diabetic modeling group,modified Zhenwu decoction group.Establish intraperitoneal injection of Streptozotocin diabetic animal models by,after eight weeks blood glucose and blood lipids were detrmined.Results: After the treatment by modified Zhenwu decoction,blood glucose,blood lipid and other indicators improved significantly.Conclusion: Modified Zhenwu decotion can improve the level of renal lower blood glucose and lipid in diabetic rats.</code> |
| <code>In two successive years ( and ), a set of commercial sugar beet cultivars was established in Randomized Complete Block experiments at two sites in central Greece. Cultivar combination was different between years, but not between sites. Leaf sampling took place once during the growing season and leaf area, LA [cm0], leaf midvein length, L [cm] and maximum leaf width, W [cm] were determined using an image analysis system. Leaf parameters were mainly affected by cultivars. Leaf dimensions and their squares (L0, W0) did not provide an accurate model for LA predictions. Using LW as an independent variable, a quadratic model (y = x0 - x + , r = , p< , n = ) provided the most accurate estimation of LA. With compromises in accuracy, the linear relationship between LW and LA (y = x + , r = , p< , n = ) could be used as a prediction model thanks to its simplicity.</code> | <code>The general increase in temperature, together with sudden episodes of extreme temperatures, are increasingly impacting plant species in the present climate change scenario. Limoniastrum monopetalum is a halophyte from the Mediterranean Basin, exposed to broad daily and seasonal changes in temperature and extreme high temperatures. We studied the photosynthetic responses (chlorophyll fluorescence dynamics and gas exchange) of L. monopetalum leaves exposed to temperatures from .0C to .0C under darkness in controlled laboratory conditions. L. monopetalum presented its optimum temperature for photosynthesis around +00C. The photosynthetic apparatus of L. monopetalum exhibited permanent damages at > .0C. L. monopetalum tolerated, without permanent damages, temperatures as low as .0C in darkness. L. monopetalum appears as a plant species very well adapted to the seasonality of the Mediterranean climate, which may work as a pre-adaptation to stand more extreme temperatures in the actual conte...</code> | <code>The article depicts direct and hidden (implicit and explicit) information giving in advertisement discourse, meaning advertising slogans. Having investigated this topic thoroughly, the author found out that cognitive types of presupposition and communicative implicatures played a great role in advertising slogans. There are definitions of phenomena "implicit" and "explicit" with examples. The cognitive types of presupposition (semantic and pragmatic) and their typology is discussed in the article. There is a possibility to figure out what strategy of communicative influence on human's cognition is. Some laws of neurolinguistic programming is also discussed.</code> |
* Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
```json
{
"distance_metric": "TripletDistanceMetric.COSINE",
"triplet_margin": 0.4
}
```
### Evaluation Dataset
#### Unnamed Dataset
* Size: 2,174 evaluation samples
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
* Approximate statistics based on the first 1000 samples:
| | anchor | positive | negative |
|:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
| type | string | string | string |
| details | <ul><li>min: 83 tokens</li><li>mean: 235.71 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 82 tokens</li><li>mean: 234.64 tokens</li><li>max: 512 tokens</li></ul> | <ul><li>min: 86 tokens</li><li>mean: 225.92 tokens</li><li>max: 512 tokens</li></ul> |
* Samples:
| anchor | positive | negative |
|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| <code>In Organic Law / of 0rd October of the general arrangement of the educational system (LOGSE), the educational system includes the general regime education and the special regime education. Dance is included in the special regime as part of the artistic disciplines together with music, drama, the plastic arts and design. The aim of this article is to analyse the treatment given to Dance in the general regime. Thus, we will try to emphasize the inconsistency that exists between the areas of primary education, which will be obligatory and will have a global and integrated character, and the training of future teachers.</code> | <code>This work aims to analyze the treatment of health education in school textbooks during the period , and to compare it with the one that is conducted at present. It will attempt to verify how many current concepts and ideas were already present in those decades. In addition, the differences in the way of carrying out health education then and now will be outlined, especially those referred to pedagogic strategies and didactic materials. All this will be done from a double perspective: . The concept of health, hygiene and pedagogy of health education. . The program contents of health education in the didactic materials.</code> | <code>The vane-in-cup (VIC) geometry has been widely used for the rheological characterization of yield-stress fluids because it minimizes slip effects at the liquid/solid interface of the rotating geometry and reduces sample damage during the loading process. However, severe kinematic limitations arising from the spatial complexity of mixed shear and extensional flow have been identified for quantitative rheometrical measurements in complex fluids. Recently, vanes with fractal cross sections have been suggested as alternatives for accurate rheometry of elastoviscoplastic fluids. In this work, the steady fractal vane-in-cup (fVIC) flow of a Newtonian fluid and a nonthixotropic Carbopol®️ microgel as well as the unsteady flow of a thixotropic -Carrageenan gel are analyzed using rheo-particle image velocimetry (Rheo-PIV). We describe the velocity distributions in all cases and show that the fVIC produces an almost axisymmetric flow field and rotation rate-independent "effective radius" when us...</code> |
| <code>An ultrahigh vacuum three-axis cryogenic sample manipulator suitable for angle-resolved photoelectron spectroscopy experiments was developed. The sample manipulator is constructed by combining three modules with translation, polar rotation, and azimuthal-tilt rotation capabilities. Polar rotation and the azimuthal-tilt rotation are performed using a differentially pumped rotary stage and a sample goniometer, respectively. Continuous rotation around the polar axis is possible. The sample goniometer is capable of azimuthal rotation of up to and tilt rotation from to , measured from the plane normal to the polar axis. Nonmagnetic materials are used near the sample holder of the goniometer. The sample holder can be cooled using a continuous-flow cryostat. To serve as a radiation shield, the lower portion of the goniometer surrounding the sample holder is cooled separately by another cell filled with liquid nitrogen. With liquid nitrogen or liquid helium for the cryostat, the sample holder ...</code> | <code>In the soft x-ray region below keV, various electron yield (EY) techniques have been employed in x-ray absorption fine structure (XAFS) measurements of bulk materials. The fluorescent x-ray yield (FY) is also utilized for samples of low concentration. Although FY becomes much smaller for lighter elements, it has several advantages compared with EY to measure XAFS spectra; for example, a higher signal-to-background ratio and applicability to insulating materials. However, it has been thought to be unsuitable for concentrated samples due to a self-absorption effect. In this report, the sampling depth and self-absorption effect for bulk concentrated samples are discussed concerning XAFS measurements in a few keV energy region. Some typical FY XAFS spectra of concentrated materials, including insulators, are presented.</code> | <code>To investigate the distribution characteristics of TCM syndromes and the related herbal prescriptions for malignant tumors (MT). A clinical database of the TCM syndromes and the herbal prescriptions in treatment of MT patients were established. The data were then analyzed using cluster and frequency analysis. According to the cluster analysis, the TCM syndromes in MT patients mainly included two patterns: deficiency of both Qi and Yin and internal accumulation of toxic heat. The commonly-prescribed herbs were Huangqi (Astraglus), Nuzhenzi (Fructus Ligustri Lucidi), Lingzhi (Ganoderma Lucidum), Huaishan (Dioscorea Opposita), Xiakucao (Prunella Vulgaris), and Baihuasheshecao (Herba Hedyotidis). Deficiency of Qi and Yin is the primary syndrome of MT, and internal accumulation of toxic heat is the secondary syndrome. The herbs for Qi supplementation and Yin nourishment are mainly used, with the assistance of herbs for heat-clearance and detoxification.</code> |
| <code>Abstract Abstract Worldwide opposition to different aspects of globalisation indicates the emergence of a global social movement that typically targets the international bodies that regulate global trade and global finance, as well as the regulations themselves. The significance of the movement calls for a synthetic analysis that moves beyond the currently used fragmentary descriptions. A more profound conceptual framework will enable researchers to better understand the full dynamic of the movement within its global context In this article we explore the possibilities of applying David Korten's ideal-typical notion of fourth generation development to the anti-globalisation movement. We ask whether anti-globalisation organisation exhibits so-called Fourth Generation characteristics and activities. Our goal is to determine the extent to which the movement as a whole, and the individual organisations which constitute it, conform to the fourth generation development conceptual framework. ...</code> | <code>Abstract Globalisation is a complex, multi-faceted, phenomenon with widely contested meanings. While it has roots in the history of colonialism, capitalist development and imperialism, there are strong indications that what we are witnessing, since the 0000s, is a qualitative break with the past. Old boundaries, categories and meanings are being challenged in profound ways. New forms of exploitation and subjugation emerge in such a way that stark brutal force coexists with and may be increasingly supplanted by more subtle, pervasive forces of hegemonic rule. The latter, however, has opened up new terrains of struggle for people, movements, and governments opposed to one-dimensional 'corporate globalisation', seeking instead the globalisation of social and environmental justice. A continent like Africa much of which has sunk deeper into a 'fourth world' status of extreme under-development, social instability and neo-colonial dependence faces stark choices. Does it seek to partially or f...</code> | <code>So much has been written about the nation vis-a-vis other fields in the humanities, literature in particular. My interest in dance lies in its peculiar location within and vis-a-vis the discourse of the nation. An ephemeral form, dance has elicited various, and even contradictory, valuations; most of the time it is considered a mere form of entertainment. It is undeniable, though, that dance has articulated and informed our ideas of the nation and nationhood. In this paper, I explore how three contemporary dance companies based in Quezon City (The University of the Philippines Dance Company, Airdance, and Dance Forum) have rendered their imaginings of the Philippine nation. I focus on Philippine contemporary dance because as a cultural practice, I believe that it has choreographed the many trajectories and issues embodied in the Philippines's imagining of itself. A number of choreographies by the three companies mobilize motifs, forms, structures, and styles that constitute and signify...</code> |
* Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
```json
{
"distance_metric": "TripletDistanceMetric.COSINE",
"triplet_margin": 0.4
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 4
- `per_device_eval_batch_size`: 32
- `learning_rate`: 7e-06
- `weight_decay`: 0.01
- `num_train_epochs`: 1
- `warmup_ratio`: 0.5
- `fp16`: True
- `batch_sampler`: no_duplicates
#### All Hyperparameters
<details><summary>Click to expand</summary>
- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 4
- `per_device_eval_batch_size`: 32
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 7e-06
- `weight_decay`: 0.01
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 1
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.5
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: False
- `fp16`: True
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: False
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: None
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `include_for_metrics`: []
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`:
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `use_liger_kernel`: False
- `eval_use_gather_object`: False
- `average_tokens_across_devices`: False
- `prompts`: None
- `batch_sampler`: no_duplicates
- `multi_dataset_batch_sampler`: proportional
</details>
### Training Logs
| Epoch | Step | Training Loss | Validation Loss | discipline-tuned_specter_2_022_cosine_accuracy | discipline-tuned_specter_2_024_cosine_accuracy |
|:------:|:----:|:-------------:|:---------------:|:----------------------------------------------:|:----------------------------------------------:|
| 0.0023 | 25 | 0.2976 | 0.2980 | 0.9518 | - |
| 0.0046 | 50 | 0.3008 | 0.2969 | 0.9518 | - |
| 0.0069 | 75 | 0.3088 | 0.2953 | 0.9524 | - |
| 0.0092 | 100 | 0.3047 | 0.2929 | 0.9530 | - |
| 0.0115 | 125 | 0.2879 | 0.2897 | 0.9530 | - |
| 0.0138 | 150 | 0.2705 | 0.2855 | 0.9532 | - |
| 0.0161 | 175 | 0.2771 | 0.2804 | 0.9536 | - |
| 0.0184 | 200 | 0.2737 | 0.2744 | 0.9548 | - |
| 0.0207 | 225 | 0.2737 | 0.2676 | 0.9553 | - |
| 0.0230 | 250 | 0.2569 | 0.2600 | 0.9557 | - |
| 0.0253 | 275 | 0.2518 | 0.2512 | 0.9579 | - |
| 0.0276 | 300 | 0.2445 | 0.2416 | 0.9580 | - |
| 0.0299 | 325 | 0.2214 | 0.2310 | 0.9591 | - |
| 0.0322 | 350 | 0.2359 | 0.2204 | 0.9606 | - |
| 0.0345 | 375 | 0.2072 | 0.2090 | 0.9615 | - |
| 0.0368 | 400 | 0.1907 | 0.1976 | 0.9618 | - |
| 0.0391 | 425 | 0.1881 | 0.1850 | 0.9624 | - |
| 0.0414 | 450 | 0.1842 | 0.1733 | 0.9637 | - |
| 0.0437 | 475 | 0.1618 | 0.1628 | 0.9646 | - |
| 0.0460 | 500 | 0.1638 | 0.1533 | 0.9645 | - |
| 0.0483 | 525 | 0.1569 | 0.1440 | 0.9648 | - |
| 0.0506 | 550 | 0.1473 | 0.1354 | 0.9657 | - |
| 0.0529 | 575 | 0.1333 | 0.1281 | 0.9671 | - |
| 0.0552 | 600 | 0.1481 | 0.1223 | 0.9671 | - |
| 0.0575 | 625 | 0.1263 | 0.1167 | 0.9675 | - |
| 0.0598 | 650 | 0.114 | 0.1120 | 0.9684 | - |
| 0.0621 | 675 | 0.1097 | 0.1081 | 0.9693 | - |
| 0.0644 | 700 | 0.1152 | 0.1044 | 0.9698 | - |
| 0.0667 | 725 | 0.1009 | 0.0999 | 0.9705 | - |
| 0.0690 | 750 | 0.0895 | 0.0961 | 0.9709 | - |
| 0.0713 | 775 | 0.0855 | 0.0934 | 0.9711 | - |
| 0.0736 | 800 | 0.0853 | 0.0912 | 0.9715 | - |
| 0.0759 | 825 | 0.0942 | 0.0885 | 0.9714 | - |
| 0.0782 | 850 | 0.1035 | - | - | 0.9710 |
### Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.3.1
- Transformers: 4.49.0.dev0
- PyTorch: 2.5.1+cu121
- Accelerate: 1.2.1
- Datasets: 3.2.0
- Tokenizers: 0.21.0
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
author = "Reimers, Nils and Gurevych, Iryna",
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
month = "11",
year = "2019",
publisher = "Association for Computational Linguistics",
url = "https://arxiv.org/abs/1908.10084",
}
```
#### TripletLoss
```bibtex
@misc{hermans2017defense,
title={In Defense of the Triplet Loss for Person Re-Identification},
author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
year={2017},
eprint={1703.07737},
archivePrefix={arXiv},
primaryClass={cs.CV}
}
```
<!--
## Glossary
*Clearly define terms in order to be accessible across audiences.*
-->
<!--
## Model Card Authors
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->
<!--
## Model Card Contact
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->