---
tags:
- sentence-transformers
- sentence-similarity
- feature-extraction
- generated_from_trainer
- dataset_size:7828
- loss:TripletLoss
base_model: answerdotai/ModernBERT-large
widget:
- source_sentence: Pleural effusion is a frequently observed lesion in the course
of respiratory diseases such as inflammatory process and cancer metastasis. Its
cause may be either tuberculosis (the most common extrapulmonary location is the
pleura) and malignant disease of the pleura. Confirmation of tuberculosis is often
troublesome. The primary site of cancer may be als difficult to find despite the
application of difficult diagnostic methods. Below we present history of -year
old female in whom carcinomatous cells and positive result of PCR for Mycobacterium
tuberculosis in pleural fluid were discovered simultaneously suggesting the tuberculosis
and cancer of unknown primary origin.
sentences:
- Coronaviruses are a large family of viruses that cause illness ranging from mild
to severe symptoms. Coronaviruses are known to cause diseases that cause severe
symptoms such as Middle East Respiratory Syndrome (MERS) and Severe Acute Respiratory
Syndrome (SARS). This study aims to determine the factors related to compliance
with the use of personal protective equipment by health workers during the COVID-
pandemic at Bahteramas Hospital, Southeast Sulawesi province in . This study used
a case control design. The population in this study were health workers at Bahtermas
Hospital totaling health workers. The sample in this study amounted to respondents
consisting of groups of health workers. Sampling using the Lemeshow formula. The
results showed that based on the results of the chis square test, the P-value
of the knowledge variable was , the Attitude variable, a P-Value of was obtained
and the PPE availability variable was a P-Value of . From the research samples
used, it can be concluded that the Knowledge, Attitude and availability of PPE
are related to compliance with the use of PPE by health workes during the COVID-
pandemic at Bahteramas Hospital, Southeast Sulawesi Province.
- Recent developments in treatment have steadily raised the median predicted age
of survival for people with Cystic Fibrosis (CF). We report the health-related
quality of life (HRQoL) in CF adult patients and correlate our findings with the
patients' demographic characteristics.The Cystic Fibrosis Quality of Life (CFQoL)
questionnaire was answered by CF adult patients. The questionnaire included questions
pertaining to age, sex and level of education and covered eight sections of functioning.The
highest score was reported in the "Social Functioning" section, while the lowest
in the "Concerns for the Future" section. When different age groups were compared,
statistical significances were reported in "Physical Functioning", "Interpersonal
Relationships", and the "Career Concerns" section, with older patients reporting
statistically higher HRQoL scores than younger ones (p < ). No statistically significant
difference was reported amongst the scoring between male and female CF patients.
When different educational levels were compared, patients that had received a
higher educational training scored statistically higher in all but one sections
of the questionnaire when compared with patients of a lower educational level
(p < ).More than half Greek adult CF patients report that they are capable to
participate in social activities but most of them are worried about the outcome
of their disease and its effect on their lives.
- 'BACKGROUND: The global amount of investment in companies developing artificial
intelligence (AI)-based software technologies for medical diagnostics reached
million in , rose to million in , and is expected to continue growing. While software
manufacturing companies should comply with existing clinical, bioethical, legal,
and methodological frameworks and standards, there is a lack of uniform national
and international standards and protocols for testing and monitoring AI-based
software. AIM: This objective of this study is to develop a universal methodology
for testing and monitoring AI-based software for medical diagnostics, with the
aim of improving its quality and implementing its integration into practical healthcare.
MATERIALS AND METHODS: The research process involved an analytical phase in which
a literature review was conducted on the PubMed and eLibrary databases. The practical
stage included the approbation of the developed methodology within the framework
of an experiment focused on the use of innovative technologies in the field of
computer vision to analyze medical images and further application in the health
care system of the city of Moscow. RESULTS: A methodology for testing and monitoring
AI-based software for medical diagnostics has been developed, aimed at improving
its quality and introducing it into practical healthcare. The methodology consists
of seven stages: self-testing, functional testing, calibration testing, technological
monitoring, clinical monitoring, feedback, and refinement. CONCLUSION: Distinctive
features of the methodology include its cyclical stages of monitoring and software
development, leading to continuous improvement of its quality, the presence of
detailed requirements for the results of the software work, and the participation
of doctors in software evaluation. The methodology will allow software developers
to achieve significant outcomes and demonstrate achievements across various areas.
It also empowers users to make informed and confident choices among software options
that have passed an independent and comprehensive quality check.'
- source_sentence: Abstract The molecule based bilayer system composed of hard Ni
[Fe(CN) ] nH O and soft Ni [Cr(CN) ] nH O ferromagnetic Prussian blue analogues
has been fabricated on a solid substrate by "layer by layer" deposition. The structure
and morphology characterization as well as results of magnetic measurements are
described. The thickness of the bilayer is ca. nm including a nm interface. This
bilayer system shows anisotropic magnetic properties reflected in the shape of
magnetic hysteresis measured in various film orientation with respect to the direction
of external magnetic field. There is no exchange interaction between hard and
soft magnetic layer and irrespective of bilayer orientation, the magnetization
and demagnetization process of both Ni [Fe(CN) ] nH O and Ni [Cr(CN) ] nH O layers
occurs independently.
sentences:
- 'Previous articleNext article No AccessBook ReviewsThe Gospel According to Renan:
Reading, Writing, and Religion in Nineteenth-Century France. By Robert D. Priest.
Oxford Historical Monographs. Edited by P. Clavin et al.Oxford: Oxford University
Press, . Pp. xii+ . . La vie de Jesus de Renan: La fabrique d''un best-seller.
By Nathalie Richard.Rennes: Presses Universitaires de Rennes, . Pp. . .Stephane
GersonStephane GersonNew York University Search for more articles by this author
PDFPDF PLUSFull Text Add to favoritesDownload CitationTrack CitationsPermissionsReprints
Share onFacebookTwitterLinkedInRedditEmail SectionsMoreDetailsFiguresReferencesCited
by The Journal of Modern History Volume , Number 0September Article DOIhttps://doi.org/
. Views: 00Total views on this site For permission to reuse, please contact [email
protected]PDF download Crossref reports no articles citing this article.'
- Purpose The purpose of this paper is to present a case study describing a collaboration
with Last Mile Health, a non-governmental organization, to develop a framework
to inform its community healthcare networks in remote Liberia. Design/methodology/approach
The authors detail the process of using the unique problem setting and available
data to inform modeling and solution approaches. Findings The authors show how
the characteristics of the Liberian setting can be used to develop a two-tier
modeling framework. Given the operating constraints and remote setting the authors
are able to model the problem as a special case of the location-routing problem
that is computationally simple to solve. The results of the models applied to
three districts of Liberia are discussed, as well as the collaborative process
of the multidisciplinary team. Originality/value Importantly, the authors describe
how the problem setting can enable the development of a properly scoped model
that is implementable in practice. Thus the authors provide a case study that
bridges the gap between theory and practice.
- Abstract Poor electrical conductivities, structural instabilities and long synthesis
procedures, limit the application of metal organic frameworks (MOFs) in energy
storage systems. In the present work, we synthesize a cobaltbenzene tricarboxylic
acid based MOF (CoBTC MOF) via two different approaches i. e. solvothermal route
and mechanochemical grinding for its utility in energy storage. When characterized
structurally and electrochemically, the CoBTC MOF synthesized by mechanochemical
method is found to be superior because of large surface area, enhanced porosity/diffusion
process through MOF and structural robustness along with less time requirement.
Further, its hybrid composite with graphene nanosheets (CoBTC MOF/GNS) was prepared
for its performance as a supercapacitor material. The characterization reveals
the formation of sandwich structure where CoBTC MOF rods (thickness ranging from
to m) are placed in between GNS. This arrangement has resulted into high specific
capacitance of F.g at current density of A.g in M KOH electrolyte along with excellent
capacitance retention up to % after charge/discharge cycles. Also, a symmetric
supercapacitor has been assembled for practical application of CoBTC MOF/GNS which
demonstrates specific capacitance of F.g with high energy density and power density
of Wh.kg and W.kg respectively, along with % retention of initial capacitance
after chargedischarge cycles.
- source_sentence: Patients with cancer are at increased risk of venous thromboembolism
(VTE). Risk assessment models can help identifying high-risk populations that
might benefit from primary thromboprophylaxis. Currently, the Khorana score is
suggested to select patients for primary thromboprophylaxis. However, risk stratification
with the Khorana-score remains imperfect, which led to the development of subsequent
clinical risk assessment models (PROTECHT-, CONKO-, ONKOTEV-, TiCat-, COMPASS-CAT-score).
Further, recently, a simplified, personalized risk prediction tool for cancer-associated
VTE, incorporating cancer type and D-Dimer levels has been proposed by Pabinger
et al. (CATSCORE). Also, novel models have been designed specifically for specific
tumour types, such as lung cancer (ROADMAP-CAT), gynaecological cancer (THROMBOGYN),
lymphoma (THROLY), or multiple myeloma (SAVED-; IMPEDE VTE-score). In the present
narrative review, we comprehensively summarize available data on currently available
risk assessment models for VTE in patients with cancer, provide a critical discussion
on their clinical utility, and give an outlook towards future developments.
sentences:
- Besides the cancer itself, venous thromboembolism (VTE) is the leading cause of
death in cancer patients receiving outpatient chemotherapy (CT). Data on VTE development
and impact on treatment course and outcome in real-life NSCLC patients receiving
immune check-point inhibitors (ICI) is currently sparse. More knowledge within
this area is warranted due to the emerging use of ICI in clinical practice. To
quantify risk of VTE and recurrent VTE in NSCLC patients receiving ICI. Explore
the clinical impact of VTE on ICI course and survival and explore potential risk
factors for VTE. Patients with advanced/metastatic NSCLC treated with an immune
checkpoint inhibitor (ICI) at the University Hospital of Odense, Denmark during
were identified and data gathered retrospectively from electronic medical records
(n = ). All patients had finished ICI at the time of data-cut off. Baseline Khorana
Score (KRS) was calculated within one week prior to ICI initiation. Based on follow-up
data cumulative incidence of VTE and its impact on outcome and survival was performed
using Kaplan Meier and cox-regression hazard estimation. Risk of VTE was % during
ICI and % at any time point after ICI initiation. Cumulative incidence rates of
VTE at , , and months after first ICI was %, %, % and % respectively. Median time
to VTE during ICI was months [IQR .0]. Having VTE during ICI lead to discontinuation
of ICI in % of cases, most due to fatal PE. History of VTE before onset of ICI
was a significant risk factor for recurrent VTE during ICI ( % within this subgroup)
despite use of anticoagulant therapy. The incidence and impact of VTE during ICI
for real-life NSCLC patients is not negligible with almost % developing VTE leading
to termination of further ICI in the majority of cases - many due to fatal PE.
The risk of recurrent anticoagulant resistant VTE in patients with known VTE during
ICI is also considerable, which calls for better management and prevention of
VTE including development of treatment specific VTE risk assessment models.
- In September , the New York Supreme Court, Second District, reversed a decision
made by the Division of Human Rights for a dentist to pay a patient in compensatory
damages. The agency ruled that disability-based discrimination is prohibited in
places of public accommodation. The state Supreme Court, however, found that dental
offices are not places of public accommodation as defined by the state human rights
law. The Division of Human Rights plans to appeal the ruling to the New York Court
of Appeals, citing case law which supports the proposition that private medical
offices are places of public accommodation.
- Hemoglobin concentrations in endometriotic cyst fluids have been found to be associated
with distinct clinical manifestations, such as pelvic pain and infertility, as
well as with malignant transformation. However, the measurement of the hemoglobin
concentration in cyst fluid is an invasive procedure. The present study aimed
to evaluate the usefulness of visible and nearinfrared interactance spectroscopy
as a noninvasive technique for estimating the hemoglobin concentration in endometriotic
cystic fluid. Optical fibers were directly placed onto sliced raw pork (up to
00mmthick as an anatomical barrier on the cyst's surface) that covers a cuvette
containing hemoglobin solution or endometriotic cyst fluid. Partial least square
regression based on the second derivative using visible and nearinfrared interactance
spectroscopy (wavelength region, nm) was used to estimate the hemoglobin concentration.
The samples were categorized into the evaluation sets (i.e., calibration set)
to create calibration curves and test sets (i.e., validation set) to validate
equations. The cyst fluid at mm of pork thickness achieved a high correlation
between actual and predicted hemoglobin concentrations (calibration (R0= ) and
validation (R0= ) data). However, the correlation slightly decreased at 00mm pork
thickness (i.e., calibration (R0= ) and validation (R0= ) data). Interactance
spectroscopy may thus be a noninvasive tool which can be used to estimate the
hemoglobin concentration in endometriotic cyst fluid when the anatomical barrier
is mm. This technology is a reliable modality for predicting the severity of dysmenorrhea
and infertility, as well as malignant transformation, in a number of patients
with endometriotic cysts. Such quantitative optical spectroscopic imaging technologies
may enable the accurate diagnosis of the pathological processes in endometriotic
cysts in clinical practice.
- source_sentence: Numerous industries provide investors with various funding options
in today's rapidly evolving business and technology landscape. One particularly
intriguing area in this regard is investment. Investment refers to allocating
cash into various assets for a specific duration to generate profits, such as
income or capital appreciation. Infrastructure development has led to the management
of several industries, including property and real estate. Property can stimulate
other economic sectors by providing employment opportunities and enhancing overall
societal well-being. This is further bolstered by the rapid growth of the property
sector, driven by the consistent availability of land and the rising public demand
for housing and office spaces. Based on the data results, it is evident that there
was an upsurge in demand for property and real estate in . In contrast, production
was sluggish expansion across all industries during the Covid- pandemic. Share
prices will rise with increased demand and fall with less demand. This is evident
in the company's effective management of shareholders. Financial reports are crucial
for the company's future. Financial report data can be utilized as a decisive
factor in decision-making. By assessing the financial performance of PT. Alam
Sutera Reality Tbk, PT. Bumi Serpong Tbk, and PT. Bekasi Fajar Industri Estate
Tbk, investors can make well-informed investment decisions. The liquid or illiquid
ratio, which is based on the company's debt-to-equity ratio, current ratio, net
profit margin, and total asset turnover, can be calculated to complete this assessment.
sentences:
- Fraud in accounting reporting is one of the factors that need to consider in presenting
quality financial reports. Based on the existing phenomena, this study investigates
accounting fraud that is suspected to be influenced by Good Corporate Governance
(GCG), compliance with accounting rules to present financial reports and information
asymmetry, and internal control. Testing the hypotheses secondary data from BUMN
listed on the Jakarta Stock Exchange is used to test the allegations. Testing
the hypothesis proposed using a quantitative approach with a sample of BUMNs listed
on the Jakarta Stock Exchange. The calculation results show that all the proposed
hypotheses are empirically proven. This condition indicates that accounting fraud
to be influenced by Good Corporate Governance (GCG), adherence to accounting rules
for the presentation of financial statements and information asymmetry, and internal
control.
- The number of multilingual signs in Japan was increasing rapidly; however, there
were still disputes over the information of signs, such as low recognition of
information and language selection, etc. In this case, this study was carried
out.BR The purpose of the study was to define benchmarks for foreigner-friendly
multilingual signs. Moreover, the possibility of how Chinese information was marked
in the multilingual signs of Japanese Tourist Attractions was explored.BR The
research contents and results were as follows. Firstly, the representative tourist
attractions in Tokyo were surveyed on the spot and photographed for record. Secondly,
the data from the fieldwork were organized into charts and graphs and analyzed
for multilingual markers. Thirdly, through interviews with H Tourism Association
in Tokyo, some issues with the signs of the current situation of scenic spots
were revealed. Fourthly, from the perspective of the characteristics of Chinese
language and the thinking method about Chinese characters, the field surveys and
interviews about the need for a large area of multilingual information marking
in signs were analyzed. The possibility of marking Chinese messages in signs of
Tourist Attractions in Japan was discussed.BR Guidance signs and induction signs
were more informative, and the information was generally presented in words rather
than sentences. If adopted together with non-verbal communication such as map
and diagram, the Chinese characters in the guidance signs and induction signs
of historical scenic spots with a high proportion of Chinese characters could
be omitted.BR So far, there have been many studies on the issue of multilingual
signs from the perspective of fonts and layout. What's more, from this new perspective
on language features, the issue of multilingual signs was explored in this study.
It was expected that the results of this research can be applied into practice
in practical projects.
- Dialect Recognition Systems (DRS) are systems that group dialects, according to
similar acoustic features found in dialect regions. The speaker's age, gender,
and dialect characteristics negatively affect the performance of speech recognition
systems. To handle dialect differences, dialect recognition systems can be integrated
into speech recognition systems. By determining the spoken dialect, the system
can be switched to the corresponding speech recognition model. There is no dataset
that can be used for Turkish automatic dialect recognition systems. In this study,
it is thought that this deficiency should be eliminated in some way. In addition,
an experimental study has been carried out to classify the generated data set
by convolutional neural networks. The resulting % accuracy is satisfactory.
- source_sentence: The social sciences have long shown that health is not born of
pure biology, empirically (re)centred the social and material causes of disease,
and affirmed the subjective experiences of disease. Disputed both in popular and
academic discourses, social health has variously attempted to stress the social
aspects of health. Existing conceptions remain analytically limited as they are
predominantly used as descriptors for populational health. This article theorises
social health as an analytical lens for making sense of the relations, affects
and events where health unfolds and comes into expression. Drawing on social practice
theory, feminist care ethics and posthumanism this conceptual paper re-imagines
how social health might be conceived as lived social practices anchored in care.
Care within our framework acknowledges the unavoidable interdependency foundational
to the existence of beings and stresses the 'know how' and embodied practices
of care in the mundane in order to emphasise that care itself is absolutely integral
to the maintenance of social health. The article argues that health needs to be
understood as a verb intrinsically (re)made in and through social contexts and
structures and comprised of meaningful, human-human and human-non-human interactions.
Ultimately, in theorising social health through mundane care practices, we hope
to open up research to making sense of how the doing of health unfolds inside
often banal, patterned forms of social activity. Such taken-for-granted social
practices exemplify the often overlooked lived realities that comprise our health.
To understand health in its own right, we argue, these everyday practices need
to be interrogated.
sentences:
- This paper proposes a methodology to create an interpretable fuzzy model for monthly
rainfall time series prediction. The proposed methodology incorporates the advantages
of artificial neural network, fuzzy logic and genetic algorithm. In the first
step, the differences between the time series data are calculated and they are
used to define the interval between the membership functions of a Mamdani-type
fuzzy inference system. Next, artificial neural network is used to develop the
model from input-output data and the established model is then used to extract
the fuzzy rules. The parameters of the created fuzzy model are then optimized
by using genetic algorithm. The proposed model was applied to eight monthly rainfall
time series data in the northeast region of Thailand. The experimental results
showed that the proposed model provided satisfactory prediction accuracy when
compared to other commonly-used prediction models. Due to the interpretability
nature of the model, human analysts can gain insight knowledge of the data to
be modeled.
- A dB dynamic range and cm spatial resolution tunable photon-counting optical time-domain
reflectometer (PC-OTDR) is presented along with a Field Programmable Gate Array
(FPGA)-based detection management system that allows several regions of the fiber
to be interrogated by the same optical pulse, increasing the data acquisition
rate when compared to previous solutions. The optical pulse generation is implemented
by a tunable figure- passive mode-locked laser providing pulses with the desired
bandwidth and center wavelength for WDM applications in the C-band. The acquisition
rate is limited by the afterpulse effect and dead time of the employed gated avalanche
single-photon detectors. The devised acquisition system not only allows for centimeter-resolution
monitoring of fiber links as long as km in under minutes but is also readily adapted
to any other photon-counting strategy for increased acquisition rate. The system
provides a -fold decrease in acquisition times when compared with state-of-the-art
solutions, allowing affordable times in centimeter-resolution long-distance fiber
measurements.
- Care has been theorised in relationship to eating disorders as a central consideration
across diagnoses. In the context of avoidant restrictive food intake disorder
(ARFID) specifically, there is room to further develop the nuances around layers
of care involved in working towards well-being. In this paper, we engage with
the stories of caregivers of people with ARFID, exploring their pathways to care
(or lack thereof) through the healthcare system in Aotearoa New Zealand. We explore
the material, affective and relational aspects of care and care-seeking, engaging
with the power and politics of care as it flows through care-seeking assemblages.
Using postqualitative methods of analysis, we discuss how while participants were
seeking care, they received (or, at times, did not receive) treatment, and unpack
how care and treatment are not always synonymous. We work up extracts from parents'
stories surrounding their caring for their children and how their actions were,
at times, interpreted in ways that made them feel blame and shame rather than
care. Participants' stories also offer glimmers of care within a resource-strapped
healthcare system, which invite us to consider the potentiality of a relational
ethics of care as an assemblage-shifting moment.
pipeline_tag: sentence-similarity
library_name: sentence-transformers
metrics:
- cosine_accuracy
model-index:
- name: SentenceTransformer based on answerdotai/ModernBERT-large
  results:
  - task:
      type: triplet
      name: Triplet
    dataset:
      name: modernBERT
      type: modernBERT
    metrics:
    - type: cosine_accuracy
      value: 0.9846547314578005
      name: Cosine Accuracy
  - task:
      type: triplet
      name: Triplet
    dataset:
      name: modernBERT disciplines
      type: modernBERT_disciplines
    metrics:
    - type: cosine_accuracy
      value: 0.9789272030651341
      name: Cosine Accuracy
---
# SentenceTransformer based on answerdotai/ModernBERT-large
This is a [sentence-transformers](https://www.SBERT.net) model fine-tuned from [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large). It maps sentences and paragraphs to a 1024-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
## Model Details
### Model Description
- **Model Type:** Sentence Transformer
- **Base model:** [answerdotai/ModernBERT-large](https://huggingface.co/answerdotai/ModernBERT-large) <!-- at revision e829787a68677321312ff287fda2f8ef1a36e02a -->
- **Maximum Sequence Length:** 8192 tokens
- **Output Dimensionality:** 1024 dimensions
- **Similarity Function:** Cosine Similarity
<!-- - **Training Dataset:** Unknown -->
<!-- - **Language:** Unknown -->
<!-- - **License:** Unknown -->
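The similarity function listed above, and the `cosine_accuracy` metric reported in the card metadata for the triplet tasks, can be illustrated with a small dependency-free sketch. The helper names `cosine_similarity` and `triplet_cosine_accuracy` are hypothetical and are not part of the Sentence Transformers API. The toy 3-dimensional vectors stand in for the model's 1024-dimensional embeddings. The accuracy definition used here (the anchor is closer to the positive than to the negative) is an assumption about how the reported metric is computed.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_cosine_accuracy(triplets):
    """Fraction of (anchor, positive, negative) triplets in which the anchor
    is more similar to the positive than to the negative."""
    hits = sum(
        1
        for anchor, positive, negative in triplets
        if cosine_similarity(anchor, positive) > cosine_similarity(anchor, negative)
    )
    return hits / len(triplets)


# Toy stand-ins for real embeddings (hypothetical values).
anchor = [1.0, 0.0, 0.0]
positive = [0.9, 0.1, 0.0]
negative = [0.0, 1.0, 0.0]

print(cosine_similarity(anchor, positive))
print(triplet_cosine_accuracy([(anchor, positive, negative)]))
```

With real embeddings, the same comparison would be applied to the vectors returned by the model for each anchor, positive, and negative text.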
### Model Sources
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
### Full Model Architecture
```
SentenceTransformer(
  (0): Transformer({'max_seq_length': 8192, 'do_lower_case': False}) with Transformer model: ModernBertModel
  (1): Pooling({'word_embedding_dimension': 1024, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
)
```
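The pooling layer above has `pooling_mode_mean_tokens` set to `True`, meaning the sentence embedding is the average of the token embeddings. The following is a minimal pure-Python sketch of that idea, not the library's implementation. The function name `mean_pool` is hypothetical, and excluding padding positions via an attention mask is an assumption about the usual behaviour.

```python
def mean_pool(token_embeddings, attention_mask):
    """Average the token vectors, skipping positions where the mask is 0."""
    dim = len(token_embeddings[0])
    summed = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                summed[i] += value
    count = max(count, 1)  # guard against an input that is all padding
    return [total / count for total in summed]


# Three token vectors of dimension 2; the last one is padding (mask 0).
tokens = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
mask = [1, 1, 0]
print(mean_pool(tokens, mask))  # → [2.0, 3.0]
```

In the actual model, each token vector has 1024 dimensions and a sequence can contain up to 8192 tokens.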
## Usage
### Direct Usage (Sentence Transformers)
First install the Sentence Transformers library:
```bash
pip install -U sentence-transformers
```
Then you can load this model and run inference.
```python
from sentence_transformers import SentenceTransformer
# Download from the 🤗 Hub
model = SentenceTransformer("m7n/discipline-bert-modern-large_v02")
# Run inference
sentences = [
"The social sciences have long shown that health is not born of pure biology, empirically (re)centred the social and material causes of disease, and affirmed the subjective experiences of disease. Disputed both in popular and academic discourses, social health has variously attempted to stress the social aspects of health. Existing conceptions remain analytically limited as they are predominantly used as descriptors for populational health. This article theorises social health as an analytical lens for making sense of the relations, affects and events where health unfolds and comes into expression. Drawing on social practice theory, feminist care ethics and posthumanism this conceptual paper re-imagines how social health might be conceived as lived social practices anchored in care. Care within our framework acknowledges the unavoidable interdependency foundational to the existence of beings and stresses the 'know how' and embodied practices of care in the mundane in order to emphasise that care itself is absolutely integral to the maintenance of social health. The article argues that health needs to be understood as a verb intrinsically (re)made in and through social contexts and structures and comprised of meaningful, human-human and human-non-human interactions. Ultimately, in theorising social health through mundane care practices, we hope to open up research to making sense of how the doing of health unfolds inside often banal, patterned forms of social activity. Such taken-for-granted social practices exemplify the often overlooked lived realities that comprise our health. To understand health in its own right, we argue, these everyday practices need to be interrogated.",
"Care has been theorised in relationship to eating disorders as a central consideration across diagnoses. In the context of avoidant restrictive food intake disorder (ARFID) specifically, there is room to further develop the nuances around layers of care involved in working towards well-being. In this paper, we engage with the stories of caregivers of people with ARFID, exploring their pathways to care (or lack thereof) through the healthcare system in Aotearoa New Zealand. We explore the material, affective and relational aspects of care and care-seeking, engaging with the power and politics of care as it flows through care-seeking assemblages. Using postqualitative methods of analysis, we discuss how while participants were seeking care, they received (or, at times, did not receive) treatment, and unpack how care and treatment are not always synonymous. We work up extracts from parents' stories surrounding their caring for their children and how their actions were, at times, interpreted in ways that made them feel blame and shame rather than care. Participants' stories also offer glimmers of care within a resource-strapped healthcare system, which invite us to consider the potentiality of a relational ethics of care as an assemblage-shifting moment.",
'A dB dynamic range and cm spatial resolution tunable photon-counting optical time-domain reflectometer (PC-OTDR) is presented along with a Field Programmable Gate Array (FPGA)-based detection management system that allows several regions of the fiber to be interrogated by the same optical pulse, increasing the data acquisition rate when compared to previous solutions. The optical pulse generation is implemented by a tunable figure- passive mode-locked laser providing pulses with the desired bandwidth and center wavelength for WDM applications in the C-band. The acquisition rate is limited by the afterpulse effect and dead time of the employed gated avalanche single-photon detectors. The devised acquisition system not only allows for centimeter-resolution monitoring of fiber links as long as km in under minutes but is also readily adapted to any other photon-counting strategy for increased acquisition rate. The system provides a -fold decrease in acquisition times when compared with state-of-the-art solutions, allowing affordable times in centimeter-resolution long-distance fiber measurements.',
]
embeddings = model.encode(sentences)
print(embeddings.shape)
# [3, 1024]
# Get the similarity scores for the embeddings
similarities = model.similarity(embeddings, embeddings)
print(similarities.shape)
# [3, 3]
```
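The matrix returned by `model.similarity` holds one score per pair of inputs, with each text's similarity to itself on the diagonal. As a minimal sketch of how such a matrix might be read, the snippet below picks the closest *other* text for every row. The helper `nearest_neighbours` and the toy scores are illustrative assumptions, not output from this model; a real result from `model.similarity` can be converted with `.tolist()` first.

```python
def nearest_neighbours(scores):
    """For each row, return the index of the highest-scoring *other* item."""
    result = []
    for i, row in enumerate(scores):
        # Skip the diagonal: every text is trivially most similar to itself.
        candidates = [(score, j) for j, score in enumerate(row) if j != i]
        _, best_j = max(candidates)
        result.append(best_j)
    return result


# Toy 3x3 similarity matrix (made-up values, not produced by the model).
toy_scores = [
    [1.00, 0.80, 0.10],
    [0.80, 1.00, 0.15],
    [0.10, 0.15, 1.00],
]
print(nearest_neighbours(toy_scores))  # [1, 0, 1]
```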
<!--
### Direct Usage (Transformers)
<details><summary>Click to see the direct usage in Transformers</summary>
</details>
-->
<!--
### Downstream Usage (Sentence Transformers)
You can finetune this model on your own dataset.
<details><summary>Click to expand</summary>
</details>
-->
<!--
### Out-of-Scope Use
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
-->
## Evaluation
### Metrics
#### Triplet
* Datasets: `modernBERT` and `modernBERT_disciplines`
* Evaluated with [<code>TripletEvaluator</code>](https://sbert.net/docs/package_reference/sentence_transformer/evaluation.html#sentence_transformers.evaluation.TripletEvaluator)
| Metric | modernBERT | modernBERT_disciplines |
|:--------------------|:-----------|:-----------------------|
| **cosine_accuracy** | **0.9847** | **0.9789** |
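As a rough illustration of what `cosine_accuracy` measures: a triplet counts as correct when the anchor is more similar to the positive than to the negative, and the metric is the fraction of correct triplets. The sketch below is plain Python on made-up 2-D vectors, not the evaluator's own implementation; the function names and the zero-margin comparison are assumptions for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def triplet_cosine_accuracy(triplets):
    """Fraction of (anchor, positive, negative) triplets ranked correctly."""
    correct = 0
    for anchor, positive, negative in triplets:
        if cosine_similarity(anchor, positive) > cosine_similarity(anchor, negative):
            correct += 1
    return correct / len(triplets)


# Toy embeddings (hypothetical values, not produced by the model).
toy_triplets = [
    ([1.0, 0.0], [0.9, 0.1], [0.0, 1.0]),   # positive is closer: correct
    ([0.0, 1.0], [0.1, 0.9], [1.0, 0.0]),   # positive is closer: correct
    ([1.0, 1.0], [1.0, -1.0], [1.0, 0.9]),  # negative is closer: wrong
    ([1.0, 0.0], [1.0, 0.2], [-1.0, 0.0]),  # positive is closer: correct
]
print(triplet_cosine_accuracy(toy_triplets))  # 0.75
```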
<!--
## Bias, Risks and Limitations
*What are the known or foreseeable issues stemming from this model? You could also flag here known failure cases or weaknesses of the model.*
-->
<!--
### Recommendations
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
-->
## Training Details
### Training Dataset
#### Unnamed Dataset
* Size: 7,828 training samples
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
* Approximate statistics based on the first 1000 samples:
| | anchor | positive | negative |
|:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
| type | string | string | string |
| details | <ul><li>min: 86 tokens</li><li>mean: 240.32 tokens</li><li>max: 633 tokens</li></ul> | <ul><li>min: 84 tokens</li><li>mean: 243.66 tokens</li><li>max: 668 tokens</li></ul> | <ul><li>min: 88 tokens</li><li>mean: 237.15 tokens</li><li>max: 681 tokens</li></ul> |
* Samples:
| anchor | positive | negative |
|:-------|:---------|:---------|
| <code>Flash memory devices are investigated to confirm their application as physically unclonable functions (PUFs). Inherent fluctuations in the characteristics of flash memory devices, even with identical fabrication processes, produce different outputs, which are useful for device fingerprints. A difference in programming/erasing efficiency arises from a widely distributed threshold voltage. However, statistical fluctuations in the threshold voltage represent an advantage for PUF applications. The characteristics of PUFs, such as their unclonability, uncontrollability, unpredictability, and robustness, are investigated using fabricated flash memory devices. A simulation study is performed to support the experimental results and to show that the unpredictability is induced by variations in the gate dielectric thickness.</code> | <code>Ternary Content Addressable Memory (TCAM) is used in applications that require a low power dissipation and fast data retrieval. This paper presents a domain wall-based spintronic TCAM cell. The proposed design exploits the resistive behavior of this nonvolatile memory, reduces total power dissipation by reducing the voltage swing at the match line, and minimizes delay by employing a tiny sensing unit within each cell. Our experimental evaluation on nm technology for a -bit word-size TCAM at an V supply voltage and mV sense margin show that the delay is less than ps. The per-bit search energy is approximately fJ. Experimental evaluation on benchmark applications on the AMD Southern Islands GPU reveal that the GPU always dissipates less power when enhanced with the proposed TCAM design. Furthermore, the proposed method consumes at least % less energy when compared to state-of-the-art TCAM designs.</code> | <code>Abstract. The main focus of the paper is to present a flood and landslide early warning system, named HEWS (Hydrohazards Early Warning System), specifically developed for the Civil Protection Department of Sicily, based on the combined use of rainfall thresholds, soil moisture modelling and quantitative precipitation forecast (QPF). The warning system is referred to different Alert Zones in which Sicily has been divided into and based on a threshold system of three different increasing critical levels: ordinary, moderate and high. In this system, for early flood warning, a Soil Moisture Accounting (SMA) model provides daily soil moisture conditions, which allow to select a specific set of three rainfall thresholds, one for each critical level considered, to be used for issue the alert bulletin. Wetness indexes, representative of the soil moisture conditions of a catchment, are calculated using a simple, spatially-lumped rainfallstreamflow model, based on the SCS-CN method, and on the u...</code> |
| <code>A new method for the determination of trace levels of bromates by selective membrane collection is presented. Various membranes containing a few micrograms of different complexing reagents in a poly(vinyl chloride) matrix were tested. These membranes were produced on the surface of quartz glass (reflectors), and they were immersed in solutions containing bromate and bromide ions. At the first stage the prepared membranes collected both bromate and bromide ions, so different bromide masking agents were put in the analyzed solutions to avoid bromide collection. By the end of the equilibration time, the reflectors were left to dry, and they were analyzed by total reflection X-ray fluorescence (TXRF). The poly(vinyl chloride) with aliquat- membrane and o-dianisidin complexing agent gave the best results. The minimum detection limit was equal to ng/mL for ultrapure water and ng/mL for drinking water.</code> | <code>ADVERTISEMENT RETURN TO ISSUEPREVArticleNEXTVoltammetric anion responsive sensors based on modulation of ion permeability through Langmuir-Blodgett films containing synthetic anion receptorsShinobu. Nagase, Masamitsu. Kataoka, Ryuichi. Naganawa, Ryoko. Komatsu, Kazunori. Odashima, and Yoshio. UmezawaCite this: Anal. Chem. , , , 00000000Publication Date (Print):July , 0000Publication History Published online0 May 0000Published inissue July 0000https://pubs.acs.org/doi/ /ac00000a000https://doi.org/ /ac00000a000research-articleACS PublicationsRequest reuse permissionsArticle Views000Altmetric-Citations00LEARN ABOUT THESE METRICSArticle Views are the COUNTER-compliant sum of full text article downloads since November (both PDF and HTML) across all institutions and individuals. These metrics are regularly updated to reflect usage leading up to the last few days.Citations are the number of other articles citing this article, calculated by Crossref and updated daily. Find more information abo...</code> | <code>This study investigated whether performance of an interceptive skill requires an intact visual-perception-action cycle. Eleven skilled male Australian rules football athletes (M age = , SD = ) were recruited from an elite developmental pathway squad for a within-subject study. Participants were required to kick a ball directly at a goal from a -meter distance while wearing a pair of stroboscopic glasses. The glasses were used to create four vision conditions. Condition one kept intact the visual-perception-action cycle with uninterrupted vision of the motor skill. Three other conditions included stroboscopic vision that presented temporal samples of vision, which interrupted the perception-action cycle through progressive increases to intermittent vision occlusion of the motor skill. Goal kick error of ball position relative to a central target line within the goal and number of successful goals kicked were measured. Written report of internal and external focus of attention was also m...</code> |
| <code>The study aimed to determine the effectiveness of Contextual Teaching and Learning (CTL) in reducing and improving learning outcomes and math anxiety among students at a private elementary school in Indonesia. The research utilized a one-group control pre-posttest design with a sample of 0th-grade students. The study used a combination of pre-test and post-test and a closed-ended questionnaire as the data collection instruments. The independent variable in the study was CTL, while the dependent variables were learning outcomes and math anxiety. The paired t-test showed a significant increase in the students' average learning outcomes and a decrease in the average math anxiety levels. The findings suggest that implementing CTL is a practical approach to reducing math anxiety and improving student learning outcomes.</code> | <code>This study aims to determine the problem-solving ability of field independent (FI) and field dependent (FD) students in solving HOTS story problems. This type of research is qualitative research. The research strategy used is a descriptive model. This research was carried out at a junior school in Malang, Indonesia. The respondent was tenth-grade students. Data collection methods in this study include tests and interviews. Data analysis techniques include data collection, reduction, presentation, and concluding. The results of this study show that FI and FD students understand the problem. There is no difference between the two; FI and FD students are good at understanding the problem. FI students plan solutions well and can correctly create mathematical models, while FD students have difficulty developing mathematical models. In getting answers, FI and FD students have something in common: they are not quite right in the final solution.</code> | <code>The recently proposed recursive least-squares (RLS) algorithm for trilinear forms, namely RLS-TF, was designed for the identification of third-order tensors of rank one. In this context, a high-dimension system identification problem can be efficiently addressed (gaining in terms of both performance and complexity) based on tensor decompositions and modelling. In this paper, following the framework of the RLS-TF, we propose a regularized version of this algorithm, where the regularization terms are incorporated within the cost functions. Furthermore, the optimal regularization parameters are derived, aiming at attenuating the effects of the system noise. Simulation results support the performance features of the proposed algorithm, especially in terms of its robustness in noisy environments.</code> |
* Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
```json
{
"distance_metric": "TripletDistanceMetric.COSINE",
"triplet_margin": 0.05
}
```
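With the cosine distance metric and a margin of 0.05, the loss for one triplet is `max(d(anchor, positive) - d(anchor, negative) + 0.05, 0)`, where `d` is one minus the cosine similarity. The snippet below is a minimal pure-Python sketch of that formula on hand-picked toy vectors; the library itself works on batched tensors and averages over the batch, and the function names here are illustrative only.

```python
import math

TRIPLET_MARGIN = 0.05  # "triplet_margin" from the parameters above


def cosine_distance(u, v):
    """Cosine distance: 1 - cosine similarity."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def triplet_loss(anchor, positive, negative, margin=TRIPLET_MARGIN):
    """Hinge on the gap between positive and negative distances."""
    gap = cosine_distance(anchor, positive) - cosine_distance(anchor, negative)
    return max(gap + margin, 0.0)


# Positive identical to the anchor, negative orthogonal: no loss.
easy = triplet_loss([1.0, 0.0], [1.0, 0.0], [0.0, 1.0])
# Positive orthogonal, negative identical: maximal violation for these vectors.
hard = triplet_loss([1.0, 0.0], [0.0, 1.0], [1.0, 0.0])
print(easy, hard)
```

A triplet stops contributing to the gradient once the negative is at least 0.05 further from the anchor than the positive is, which is why the logged training loss values are so small.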
### Evaluation Dataset
#### Unnamed Dataset
* Size: 391 evaluation samples
* Columns: <code>anchor</code>, <code>positive</code>, and <code>negative</code>
* Approximate statistics based on the first 391 samples:
| | anchor | positive | negative |
|:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
| type | string | string | string |
| details | <ul><li>min: 85 tokens</li><li>mean: 237.84 tokens</li><li>max: 629 tokens</li></ul> | <ul><li>min: 93 tokens</li><li>mean: 239.31 tokens</li><li>max: 610 tokens</li></ul> | <ul><li>min: 83 tokens</li><li>mean: 234.79 tokens</li><li>max: 499 tokens</li></ul> |
* Samples:
| anchor | positive | negative |
|:-------|:---------|:---------|
| <code>The aim of the study was to determine the relationship between emotional intelligence and cohesion in a sports team of girls engaged in synchronized figure skating. The following psychological tests were used in the study: the Emotional Intelligence test by D.V. Lyusin, a test to determine the index of group cohesion of the Sisor. The study was conducted on the basis of the sports school "Yunost" in Yekaterinburg. Two teams of different age groups took part in the experiment: athletes performing in the category of "novices" ( years old), girls performing in the team of "CMS" ( - years old). Testing was conducted twice: at the beginning of the season and after the competitive season. The study revealed positive dynamics of the development of cohesion in both teams. It also revealed reliable relationships between interpersonal emotional intelligence and the level of cohesion in the team. Further research may be aimed at developing a strategy to increase emotional intelligence as a factor...</code> | <code>Recreational swimming can be used as a reliable preventive measure for those diseases that are widespread among students. The purpose of the research is to study the effect of swimming on the functional state of students. The study involved male students who are selfemployed in swimming and male students who are professionally engaged in the swimming section. The research methods used samples of Martinet-Kushelevsky, Rufier, Stange and Genchi, as well as chest excursions. It was revealed that students, who practice swimming in the section, have more favorable conditions for a comprehensive effect on the body than students who swim independently, due to a greater load during training and their systematic nature.</code> | <code>Mg Ni + x% Ti . Mn . V . ( x = ,00,and ) composites were prepared by hydriding combustion synthesis( HCS) and the HCS products were mechanically milled( MM) to obtain Mg-based hydrogen-storage composites. The dehydriding properties,phase structure,surface morphology,and particle composition were studied by pressure-composition-temperature( pcT),X-ray diffraction( XRD) and scanning electron microscopy( SEM). Results showed that addition of %( mass fraction) Ti . Mn . V . exhibited the best desorption property for the HCS + MM product of Mg Ni , which could completely desorb . % H in s at K. The apparent dehydrogenation activation energy of the system was decreased to . kJ / mol from . kJ / mol of Mg Ni . The improvement of the desorption property could be attributed to the enhancement of diffusion and the hydrogen pumpingof Ti . Mn . V . .</code> |
| <code>This article has been retracted: please see Elsevier Policy on Article Withdrawal ( ). This article has been retracted at the request of authors due to scientific errors reported by authors. The author reported errors are: : In the " Case Description" section, Fig. A0 (wind and PV output power) is the input data for the simulation calculation. The authors report that, due to an oversight, they did not use real wind and PV output power data, which would lead to inaccurate results for the system simulation calculations. : For the " Model solving algorithm", the authors found that it is incorrect to use the properties of Gaussian functions to improve the CDE algorithm because Gaussian functions do not have the properties of concave functions. This is evidenced in the literature "DOI: : Fig. (Iterative Convergence Curve of Rastrigin Function) is tested using the benchmark test function (Rastrigin function) in order to demonstrate the feasibility of the GCDE algorithm. However, it is clear ...</code> | <code>Energy accessibility especially electrical energy is considered as one of the most appealing factors to achieve energy sustainability. The purpose of this study is to investigate energy sustainability using renewable energies for two high potential cities in the south-east of Iran until the year . In this regard, Homer software is used to evaluate economic and technical analyses of PV-wind-diesel hybrid system for the two cities by the data gathering which was collected from Iran's meteorological organization. Therefore, the average of solar radiation per month for Zabol and Zahak were about and (h/d). Also, mean wind speeds are calculated m/s and m/s for Zabol and Zahak respectively which proposed that these cities have high potential in order to electrical production by a hybrid system. Furthermore, the amount of electricity production by PV array for Zabol and Zahak were (kWh/yr) and (kWh/yr) respectively, and the amount of electricity production by wind turbine were (kWh/yr) and (k...</code> | <code>The philosophy that built by German Idealism is obtained and never neglected religion, this is not about the religious dogmas or the fantasy and legendary nature of religion, but it is about the spirit and the crux of religion. Nevertheless, there is always struggled to deprive it from fantasies and rebuilt by philosophical ideas. These ideal philosophers are asserted to reconstruct the stories and imaginary schemes of religion into philosophical and rational thinking. There is a change in the result of this process which is religion is retreated and the metaphysics is slightly appeared. In other word, this change is directed from revelation to metaphysical views. In the light of this, the German Idealism is taking two different ways toward religion: the negative direction; which is involved to the critical studies of the basis and construction of religion, and the positive direction; this direction is returned to religion, but this return is happened after reconstruct religion by the ...</code> |
| <code>In this paper measurements of momentum and current transport caused by current driven tearing instability are reported. The measurements are done in the Madison Symmetric Torus reversed-field pinch [R. N. Dexter, D. W. Kerst, T. W. Lovell, S. C. Prager, and J. C. Sprott, Fusion Technol. , ( )] in a regime with repetitive bursts of tearing instability causing magnetic field reconnection. It is established that the plasma parallel momentum profile flattens during these reconnection events: The flow decreases in the core and increases at the edge. The momentum relaxation phenomenon is similar in nature to the well established relaxation of the parallel electrical current and could be a general feature of self-organized systems. The measured fluctuation-induced Maxwell and Reynolds stresses, which govern the dynamics of plasma flow, are large and almost balance each other such that their difference is approximately equal to the rate of change of plasma momentum. The Hall dynamo, which is d...</code> | <code>We present measurements of magnetic fields generated in laser-driven coil targets irradiated by laser pulses of nanosecond duration, m wavelength, J energy, and W/cm0 intensity, at the LULI0000 facility. Using two perpendicular probing axes, proton deflectometry is used to characterize the coil current and static charge at different times. Results reveal various deflection features that can be unambiguously linked to a looping quasi-steady current of well-understood polarity or to a static charging of the coil surface. Measured currents are broadly consistent with predictions from a laser-driven diode-current source and lumped circuit model, supporting the quasi-steady assessment of the discharges. Peak magnetic fields of T at the center of -m-diameter coils, obtained at the moderate laser intensity, open up the use of such laser-driven coil targets at facilities worldwide to study numerous phenomena in magnetized high-energy-density plasmas, and its potential applications.</code> | <code>EU , , , , , . . . , . . , . , - -EU . .In August , the UK launched a new export strategy to increase UK total exports as a proportion of gross domestic product (GDP) to % and to build trading relationships around the world after Brexit. And the government aims to strengthen UK's position as one of the 00st century's great trading nations and to expand the export of traders by setting the five principle. These principles are a business-led approach, doing what only government can do, joining up across government with local partners and the private sector, digital by design and value for money. This paper examines the background, purpose and main contents of the UK new export strategy in UK and the countermeasures for the new UK export strategy. First of all, we should prepare a scenarios based on directions of Brexit. Second, it is necessary to discuss the redefinition of relationship with Korea-UK and Korea-EU. And finally, Korean companies should enter the UK by utilizing the e-comme...</code> |
* Loss: [<code>TripletLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#tripletloss) with these parameters:
```json
{
"distance_metric": "TripletDistanceMetric.COSINE",
"triplet_margin": 0.05
}
```
### Training Hyperparameters
#### Non-Default Hyperparameters
- `eval_strategy`: steps
- `per_device_train_batch_size`: 4
- `per_device_eval_batch_size`: 4
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `num_train_epochs`: 2
- `warmup_ratio`: 0.1
- `batch_sampler`: no_duplicates
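These settings, together with the 7,828 training samples, fix the length of the training schedule. The arithmetic below is a back-of-the-envelope sketch that assumes a single device, `gradient_accumulation_steps` of 1 (as listed under All Hyperparameters), no batches dropped by the `no_duplicates` sampler, and warm-up steps rounded up from `warmup_ratio`; treat the rounding rule as an assumption about how the trainer resolves the ratio.

```python
import math

train_samples = 7828             # size of the training dataset
per_device_train_batch_size = 4
gradient_accumulation_steps = 1
num_train_epochs = 2
warmup_ratio = 0.1
num_devices = 1                  # assumption: single-device training

effective_batch = (
    per_device_train_batch_size * gradient_accumulation_steps * num_devices
)
steps_per_epoch = math.ceil(train_samples / effective_batch)
total_steps = steps_per_epoch * num_train_epochs
warmup_steps = math.ceil(total_steps * warmup_ratio)  # assumed rounding rule

print(steps_per_epoch, total_steps, warmup_steps)  # 1957 3914 392
print(round(100 / steps_per_epoch, 4))             # 0.0511
```

The last line matches the epoch value of 0.0511 reported at step 100 in the training logs, which supports the single-device, batch-size-4 reading.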
#### All Hyperparameters
<details><summary>Click to expand</summary>
- `overwrite_output_dir`: False
- `do_predict`: False
- `eval_strategy`: steps
- `prediction_loss_only`: True
- `per_device_train_batch_size`: 4
- `per_device_eval_batch_size`: 4
- `per_gpu_train_batch_size`: None
- `per_gpu_eval_batch_size`: None
- `gradient_accumulation_steps`: 1
- `eval_accumulation_steps`: None
- `torch_empty_cache_steps`: None
- `learning_rate`: 1e-05
- `weight_decay`: 0.01
- `adam_beta1`: 0.9
- `adam_beta2`: 0.999
- `adam_epsilon`: 1e-08
- `max_grad_norm`: 1.0
- `num_train_epochs`: 2
- `max_steps`: -1
- `lr_scheduler_type`: linear
- `lr_scheduler_kwargs`: {}
- `warmup_ratio`: 0.1
- `warmup_steps`: 0
- `log_level`: passive
- `log_level_replica`: warning
- `log_on_each_node`: True
- `logging_nan_inf_filter`: True
- `save_safetensors`: True
- `save_on_each_node`: False
- `save_only_model`: False
- `restore_callback_states_from_checkpoint`: False
- `no_cuda`: False
- `use_cpu`: False
- `use_mps_device`: False
- `seed`: 42
- `data_seed`: None
- `jit_mode_eval`: False
- `use_ipex`: False
- `bf16`: False
- `fp16`: False
- `fp16_opt_level`: O1
- `half_precision_backend`: auto
- `bf16_full_eval`: False
- `fp16_full_eval`: False
- `tf32`: None
- `local_rank`: 0
- `ddp_backend`: None
- `tpu_num_cores`: None
- `tpu_metrics_debug`: False
- `debug`: []
- `dataloader_drop_last`: False
- `dataloader_num_workers`: 0
- `dataloader_prefetch_factor`: None
- `past_index`: -1
- `disable_tqdm`: False
- `remove_unused_columns`: True
- `label_names`: None
- `load_best_model_at_end`: False
- `ignore_data_skip`: False
- `fsdp`: []
- `fsdp_min_num_params`: 0
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
- `fsdp_transformer_layer_cls_to_wrap`: None
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
- `deepspeed`: None
- `label_smoothing_factor`: 0.0
- `optim`: adamw_torch
- `optim_args`: None
- `adafactor`: False
- `group_by_length`: False
- `length_column_name`: length
- `ddp_find_unused_parameters`: None
- `ddp_bucket_cap_mb`: None
- `ddp_broadcast_buffers`: False
- `dataloader_pin_memory`: True
- `dataloader_persistent_workers`: False
- `skip_memory_metrics`: True
- `use_legacy_prediction_loop`: False
- `push_to_hub`: False
- `resume_from_checkpoint`: None
- `hub_model_id`: None
- `hub_strategy`: every_save
- `hub_private_repo`: None
- `hub_always_push`: False
- `gradient_checkpointing`: False
- `gradient_checkpointing_kwargs`: None
- `include_inputs_for_metrics`: False
- `include_for_metrics`: []
- `eval_do_concat_batches`: True
- `fp16_backend`: auto
- `push_to_hub_model_id`: None
- `push_to_hub_organization`: None
- `mp_parameters`:
- `auto_find_batch_size`: False
- `full_determinism`: False
- `torchdynamo`: None
- `ray_scope`: last
- `ddp_timeout`: 1800
- `torch_compile`: False
- `torch_compile_backend`: None
- `torch_compile_mode`: None
- `dispatch_batches`: None
- `split_batches`: None
- `include_tokens_per_second`: False
- `include_num_input_tokens_seen`: False
- `neftune_noise_alpha`: None
- `optim_target_modules`: None
- `batch_eval_metrics`: False
- `eval_on_start`: False
- `use_liger_kernel`: False
- `eval_use_gather_object`: False
- `average_tokens_across_devices`: False
- `prompts`: None
- `batch_sampler`: no_duplicates
- `multi_dataset_batch_sampler`: proportional
</details>
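The combination `lr_scheduler_type: linear`, `warmup_ratio: 0.1`, and `warmup_steps: 0` listed above means the learning rate ramps up linearly over roughly the first 10% of optimizer steps and then decays linearly to zero. The sketch below illustrates that shape as a multiplier on the peak learning rate. It is a minimal pure-Python approximation, not the Trainer's own code; the function name `linear_warmup_multiplier` and the `total_steps = 1000` value are hypothetical and chosen only for illustration.

```python
import math


def linear_warmup_multiplier(step: int, total_steps: int, warmup_ratio: float = 0.1) -> float:
    """Return the factor applied to the peak learning rate at a given step.

    Assumes the warmup length is derived from the ratio because
    `warmup_steps` is 0 in the hyperparameters above.
    """
    warmup_steps = math.ceil(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to the peak learning rate.
        return step / max(1, warmup_steps)
    # Linear decay from the peak down to 0 at the final step.
    return max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))


if __name__ == "__main__":
    total_steps = 1000  # hypothetical run length, not taken from this card
    for step in (0, 50, 100, 550, 1000):
        print(step, round(linear_warmup_multiplier(step, total_steps), 3))
```

Multiplying the returned factor by the configured peak learning rate gives the effective learning rate at that step.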
### Training Logs
| Epoch | Step | Training Loss | Validation Loss | modernBERT_cosine_accuracy | modernBERT_disciplines_cosine_accuracy |
|:------:|:----:|:-------------:|:---------------:|:--------------------------:|:--------------------------------------:|
| 0 | 0 | - | - | 0.8951 | - |
| 0.0511 | 100 | 0.0064 | 0.0049 | 0.9616 | - |
| 0.1022 | 200 | 0.002 | 0.0071 | 0.9565 | - |
| 0.1533 | 300 | 0.0076 | 0.0034 | 0.9795 | - |
| 0.2044 | 400 | 0.0074 | 0.0039 | 0.9668 | - |
| 0.2555 | 500 | 0.0036 | 0.0036 | 0.9693 | - |
| 0.3066 | 600 | 0.0035 | 0.0029 | 0.9770 | - |
| 0.3577 | 700 | 0.004 | 0.0035 | 0.9693 | - |
| 0.4088 | 800 | 0.0027 | 0.0034 | 0.9770 | - |
| 0.4599 | 900 | 0.0044 | 0.0032 | 0.9719 | - |
| 0.5110 | 1000 | 0.0037 | 0.0053 | 0.9565 | - |
| 0.5621 | 1100 | 0.0048 | 0.0029 | 0.9795 | - |
| 0.6132 | 1200 | 0.0032 | 0.0031 | 0.9744 | - |
| 0.6643 | 1300 | 0.0023 | 0.0036 | 0.9744 | - |
| 0.7154 | 1400 | 0.0044 | 0.0029 | 0.9821 | - |
| 0.7665 | 1500 | 0.0022 | 0.0032 | 0.9795 | - |
| 0.8176 | 1600 | 0.0036 | 0.0034 | 0.9770 | - |
| 0.8687 | 1700 | 0.0022 | 0.0031 | 0.9821 | - |
| 0.9198 | 1800 | 0.0028 | 0.0025 | 0.9821 | - |
| 0.9709 | 1900 | 0.0054 | 0.0025 | 0.9821 | - |
| 1.0220 | 2000 | 0.003 | 0.0029 | 0.9770 | - |
| 1.0731 | 2100 | 0.0018 | 0.0026 | 0.9795 | - |
| 1.1242 | 2200 | 0.0021 | 0.0024 | 0.9847 | - |
| 1.1753 | 2300 | 0.0015 | - | - | 0.9789 |
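The `cosine_accuracy` columns above report, for a set of (anchor, positive, negative) triplets, the fraction in which the anchor is more similar to the positive than to the negative under cosine similarity. The following is a minimal pure-Python sketch of that calculation on toy 2-D vectors. The helper names and the toy data are hypothetical; the actual evaluation runs on model embeddings and may differ in details such as tie handling.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def cosine_accuracy(triplets):
    """Fraction of triplets where the positive is closer to the anchor than the negative."""
    correct = sum(
        1
        for anchor, positive, negative in triplets
        if cosine_similarity(anchor, positive) > cosine_similarity(anchor, negative)
    )
    return correct / len(triplets)


# Toy embeddings (hypothetical): three triplets are ranked correctly, one is not.
toy_triplets = [
    ([1.0, 0.0], [0.9, 0.1], [0.0, 1.0]),
    ([0.0, 1.0], [0.1, 0.9], [1.0, 0.0]),
    ([1.0, 1.0], [1.0, -1.0], [1.0, 0.9]),  # negative is closer: counted as an error
    ([1.0, 0.0], [1.0, 0.2], [-1.0, 0.0]),
]

if __name__ == "__main__":
    print(cosine_accuracy(toy_triplets))  # 0.75
```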
### Framework Versions
- Python: 3.10.12
- Sentence Transformers: 3.3.1
- Transformers: 4.48.0.dev0
- PyTorch: 2.5.1+cu121
- Accelerate: 1.2.1
- Datasets: 3.2.0
- Tokenizers: 0.21.0
## Citation
### BibTeX
#### Sentence Transformers
```bibtex
@inproceedings{reimers-2019-sentence-bert,
    title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
    author = "Reimers, Nils and Gurevych, Iryna",
    booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
    month = "11",
    year = "2019",
    publisher = "Association for Computational Linguistics",
    url = "https://arxiv.org/abs/1908.10084",
}
```
#### TripletLoss
```bibtex
@misc{hermans2017defense,
    title={In Defense of the Triplet Loss for Person Re-Identification},
    author={Alexander Hermans and Lucas Beyer and Bastian Leibe},
    year={2017},
    eprint={1703.07737},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}
```
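For readers unfamiliar with the loss cited above, the triplet loss penalizes a triplet whenever the anchor is not closer to the positive than to the negative by at least a margin: `max(d(a, p) - d(a, n) + margin, 0)`. The sketch below computes it for a single triplet in pure Python. It assumes Euclidean distance; the margin values and the vectors are hypothetical and are not taken from this model's training configuration.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def triplet_loss(anchor, positive, negative, margin):
    """Hinge-style triplet loss for one (anchor, positive, negative) triplet."""
    return max(euclidean(anchor, positive) - euclidean(anchor, negative) + margin, 0.0)


# Hypothetical toy vectors: d(anchor, positive) = 5, d(anchor, negative) = 10.
anchor = [0.0, 0.0]
positive = [3.0, 4.0]
negative = [6.0, 8.0]

if __name__ == "__main__":
    # The negative is already 5 units farther than the positive, so a margin of 1 is satisfied.
    print(triplet_loss(anchor, positive, negative, margin=1.0))  # 0.0
    # A margin of 7 is not satisfied; the shortfall of 2 becomes the loss.
    print(triplet_loss(anchor, positive, negative, margin=7.0))  # 2.0
```

In training, this per-triplet value is typically averaged over a batch of embedded triplets.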
<!--
## Glossary
*Clearly define terms in order to be accessible across audiences.*
-->
<!--
## Model Card Authors
*Lists the people who create the model card, providing recognition and accountability for the detailed work that goes into its construction.*
-->
<!--
## Model Card Contact
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
-->