Update README.md
README.md
CHANGED
@@ -1,563 +1,51 @@
|
|
1 |
---
|
2 |
tags:
|
3 |
-
-
|
4 |
-
-
|
5 |
-
-
|
6 |
-
-
|
7 |
-
-
|
8 |
-
-
|
9 |
-
|
10 |
-
|
11 |
-
- source_sentence: 'Constraints on the range lambda of Yukawa-like modifications to
|
12 |
-
the Newtonian inverse-square law of gravitation from Solar System planetary motions
|
13 |
-
In this paper we use the latest corrections to the Newton-Einstein secular perihelion
|
14 |
-
rates of some planets of the Solar System, phenomenologically estimated with the
|
15 |
-
EPM2004 ephemerides by the Russian astronomer E.V. Pitjeva, to put severe constraints
|
16 |
-
on the range parameter lambda characterizing the Yukawa-like modifications of
|
17 |
-
the Newtonian inverse-square law of gravitation. It turns out that the range cannot
|
18 |
-
exceed about one tenth of an Astronomical Unit. We assumed neither equivalence
|
19 |
-
principle violating effects nor spatial variations of alpha and lambda .
|
20 |
-
This finding may have important consequences on all the modified theories of gravity
|
21 |
-
involving Yukawa-type terms with range parameters much larger than the Solar System
|
22 |
-
size. However, caution is advised since we, currently have at our disposal only
|
23 |
-
the periehlion extra-rates estimated by Pitjeva: if and when other groups will
|
24 |
-
estimate their own corrections to the secular motion of perihelia, more robust
|
25 |
-
and firm tests may be conducted.'
|
26 |
-
sentences:
|
27 |
-
- Ore extensions satisfying a polynomial identity Necessary and sufficient conditions
|
28 |
-
for an Ore extension S R x; si, de to be a rm PI ring are given in the case si is
|
29 |
-
an injective endomorphism of a semiprime ring R satisfying the rm ACC on
|
30 |
-
annihilators. Also, for an arbitrary endomorphism tau of R , a characterization
|
31 |
-
of Ore extensions R x; tau which are rm PI rings is given, provided the
|
32 |
-
coefficient ring R is noetherian.
|
33 |
-
- LARES WEBER-SAT and the equivalence principle It has often been claimed that the
|
34 |
-
proposed Earth artificial satellite LARES WEBER-SAT-whose primary goal is, in
|
35 |
-
fact, the measurement of the general relativistic Lense-Thirring effect at a some
|
36 |
-
percent level-would allow to greatly improve, among (many) other things, the present-day
|
37 |
-
(10 -13) level of accuracy in testing the equivalence principle as well. Recent
|
38 |
-
claims point towards even two orders of magnitude better, i.e. 10 -15. In this
|
39 |
-
note we show that such a goal is, in fact, unattainable by many orders of magnitude
|
40 |
-
being, instead, the achievable level of the order of 10 -9.
|
41 |
-
- The Field Perturbation Theory of the Double Correlated Phase in High Temperature
|
42 |
-
Superconductors The Double-Correlated phase in HTSC, and its treatment by field
|
43 |
-
perturbation theory, is established. In particular, we define the ground state,
|
44 |
-
the quasi-particle excitations, and construct an appropriate field. We also derive
|
45 |
-
the unperturbed Hamiltonian, and the propagators for the unperturbed state. Then
|
46 |
-
we discuss the perturbation Hamiltonian, and show that the Hartree diagram is
|
47 |
-
significant for both the pseudogap and the superconductive order parameter, and
|
48 |
-
suggest that it yields the major contribution to these parameters.
|
49 |
-
- source_sentence: Coupling of whispering-gallery modes in size-mismatched microdisk
|
50 |
-
photonic molecules Mechanisms of whispering-gallery (WG) modes coupling in microdisk
|
51 |
-
photonic molecules (PMs) with slight and significant size mismatch are numerically
|
52 |
-
investigated. The results reveal two different scenarios of modes interaction
|
53 |
-
depending on the degree of this mismatch and offer new insight into how PM parameters
|
54 |
-
can be tuned to control and modify WG-modes wavelengths and Q-factors. From a
|
55 |
-
practical point of view, these findings offer a way to fabricate PM microlaser
|
56 |
-
structures that exhibit low thresholds and directional emission, and at the same
|
57 |
-
time are more tolerant to fabrication errors than previously explored coupled-cavity
|
58 |
-
structures composed of identical microresonators.
|
59 |
-
sentences:
|
60 |
-
- Silver mode for heavy Higgs search in the presence of a fourth SM family We investigate
|
61 |
-
the possible enhancement to the discovery of the heavy Higgs boson through the
|
62 |
-
possible fourth SM family heavy neutrino. Using the channel h- v4 v4- mu W mu
|
63 |
-
W- mu j j mu j j, it is found that for certain ranges of Higgs boson and v4 masses
|
64 |
-
LHC could discover both of them simultaneously with 1 fb -1 integrated luminosity.
|
65 |
-
- Wavelength-scale stationary-wave integrated Fourier-transform spectrometry Spectrometry
|
66 |
-
is a general physical-analysis approach for investigating light-matter interactions.
|
67 |
-
However, the complex designs of existing spectrometers render them resistant to
|
68 |
-
simplification and miniaturization, both of which are vital for applications in
|
69 |
-
micro- and nanotechnology and which are now undergoing intensive research. Stationary-wave
|
70 |
-
integrated Fourier-transform spectrometry (SWIFTS)-an approach based on direct
|
71 |
-
intensity detection of a standing wave resulting from either reflection (as in
|
72 |
-
the principle of colour photography by Gabriel Lippmann) or counterpropagative
|
73 |
-
interference phenomenon-is expected to be able to overcome this drawback. Here,
|
74 |
-
we present a SWIFTS-based spectrometer relying on an original optical near-field
|
75 |
-
detection method in which optical nanoprobes are used to sample directly the evanescent
|
76 |
-
standing wave in the waveguide. Combined with integrated optics, we report a way
|
77 |
-
of reducing the volume of the spectrometer to a few hundreds of cubic wavelengths.
|
78 |
-
This is the first attempt, using SWIFTS, to produce a very small integrated one-dimensional
|
79 |
-
spectrometer suitable for applications where microspectrometers are essential.
|
80 |
-
- 'Discussion of 2004 IMS Medallion Lecture: Local Rademacher complexities and
|
81 |
-
oracle inequalities in risk minimization by V. Koltchinskii Discussion of 2004
|
82 |
-
IMS Medallion Lecture: Local Rademacher complexities and oracle inequalities in
|
83 |
-
risk minimization by V. Koltchinskii arXiv:0708.0083'
|
84 |
-
- source_sentence: Additive preserving rank one maps on Hilbert C ast -modules In
|
85 |
-
this paper, we characterize a class of additive maps on Hilbert C ast -modules
|
86 |
-
which maps a rank one adjointable operators to another rank one operators.
|
87 |
-
sentences:
|
88 |
-
- The Statistics of the Points Where Nodal Lines Intersect a Reference Curve We
|
89 |
-
study the intersection points of a fixed planar curve Gamma with the nodal
|
90 |
-
set of a translationally invariant and isotropic Gaussian random field Psi(
|
91 |
-
bi r ) and the zeros of its normal derivative across the curve. The intersection
|
92 |
-
points form a discrete random process which is the object of this study. The field
|
93 |
-
probability distribution function is completely specified by the correlation G( bi
|
94 |
-
r - bi r ) Psi( bi r ) Psi( bi r ) . Given an arbitrary G( bi r - bi
|
95 |
-
r ) , we compute the two point correlation function of the point process on
|
96 |
-
the line, and derive other statistical measures (repulsion, rigidity) which characterize
|
97 |
-
the short and long range correlations of the intersection points. We use these
|
98 |
-
statistical measures to quantitatively characterize the complex patterns displayed
|
99 |
-
by various kinds of nodal networks. We apply these statistics in particular to
|
100 |
-
nodal patterns of random waves and of eigenfunctions of chaotic billiards. Of
|
101 |
-
special interest is the observation that for monochromatic random waves, the number
|
102 |
-
variance of the intersections with long straight segments grows like L ln L
|
103 |
-
, as opposed to the linear growth predicted by the percolation model, which was
|
104 |
-
successfully used to predict other long range nodal properties of that field.
|
105 |
-
- Concrete Classification and Centralizers of Certain mathbb Z 2 rtimes rm
|
106 |
-
SL (2, mathbb Z ) -actions We introduce a new class of actions of the group G on
|
107 |
-
finite von Neumann algebras and call them twisted Bernoulli shift actions. We
|
108 |
-
classify these actions up to conjugacy and give an explicit description of their
|
109 |
-
centralizers. We also distinguish many of those actions on the AFD mathrm II 1 factor
|
110 |
-
in view of outer conjugacy.
|
111 |
-
- Liquid-Solid Transition and Phase Diagram of 4He Confined in Nanoporous Glass
|
112 |
-
We have studied the liquid - solid (L-S) phase transition of 4He confined in
|
113 |
-
nanoporous glass, which has interconnected nanopores of 2.5 nm in diameter. The
|
114 |
-
L-S boundary is determined by the measurements of pressure and thermal response
|
115 |
-
during slow cooling and warming. Below 1 K, the freezing pressure is elevated
|
116 |
-
to 1.2 MPa from the bulk freezing pressure, and appears to be independent of temperature.
|
117 |
-
The T-independent L-S boundary implies the existence of a localized Bose-Einstein
|
118 |
-
condensation state, in which long-range superfluid coherence is destroyed by narrowness
|
119 |
-
of the nanopores and random potential.
|
120 |
-
- source_sentence: Competition between unconventional superconductivity and incommensurate
|
121 |
-
antiferromagnetic order in CeRh1-xCoxIn5 Elastic neutron diffraction measurements
|
122 |
-
were performed on the quasi-two dimensional heavy fermion system CeRh1-xCoxIn5,
|
123 |
-
ranging from an incommensurate antiferromagnet for low x to an unconventional
|
124 |
-
superconductor on the Co-rich end of the phase diagram. We found that the superconductivity
|
125 |
-
competes with the incommensurate antiferromagnetic (AFM) order characterized by
|
126 |
-
qI (1 2, 1 2, delta) with delta 0.298, while it coexists with the commensurate
|
127 |
-
AFM order with qc (1 2, 1 2, 1 2). This is in sharp contrast to the CeRh1-xIrxIn5
|
128 |
-
system, where both the commensurate and incommensurate magnetic orders coexist
|
129 |
-
with the superconductivity. These results reveal that particular areas on the
|
130 |
-
Fermi surface nested by qI play an active role in forming the superconducting
|
131 |
-
state in CeCoIn5.
|
132 |
-
sentences:
|
133 |
-
- 'Existence and convergence properties of physical measures for certain dynamical
|
134 |
-
systems with holes We study two classes of dynamical systems with holes: expanding
|
135 |
-
maps of the interval and Collet-Eckmann maps with singularities. In both cases,
|
136 |
-
we prove that there is a natural absolutely continuous conditionally invariant
|
137 |
-
measure mu (a.c.c.i.m.) with the physical property that strictly positive H o
|
138 |
-
lder continuous functions converge to the density of mu under the renormalized
|
139 |
-
dynamics of the system. In addition, we construct an invariant measure nu ,
|
140 |
-
supported on the Cantor set of points that never escape from the system, that
|
141 |
-
is ergodic and enjoys exponential decay of correlations for H o lder observables.
|
142 |
-
We show that nu satisfies an equilibrium principle which implies that the escape
|
143 |
-
rate formula, familiar to the thermodynamic formalism, holds outside the usual
|
144 |
-
setting. In particular, it holds for Collet-Eckmann maps with holes, which are
|
145 |
-
not uniformly hyperbolic and do not admit a finite Markov partition. We use a
|
146 |
-
general framework of Young towers with holes and first prove results about the accim
|
147 |
-
and the invariant measure on the tower. Then we show how to transfer results to
|
148 |
-
the original dynamical system. This approach can be expected to generalize to
|
149 |
-
other dynamical systems than the two above classes.'
|
150 |
-
- New results of intersection numbers on moduli spaces of curves We present a series
|
151 |
-
of new results we obtained recently about the intersection numbers of tautological
|
152 |
-
classes on moduli spaces of curves, including a simple formula of the n-point
|
153 |
-
functions for Witten s tau classes, an effective recursion formula to compute
|
154 |
-
higher Weil-Petersson volumes, several new recursion formulae of intersection
|
155 |
-
numbers and our proof of a conjecture of Itzykson and Zuber concerning denominators
|
156 |
-
of intersection numbers. We also present Virasoro and KdV properties of generating
|
157 |
-
functions of general mixed kappa and psi intersections.
|
158 |
-
- Algebraic charge liquids High temperature superconductivity emerges in the cuprate
|
159 |
-
compounds upon changing the electron density of an insulator in which the electron
|
160 |
-
spins are antiferromagnetically ordered. A key characteristic of the superconductor
|
161 |
-
is that electrons can be extracted from them at zero energy only if their momenta
|
162 |
-
take one of four specific values (the nodal points ). A central enigma has been
|
163 |
-
the evolution of the zero energy electrons in the metallic state between the antiferromagnet
|
164 |
-
and the superconductor, and recent experiments yield apparently contradictory
|
165 |
-
results. The oscillation of the resistance in this metal as a function of magnetic
|
166 |
-
field indicate that the zero energy electrons carry momenta which lie on elliptical Fermi
|
167 |
-
pockets , while ejection of electrons by high intensity light indicates that the
|
168 |
-
zero energy electrons have momenta only along arc-like regions. We present a theory
|
169 |
-
of new states of matter, which we call algebraic charge liquids , which arise
|
170 |
-
naturally between the antiferromagnet and the superconductor, and reconcile these
|
171 |
-
observations. Our theory also explains a puzzling dependence of the density of
|
172 |
-
superconducting electrons on the total electron density, and makes a number of
|
173 |
-
unique predictions for future experiments.
|
174 |
-
- source_sentence: Detecting Directional Selection from the Polymorphism Frequency
|
175 |
-
Spectrum The distribution of genetic polymorphisms in a population contains information
|
176 |
-
about the mutation rate and the strength of natural selection at a locus. Here,
|
177 |
-
we show that the Poisson Random Field (PRF) method of population-genetic inference
|
178 |
-
suffers from systematic biases that tend to underestimate selection pressures
|
179 |
-
and mutation rates, and that erroneously infer positive selection. These problems
|
180 |
-
arise from the infinite-sites approximation inherent in the PRF method. We introduce
|
181 |
-
three new inference techniques that correct these problems. We present a finite-site
|
182 |
-
modification of the PRF method, as well as two new methods for inferring selection
|
183 |
-
pressures and mutation rates based on diffusion models. Our methods can be used
|
184 |
-
to infer not only a weighted average of selection pressures acting on a gene
|
185 |
-
sequence, but also the distribution of selection pressures across sites. We evaluate
|
186 |
-
the accuracy of our methods, as well that of the original PRF approach, by comparison
|
187 |
-
with Wright-Fisher simulations.
|
188 |
-
sentences:
|
189 |
-
- Changeover from Glassy ferromagnetism of the orbital domain state to long range
|
190 |
-
ferromagnetic ordering in La 0.9 Sr 0.1 MnO 3 An attempt is made to resolve
|
191 |
-
the controversy related to the low temperature phase (ground state) of the low-doped
|
192 |
-
ferromagnetic (FM)- insulator(I) manganite through bulk magnetic measurements
|
193 |
-
on La 0.9 Sr 0.1 MnO 3 sample. It is shown that the FM phase, formed
|
194 |
-
out of well defined transition in the low-doped system, becomes inhomogeneous
|
195 |
-
with decrease in temperature. This inhomogeniety is considered to be an outcome
|
196 |
-
of the formation of orbital domain state of e g -electrons having hole rich (metallic)
|
197 |
-
walls separating the hole deficient (insulating) regions. The resulting complexity
|
198 |
-
brings in metastability and glassy behaviour within the FM phase at low temperature,
|
199 |
-
however, with no resemblance to spin glass, cluster glass or reentrant phases.
|
200 |
-
It shows ageing effect without memory but magnetic relaxation shows signatures
|
201 |
-
of inter-cluster interaction. The energy landscape picture of this glassy phase
|
202 |
-
is described in terms of hierarchical model. Further, it is shown that this inhomogeneity
|
203 |
-
disappear in La 0.9 Sr 0.1 MnO 3.08 where, the orbital domain state
|
204 |
-
is destroyed by self doping resulting in reduction of Mn 3 and hence e g
|
205 |
-
-electrons. The ferromagnetic phase of the non-stoichiometric sample, does not
|
206 |
-
show glassy behaviour. It neither follows hierarchical model nor droplet model generally
|
207 |
-
used to explain glassy or inhomogeneous systems. Its magnetic response can be
|
208 |
-
explained simply from the domain wall dynamics of otherwise homogeneous ferromagnet.
|
209 |
-
- Additional Symmetry of CKP hierarchy Based on the Orlov and Shulman s M operator,
|
210 |
-
the additional symmetries and the string equation of the CKP hierarchy are established,
|
211 |
-
and then the higher order constraints on L l are obtained. In addition, the
|
212 |
-
generating function and some properties are also given. In particular, the additional
|
213 |
-
symmetry flows form a new infinite dimensional algebra W C 1 infty , which
|
214 |
-
is a subalgebra of W 1 infty .
|
215 |
-
- Selection Against Demographic Stochasticity in Age-Structured Populations It has
|
216 |
-
been shown that differences in fecundity variance can influence the probability
|
217 |
-
of invasion of a genotype in a population, i.e. a genotype with lower variance
|
218 |
-
in offspring number can be favored in finite populations even if it has a somewhat
|
219 |
-
lower mean fitness than a competitor. In this paper, Gillespie s results are extended
|
220 |
-
to population genetic systems with explicit age structure, where the demographic
|
221 |
-
variance (variance in growth rate) calculated in the work of Engen and colleagues
|
222 |
-
is used as a generalization of variance in offspring number to predict the interaction
|
223 |
-
between deterministic and random forces driving change in allele frequency. By
|
224 |
-
calculating the variance from the life history parameters, it is shown that selection
|
225 |
-
against variance in the growth rate will favor a genotypes with lower stochasticity
|
226 |
-
in age specific survival and fertility rates. A diffusion approximation for selection
|
227 |
-
and drift in a population with two genotypes with different life history matrices
|
228 |
-
(and therefore, different growth rates and demographic variances) is derived and
|
229 |
-
shown to be consistent with individual based simulations. It is also argued that
|
230 |
-
for finite populations, perturbation analyses of both the growth rate and demographic
|
231 |
-
variances may be necessary to determine the sensitivity of fitness (broadly
|
232 |
-
defined) to changes in the life history parameters.
|
233 |
-
pipeline_tag: sentence-similarity
|
234 |
-
library_name: sentence-transformers
|
235 |
---
|
236 |
|
237 |
-
#
|
238 |
-
|
239 |
-
This is a [sentence-transformers](https://www.SBERT.net) model fine-tuned from [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2). It maps sentences and paragraphs to a 384-dimensional dense vector space and can be used for semantic textual similarity, semantic search, paraphrase mining, text classification, clustering, and more.
|
240 |
|
241 |
## Model Details
|
242 |
|
243 |
-
|
244 |
-
- **Model Type:** Sentence Transformer
|
245 |
-
- **Base model:** [sentence-transformers/all-MiniLM-L6-v2](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2) <!-- at revision c9745ed1d9f207416be6d2e6f8de32d1f16199bf -->
|
246 |
-
- **Maximum Sequence Length:** 256 tokens
|
247 |
-
- **Output Dimensionality:** 384 dimensions
|
248 |
-
- **Similarity Function:** Cosine Similarity
|
249 |
-
<!-- - **Training Dataset:** Unknown -->
|
250 |
-
<!-- - **Language:** Unknown -->
|
251 |
-
<!-- - **License:** Unknown -->
|
252 |
-
|
253 |
-
### Model Sources
|
254 |
-
|
255 |
-
- **Documentation:** [Sentence Transformers Documentation](https://sbert.net)
|
256 |
-
- **Repository:** [Sentence Transformers on GitHub](https://github.com/UKPLab/sentence-transformers)
|
257 |
-
- **Hugging Face:** [Sentence Transformers on Hugging Face](https://huggingface.co/models?library=sentence-transformers)
|
258 |
-
|
259 |
-
### Full Model Architecture
|
260 |
-
|
261 |
-
```
|
262 |
-
SentenceTransformer(
|
263 |
-
(0): Transformer({'max_seq_length': 256, 'do_lower_case': False}) with Transformer model: BertModel
|
264 |
-
(1): Pooling({'word_embedding_dimension': 384, 'pooling_mode_cls_token': False, 'pooling_mode_mean_tokens': True, 'pooling_mode_max_tokens': False, 'pooling_mode_mean_sqrt_len_tokens': False, 'pooling_mode_weightedmean_tokens': False, 'pooling_mode_lasttoken': False, 'include_prompt': True})
|
265 |
-
(2): Normalize()
|
266 |
-
)
|
267 |
-
```
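The pooling and normalization stages listed in the architecture can be illustrated with a small pure-Python sketch. It assumes the token embeddings are already available as lists of floats and that an attention mask marks padding positions. The names `mean_pool` and `l2_normalize` are hypothetical helpers for illustration and are not part of the Sentence Transformers API.

```python
import math

def mean_pool(token_embeddings, attention_mask):
    """Average token vectors, ignoring positions where the mask is 0."""
    dim = len(token_embeddings[0])
    summed = [0.0] * dim
    count = 0
    for vector, keep in zip(token_embeddings, attention_mask):
        if keep:
            count += 1
            for i, value in enumerate(vector):
                summed[i] += value
    # Guard against an all-padding input to avoid division by zero.
    count = max(count, 1)
    return [value / count for value in summed]

def l2_normalize(vector, eps=1e-12):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(value * value for value in vector))
    return [value / max(norm, eps) for value in vector]

# Toy input: three token vectors (the last one is padding), using 2 dimensions
# instead of the model's 384 to keep the example readable.
tokens = [[1.0, 2.0], [3.0, 4.0], [100.0, 100.0]]
mask = [1, 1, 0]

pooled = mean_pool(tokens, mask)
sentence_embedding = l2_normalize(pooled)
```

Because the final vector has unit length, comparing two such embeddings with a plain dot product gives the same value as cosine similarity, which matches the similarity function listed for this model.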
|
268 |
-
|
269 |
-
## Usage
|
270 |
-
|
271 |
-
### Direct Usage (Sentence Transformers)
|
272 |
-
|
273 |
-
First install the Sentence Transformers library:
|
274 |
-
|
275 |
-
```bash
|
276 |
-
pip install -U sentence-transformers
|
277 |
-
```
|
278 |
-
|
279 |
-
Then you can load this model and run inference.
|
280 |
```python
|
281 |
from sentence_transformers import SentenceTransformer
|
282 |
|
283 |
-
|
284 |
-
model = SentenceTransformer("sentence_transformers_model_id")
|
285 |
-
# Run inference
|
286 |
-
sentences = [
|
287 |
-
'Detecting Directional Selection from the Polymorphism Frequency Spectrum The distribution of genetic polymorphisms in a population contains information about the mutation rate and the strength of natural selection at a locus. Here, we show that the Poisson Random Field (PRF) method of population-genetic inference suffers from systematic biases that tend to underestimate selection pressures and mutation rates, and that erroneously infer positive selection. These problems arise from the infinite-sites approximation inherent in the PRF method. We introduce three new inference techniques that correct these problems. We present a finite-site modification of the PRF method, as well as two new methods for inferring selection pressures and mutation rates based on diffusion models. Our methods can be used to infer not only a weighted average of selection pressures acting on a gene sequence, but also the distribution of selection pressures across sites. We evaluate the accuracy of our methods, as well that of the original PRF approach, by comparison with Wright-Fisher simulations.',
|
288 |
-
'Selection Against Demographic Stochasticity in Age-Structured Populations It has been shown that differences in fecundity variance can influence the probability of invasion of a genotype in a population, i.e. a genotype with lower variance in offspring number can be favored in finite populations even if it has a somewhat lower mean fitness than a competitor. In this paper, Gillespie s results are extended to population genetic systems with explicit age structure, where the demographic variance (variance in growth rate) calculated in the work of Engen and colleagues is used as a generalization of variance in offspring number to predict the interaction between deterministic and random forces driving change in allele frequency. By calculating the variance from the life history parameters, it is shown that selection against variance in the growth rate will favor a genotypes with lower stochasticity in age specific survival and fertility rates. A diffusion approximation for selection and drift in a population with two genotypes with different life history matrices (and therefore, different growth rates and demographic variances) is derived and shown to be consistent with individual based simulations. It is also argued that for finite populations, perturbation analyses of both the growth rate and demographic variances may be necessary to determine the sensitivity of fitness (broadly defined) to changes in the life history parameters.',
|
289 |
-
'Additional Symmetry of CKP hierarchy Based on the Orlov and Shulman s M operator, the additional symmetries and the string equation of the CKP hierarchy are established, and then the higher order constraints on L l are obtained. In addition, the generating function and some properties are also given. In particular, the additional symmetry flows form a new infinite dimensional algebra W C 1 infty , which is a subalgebra of W 1 infty .',
|
290 |
-
]
|
291 |
-
embeddings = model.encode(sentences)
|
292 |
-
print(embeddings.shape)
|
293 |
-
# [3, 384]
|
294 |
-
|
295 |
-
# Get the similarity scores for the embeddings
|
296 |
-
similarities = model.similarity(embeddings, embeddings)
|
297 |
-
print(similarities.shape)
|
298 |
-
# [3, 3]
|
299 |
-
```
|
300 |
-
|
301 |
-
<!--
|
302 |
-
### Direct Usage (Transformers)
|
303 |
-
|
304 |
-
<details><summary>Click to see the direct usage in Transformers</summary>
|
305 |
-
|
306 |
-
</details>
|
307 |
-
-->
|
308 |
-
|
309 |
-
<!--
|
310 |
-
### Downstream Usage (Sentence Transformers)
|
311 |
-
|
312 |
-
You can finetune this model on your own dataset.
|
313 |
-
|
314 |
-
<details><summary>Click to expand</summary>
|
315 |
-
|
316 |
-
</details>
|
317 |
-
-->
|
318 |
-
|
319 |
-
<!--
|
320 |
-
### Out-of-Scope Use
|
321 |
-
|
322 |
-
*List how the model may foreseeably be misused and address what users ought not to do with the model.*
|
323 |
-
-->
|
324 |
-
|
325 |
-
<!--
|
326 |
-
## Bias, Risks and Limitations
|
327 |
|
328 |
-
|
329 |
-
|
330 |
|
331 |
-
|
332 |
-
### Recommendations
|
333 |
-
|
334 |
-
*What are recommendations with respect to the foreseeable issues? For example, filtering explicit content.*
|
335 |
-
-->
|
336 |
-
|
337 |
-
## Training Details
|
338 |
-
|
339 |
-
### Training Dataset
|
340 |
-
|
341 |
-
#### Unnamed Dataset
|
342 |
-
|
343 |
-
* Size: 11,358 training samples
|
344 |
-
* Columns: <code>sentence_0</code> and <code>sentence_1</code>
|
345 |
-
* Approximate statistics based on the first 1000 samples:
|
346 |
-
| | sentence_0 | sentence_1 |
|
347 |
-
|:--------|:-------------------------------------------------------------------------------------|:-------------------------------------------------------------------------------------|
|
348 |
-
| type | string | string |
|
349 |
-
| details | <ul><li>min: 23 tokens</li><li>mean: 157.04 tokens</li><li>max: 256 tokens</li></ul> | <ul><li>min: 22 tokens</li><li>mean: 158.44 tokens</li><li>max: 256 tokens</li></ul> |
|
350 |
-
* Samples:
|
351 |
-
| sentence_0 | sentence_1 |
|
352 |
-
|:-----------|:-----------|
|
353 |
-
| <code>Universal scaling of current fluctuations in disordered graphene We analyze the full transport statistics of graphene with smooth disorder at low dopings. First we consider the case of 1D disorder for which the transmission probability distribution is given analytically in terms of the graphene-specific mean free path. All current cumulants are shown to scale with system parameters (doping, size, disorder strength and correlation length) in an identical fashion for large enough systems. In the case of 2D disorder, numerical evidence is given for the same kind of identical scaling of all current cumulants, so that the ratio of any two such cumulants is universal. Specific universal values are given for the Fano factor, which is smaller than the pseudodiffusive value of ballistic graphene (F 1 3) both for 1D (F 0.243) and 2D (F 0.295) disorder. On the other hand, conductivity in wide samples is shown to grow without saturation as sqrt L and Log L with system length L in the 1D and 2D c...</code> | <code>Levitation and percolation in quantum Hall systems with correlated disorder We investigate the integer quantum Hall system in a two dimensional lattice model with spatially correlated disorder by using the efficient method to calculate the Chern number proposed by Fukui textit et al . Distribution of charge density indicates that the extended states at the center of each Landau band have percolating current paths, which are topologically equivalent to the edge states that exist in a system with boundaries. As increasing the strength of disorder, floating feature is observed in an averaged Hall conductance as a function of filling factor. Its relation to the observed experiments is also discussed.</code> |
|
354 |
-
| <code>Tautological relations in Hodge field theory We propose a Hodge field theory construction that captures algebraic properties of the reduction of Zwiebach invariants to Gromov-Witten invariants. It generalizes the Barannikov-Kontsevich construction to the case of higher genera correlators with gravitational descendants. We prove the main theorem stating that algebraically defined Hodge field theory correlators satisfy all tautological relations. From this perspective the statement that Barannikov-Kontsevich construction provides a solution of the WDVV equation looks as the simplest particular case of our theorem. Also it generalizes the particular cases of other low-genera tautological relations proven in our earlier works; we replace the old technical proofs by a novel conceptual proof.</code> | <code>Equivariant Lefschetz number of differential operators Let G be a compact Lie group acting on a compact complex manifold M . We prove a trace density formula for the G -Lefschetz number of a differential operator on M . We generalize Engeli and Felder s recent results to orbifolds.</code> |
|
355 |
-
| <code>Precision Test of Mass Ratio Variations with Lattice-Confined Ultracold Molecules We propose a precision measurement of time variations of the proton-electron mass ratio using ultracold molecules in an optical lattice. Vibrational energy intervals are sensitive to changes of the mass ratio. In contrast to measurements that use hyperfine-interval-based atomic clocks, the scheme discussed here is model-independent and does not require separation of time variations of different physical constants. The possibility of applying the zero-differential-Stark-shift optical lattice technique is explored to measure vibrational transitions at high accuracy.</code> | <code>Production of high energy particles in laser and Coulomb fields and e e - antenna A strong laser field and the Coulomb field of a nucleus can produce e e - pairs. It is shown for the first time that there is a large probability that electrons and positrons created in this process collide after one or several oscillations of the laser field. These collisions can take place at high energy resulting in several phenomena. The quasielastic collision e e - - e e - allows acceleration of leptons in the laser field to higher energies. The inelastic collisions allow production of high energy photons e e - - 2 gamma and muons, e e - - mu mu - . The yield of high-energy photons and muons produced via this mechanism exceeds exponentially their production through conventional direct creation in laser and Coulomb fields. A relation of the phenomena considered with the antenna-mechanism of multiphoton absorption in atoms is discussed.</code> |
|
356 |
-
* Loss: [<code>MultipleNegativesRankingLoss</code>](https://sbert.net/docs/package_reference/sentence_transformer/losses.html#multiplenegativesrankingloss) with these parameters:
|
357 |
-
```json
|
358 |
-
{
|
359 |
-
"scale": 20.0,
|
360 |
-
"similarity_fct": "cos_sim"
|
361 |
-
}
|
362 |
-
```
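The two parameters above can be read as follows: each anchor is scored against every positive in the batch by cosine similarity, the scores are multiplied by `scale`, and a cross-entropy loss treats the matching positive as the correct class. The NumPy sketch below illustrates that computation only; `mnr_loss` is a hypothetical name and this is not the library's implementation.

```python
import numpy as np

def mnr_loss(anchor_emb, positive_emb, scale=20.0):
    """Illustrative in-batch-negatives ranking loss with cosine similarity."""
    # L2-normalise rows so the dot product equals cosine similarity
    a = anchor_emb / np.linalg.norm(anchor_emb, axis=1, keepdims=True)
    p = positive_emb / np.linalg.norm(positive_emb, axis=1, keepdims=True)
    logits = scale * (a @ p.T)  # shape: (batch, batch)
    # numerically stable log-softmax over each row
    logits = logits - logits.max(axis=1, keepdims=True)
    log_probs = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # the i-th positive is the correct "class" for the i-th anchor
    return float(-np.mean(np.diag(log_probs)))

# Toy batch: perfectly matched pairs versus deliberately mismatched pairs
anchors = np.eye(3)
matched = mnr_loss(anchors, np.eye(3))
mismatched = mnr_loss(anchors, np.eye(3)[::-1])
print(matched, mismatched)
```

With matched pairs the loss is close to zero; reversing the positives so that two of the three anchors face the wrong partner drives it up sharply.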
|
363 |
-
|
364 |
-
### Training Hyperparameters
|
365 |
-
#### Non-Default Hyperparameters
|
366 |
-
|
367 |
-
- `per_device_train_batch_size`: 20
|
368 |
-
- `per_device_eval_batch_size`: 20
|
369 |
-
- `num_train_epochs`: 10
|
370 |
-
- `multi_dataset_batch_sampler`: round_robin
|
371 |
-
|
372 |
-
#### All Hyperparameters
|
373 |
-
<details><summary>Click to expand</summary>
|
374 |
-
|
375 |
-
- `overwrite_output_dir`: False
|
376 |
-
- `do_predict`: False
|
377 |
-
- `eval_strategy`: no
|
378 |
-
- `prediction_loss_only`: True
|
379 |
-
- `per_device_train_batch_size`: 20
|
380 |
-
- `per_device_eval_batch_size`: 20
|
381 |
-
- `per_gpu_train_batch_size`: None
|
382 |
-
- `per_gpu_eval_batch_size`: None
|
383 |
-
- `gradient_accumulation_steps`: 1
|
384 |
-
- `eval_accumulation_steps`: None
|
385 |
-
- `torch_empty_cache_steps`: None
|
386 |
-
- `learning_rate`: 5e-05
|
387 |
-
- `weight_decay`: 0.0
|
388 |
-
- `adam_beta1`: 0.9
|
389 |
-
- `adam_beta2`: 0.999
|
390 |
-
- `adam_epsilon`: 1e-08
|
391 |
-
- `max_grad_norm`: 1
|
392 |
-
- `num_train_epochs`: 10
|
393 |
-
- `max_steps`: -1
|
394 |
-
- `lr_scheduler_type`: linear
|
395 |
-
- `lr_scheduler_kwargs`: {}
|
396 |
-
- `warmup_ratio`: 0.0
|
397 |
-
- `warmup_steps`: 0
|
398 |
-
- `log_level`: passive
|
399 |
-
- `log_level_replica`: warning
|
400 |
-
- `log_on_each_node`: True
|
401 |
-
- `logging_nan_inf_filter`: True
|
402 |
-
- `save_safetensors`: True
|
403 |
-
- `save_on_each_node`: False
|
404 |
-
- `save_only_model`: False
|
405 |
-
- `restore_callback_states_from_checkpoint`: False
|
406 |
-
- `no_cuda`: False
|
407 |
-
- `use_cpu`: False
|
408 |
-
- `use_mps_device`: False
|
409 |
-
- `seed`: 42
|
410 |
-
- `data_seed`: None
|
411 |
-
- `jit_mode_eval`: False
|
412 |
-
- `use_ipex`: False
|
413 |
-
- `bf16`: False
|
414 |
-
- `fp16`: False
|
415 |
-
- `fp16_opt_level`: O1
|
416 |
-
- `half_precision_backend`: auto
|
417 |
-
- `bf16_full_eval`: False
|
418 |
-
- `fp16_full_eval`: False
|
419 |
-
- `tf32`: None
|
420 |
-
- `local_rank`: 0
|
421 |
-
- `ddp_backend`: None
|
422 |
-
- `tpu_num_cores`: None
|
423 |
-
- `tpu_metrics_debug`: False
|
424 |
-
- `debug`: []
|
425 |
-
- `dataloader_drop_last`: False
|
426 |
-
- `dataloader_num_workers`: 0
|
427 |
-
- `dataloader_prefetch_factor`: None
|
428 |
-
- `past_index`: -1
|
429 |
-
- `disable_tqdm`: False
|
430 |
-
- `remove_unused_columns`: True
|
431 |
-
- `label_names`: None
|
432 |
-
- `load_best_model_at_end`: False
|
433 |
-
- `ignore_data_skip`: False
|
434 |
-
- `fsdp`: []
|
435 |
-
- `fsdp_min_num_params`: 0
|
436 |
-
- `fsdp_config`: {'min_num_params': 0, 'xla': False, 'xla_fsdp_v2': False, 'xla_fsdp_grad_ckpt': False}
|
437 |
-
- `fsdp_transformer_layer_cls_to_wrap`: None
|
438 |
-
- `accelerator_config`: {'split_batches': False, 'dispatch_batches': None, 'even_batches': True, 'use_seedable_sampler': True, 'non_blocking': False, 'gradient_accumulation_kwargs': None}
|
439 |
-
- `deepspeed`: None
|
440 |
-
- `label_smoothing_factor`: 0.0
|
441 |
-
- `optim`: adamw_torch
|
442 |
-
- `optim_args`: None
|
443 |
-
- `adafactor`: False
|
444 |
-
- `group_by_length`: False
|
445 |
-
- `length_column_name`: length
|
446 |
-
- `ddp_find_unused_parameters`: None
|
447 |
-
- `ddp_bucket_cap_mb`: None
|
448 |
-
- `ddp_broadcast_buffers`: False
|
449 |
-
- `dataloader_pin_memory`: True
|
450 |
-
- `dataloader_persistent_workers`: False
|
451 |
-
- `skip_memory_metrics`: True
|
452 |
-
- `use_legacy_prediction_loop`: False
|
453 |
-
- `push_to_hub`: False
|
454 |
-
- `resume_from_checkpoint`: None
|
455 |
-
- `hub_model_id`: None
|
456 |
-
- `hub_strategy`: every_save
|
457 |
-
- `hub_private_repo`: None
|
458 |
-
- `hub_always_push`: False
|
459 |
-
- `gradient_checkpointing`: False
|
460 |
-
- `gradient_checkpointing_kwargs`: None
|
461 |
-
- `include_inputs_for_metrics`: False
|
462 |
-
- `include_for_metrics`: []
|
463 |
-
- `eval_do_concat_batches`: True
|
464 |
-
- `fp16_backend`: auto
|
465 |
-
- `push_to_hub_model_id`: None
|
466 |
-
- `push_to_hub_organization`: None
|
467 |
-
- `mp_parameters`:
|
468 |
-
- `auto_find_batch_size`: False
|
469 |
-
- `full_determinism`: False
|
470 |
-
- `torchdynamo`: None
|
471 |
-
- `ray_scope`: last
|
472 |
-
- `ddp_timeout`: 1800
|
473 |
-
- `torch_compile`: False
|
474 |
-
- `torch_compile_backend`: None
|
475 |
-
- `torch_compile_mode`: None
|
476 |
-
- `dispatch_batches`: None
|
477 |
-
- `split_batches`: None
|
478 |
-
- `include_tokens_per_second`: False
|
479 |
-
- `include_num_input_tokens_seen`: False
|
480 |
-
- `neftune_noise_alpha`: None
|
481 |
-
- `optim_target_modules`: None
|
482 |
-
- `batch_eval_metrics`: False
|
483 |
-
- `eval_on_start`: False
|
484 |
-
- `use_liger_kernel`: False
|
485 |
-
- `eval_use_gather_object`: False
|
486 |
-
- `average_tokens_across_devices`: False
|
487 |
-
- `prompts`: None
|
488 |
-
- `batch_sampler`: batch_sampler
|
489 |
-
- `multi_dataset_batch_sampler`: round_robin
|
490 |
-
|
491 |
-
</details>
|
492 |
-
|
493 |
-
### Training Logs
|
494 |
-
| Epoch | Step | Training Loss |
|
495 |
-
|:------:|:----:|:-------------:|
|
496 |
-
| 0.8803 | 500 | 1.5123 |
|
497 |
-
| 1.7606 | 1000 | 1.179 |
|
498 |
-
| 2.6408 | 1500 | 1.0416 |
|
499 |
-
| 3.5211 | 2000 | 0.9197 |
|
500 |
-
| 4.4014 | 2500 | 0.833 |
|
501 |
-
| 5.2817 | 3000 | 0.7519 |
|
502 |
-
| 6.1620 | 3500 | 0.6842 |
|
503 |
-
| 7.0423 | 4000 | 0.6436 |
|
504 |
-
| 7.9225 | 4500 | 0.5913 |
|
505 |
-
| 8.8028 | 5000 | 0.5608 |
|
506 |
-
| 9.6831 | 5500 | 0.5436 |
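The Epoch and Step columns together imply the number of optimizer steps per epoch, and from that a rough size for the training set. The sketch below works this out under stated assumptions: a single training device and `gradient_accumulation_steps` of 1 (the latter is listed above; the device count is not stated anywhere in this card).

```python
# (epoch, step) pairs copied from the Training Logs table
logged = [
    (0.8803, 500), (1.7606, 1000), (2.6408, 1500), (3.5211, 2000),
    (4.4014, 2500), (5.2817, 3000), (6.1620, 3500), (7.0423, 4000),
    (7.9225, 4500), (8.8028, 5000), (9.6831, 5500),
]

# Every row should imply the same number of steps per epoch
estimates = {round(step / epoch) for epoch, step in logged}
assert len(estimates) == 1
steps_per_epoch = estimates.pop()

batch_size = 20  # per_device_train_batch_size; ASSUMES one device
# dataloader_drop_last is False, so the final batch may be partial
max_pairs = steps_per_epoch * batch_size
min_pairs = (steps_per_epoch - 1) * batch_size + 1

print(steps_per_epoch, min_pairs, max_pairs)
```

If training ran on several devices, the implied number of training pairs scales with the device count.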
|
507 |
-
|
508 |
-
|
509 |
-
### Framework Versions
|
510 |
-
- Python: 3.11.11
|
511 |
-
- Sentence Transformers: 3.4.1
|
512 |
-
- Transformers: 4.48.3
|
513 |
-
- PyTorch: 2.5.1+cu124
|
514 |
-
- Accelerate: 1.3.0
|
515 |
-
- Datasets: 3.3.2
|
516 |
-
- Tokenizers: 0.21.0
|
517 |
-
|
518 |
-
## Citation
|
519 |
-
|
520 |
-
### BibTeX
|
521 |
-
|
522 |
-
#### Sentence Transformers
|
523 |
-
```bibtex
|
524 |
-
@inproceedings{reimers-2019-sentence-bert,
|
525 |
-
title = "Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks",
|
526 |
-
author = "Reimers, Nils and Gurevych, Iryna",
|
527 |
-
booktitle = "Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing",
|
528 |
-
month = "11",
|
529 |
-
year = "2019",
|
530 |
-
publisher = "Association for Computational Linguistics",
|
531 |
-
url = "https://arxiv.org/abs/1908.10084",
|
532 |
-
}
|
533 |
-
```
|
534 |
-
|
535 |
-
#### MultipleNegativesRankingLoss
|
536 |
-
```bibtex
|
537 |
-
@misc{henderson2017efficient,
|
538 |
-
title={Efficient Natural Language Response Suggestion for Smart Reply},
|
539 |
-
author={Matthew Henderson and Rami Al-Rfou and Brian Strope and Yun-hsuan Sung and Laszlo Lukacs and Ruiqi Guo and Sanjiv Kumar and Balint Miklos and Ray Kurzweil},
|
540 |
-
year={2017},
|
541 |
-
eprint={1705.00652},
|
542 |
-
archivePrefix={arXiv},
|
543 |
-
primaryClass={cs.CL}
|
544 |
-
}
|
545 |
```
|
546 |
|
547 |
-
|
548 |
-
|
549 |
-
|
550 |
-
|
551 |
-
|
552 |
-
|
553 |
-
|
554 |
-
|
555 |
-
|
556 |
-
|
557 |
-
|
558 |
-
|
559 |
-
|
560 |
-
|
561 |
-
|
562 |
-
*Provides a way for people who have updates to the Model Card, suggestions, or questions, to contact the Model Card authors.*
|
563 |
-
-->
|
1 |
---
|
2 |
+
language: "en"
|
3 |
+
license: "apache-2.0"
|
4 |
tags:
|
5 |
+
- semantic-search
|
6 |
+
- research-papers
|
7 |
+
- arxiv
|
8 |
+
- sbert
|
9 |
+
model_name: "Fine-Tuned Semantic Search Model (arXiv Papers)"
|
10 |
+
base_model: "sentence-transformers/all-MiniLM-L6-v2"
|
11 |
+
datasets:
|
12 |
+
- "arxiv_community/arxiv_dataset"
|
13 |
---
|
14 |
|
15 |
+
# arxiv-search
|
16 |
+
This model is a fine-tuned version of [`all-MiniLM-L6-v2`](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2), trained on **arXiv research papers** to perform **semantic similarity search**.
|
17 |
|
18 |
## Model Details
|
19 |
+
- **Base Model:** `sentence-transformers/all-MiniLM-L6-v2`
|
20 |
+
- **Training Data:** arXiv research papers (`title + abstract`)
|
21 |
+
- **Fine-Tuned Task:** Semantic Search
|
22 |
+
- **Use Case:** Find **similar research papers** based on a query
|
23 |
+
- **License:** Apache 2.0
|
24 |
|
25 |
+
## How to Use
|
26 |
```python
|
27 |
from sentence_transformers import SentenceTransformer
|
28 |
|
29 |
+
model = SentenceTransformer("Talina06/arxiv-search")
|
30 |
|
31 |
+
query = "Neural networks in medicine"
|
32 |
+
query_embedding = model.encode(query)
|
33 |
|
34 |
+
# Use FAISS or cosine similarity to retrieve similar papers
|
35 |
```
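The comment in the snippet above leaves the retrieval step open. A minimal sketch of the cosine-similarity option follows, assuming the paper embeddings are already stacked in a NumPy array. `top_k_cosine` is a hypothetical helper, not part of Sentence Transformers, and the toy 3-dimensional vectors stand in for real `model.encode(...)` outputs.

```python
import numpy as np

def top_k_cosine(query_embedding, paper_embeddings, k=3):
    """Return (indices, scores) of the k papers most similar to the query."""
    q = np.asarray(query_embedding, dtype=np.float32)
    p = np.asarray(paper_embeddings, dtype=np.float32)
    # Normalise so that the dot product equals cosine similarity
    q = q / np.linalg.norm(q)
    p = p / np.linalg.norm(p, axis=1, keepdims=True)
    scores = p @ q
    order = np.argsort(-scores)[:k]  # highest similarity first
    return order, scores[order]

# Toy vectors; in practice these would come from model.encode(...)
paper_embeddings = np.array([
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.7, 0.7, 0.0],
])
query_embedding = np.array([0.9, 0.1, 0.0])

indices, scores = top_k_cosine(query_embedding, paper_embeddings, k=2)
print(indices, scores)
```

A brute-force scan like this is usually adequate for small collections; for larger ones an approximate nearest-neighbour index such as FAISS may be a better fit.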
|
36 |
|
37 |
+
## Training Details
|
38 |
+
- **Training Data:** 100k+ arXiv research papers
|
39 |
+
- **Training Framework:** Sentence Transformers
|
40 |
+
- **Hyperparameters:**
|
41 |
+
- Learning Rate: `2e-5`
|
42 |
+
- Batch Size: `100`
|
43 |
+
- Epochs: `10`
|
44 |
+
- **Hardware Used:** TPU & GPU
|
45 |
+
|
46 |
+
|
47 |
+
## Example Search Results
|
48 |
+
| **Query** | **Top Matching Paper Title** | **Similarity Score** |
|
49 |
+
|----------|------------------------------|----------------------|
|
50 |
+
| "Neural networks in healthcare" | "Deep Learning for Medical Diagnosis" | 0.89 |
|
51 |
+
| "Quantum cryptography" | "A Survey on Quantum-Safe Encryption" | 0.87 |
|