arxiv-search

This model is a fine-tuned version of all-MiniLM-L6-v2, trained on Arxiv research papers to perform semantic similarity search.

Model Details

  • Base Model: sentence-transformers/all-MiniLM-L6-v2
  • Training Data: Arxiv Research Papers (title + abstract)
  • Fine-Tuned Task: Semantic Search
  • Use Case: Find similar research papers based on a query
  • License: Apache 2.0

How to Use

from sentence_transformers import SentenceTransformer

model = SentenceTransformer("Talina06/arxiv-search")

query = "Neural networks in medicine"
query_embedding = model.encode(query)

# Use FAISS or cosine similarity to retrieve similar papers

Training Details

  • Training Data: 100k+ Arxiv research papers
  • Training Framework: Sentence Transformers
  • Hyperparameters:
    • Learning Rate: 2e-5
    • Batch Size: 100
    • Epochs: 10
  • Hardware Used: TPU & GPU

Example Search Results

Query Top Matching Paper Title Similarity Score
"Neural networks in healthcare" "Deep Learning for Medical Diagnosis" 0.89
"Quantum cryptography" "A Survey on Quantum-Safe Encryption" 0.87
Downloads last month
40
Safetensors
Model size
22.7M params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for Talina06/arxiv-search

Finetuned
(257)
this model