Tom Aarsen
tomaarsen
AI & ML interests
NLP: text embeddings, information retrieval, named entity recognition, few-shot text classification
Recent Activity
updated
a model
about 3 hours ago
tomaarsen/reranker-msmarco-v1.1-ModernBERT-base-cmnrl
published
a model
about 3 hours ago
tomaarsen/reranker-msmarco-v1.1-ModernBERT-base-cmnrl
updated
a model
about 4 hours ago
tomaarsen/reranker-msmarco-v1.1-ModernBERT-base-bce
Organizations
tomaarsen's activity
Discrepancy in max tokens
2
#101 opened 5 days ago
by
KennethEnevoldsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/5ff5943752c26e9bc240bada/Exyzf3C_gJ2KdsL4K5_cq.png)
Entering on MTEB
4
#12 opened 12 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
Clarification regarding dimensions for gtr-t5-large embedding model
5
#3 opened 8 days ago
by
ksridhar-123
nan or 0.0 loss when training with flash attention
16
#59 opened 12 days ago
by
roadtoagi
![](https://cdn-avatars.huggingface.co/v1/production/uploads/677a6a5ab06a2c07ece49e9d/JUYG31uT4i0SuYrbK2k7y.jpeg)
Unable to load sentence transformer ( was previously working)
1
#98 opened 12 days ago
by
avifin19
Clean up README slightly
1
#7 opened 19 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
NaN values when input is longer than context window?
3
#11 opened 12 days ago
by
AHuguet
Add Sentence Transformers integration
5
#7 opened 22 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
Librarian Bot: Add language metadata for dataset
#2 opened 15 days ago
by
librarian-bot
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1674830754237-63d3e0e8ff1384ce6c5dd17d.jpeg)
Import fails on AWS lamba instance.
4
#55 opened 20 days ago
by
obeijbom
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6070c710227ff331937110ea/36xEaxRRjzXKQHDwiEF42.jpeg)
ModernBERT fails to work without FlashAttention !
3
#56 opened 18 days ago
by
benhachem
![](https://cdn-avatars.huggingface.co/v1/production/uploads/1672318259412-noauth.jpeg)
How to load ONNX version with CrossEncoder class?
1
#7 opened 19 days ago
by
hveigz
Update `base_model_relation` to `finetune`
#11 opened 19 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
Update `base_model_relation` to `finetune`
#2 opened 19 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
Update `base_model_relation` to `finetune`
#8 opened 19 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
Update `base_model_relation` to `finetune`
#10 opened 19 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
max_seq_length seems not to be properly reported in sentence_bert_config.json
1
#35 opened 19 days ago
by
yjoonjang
![](https://cdn-avatars.huggingface.co/v1/production/uploads/65a4c4ed2548c41ad9b1421c/bMQbowjHKvq-bKpzalvWm.jpeg)
Convert git-lfs md, py, json files to normal git files
1
#8 opened 21 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)
patch inference on CPU & Windows + Update README snippets
#2 opened 21 days ago
by
tomaarsen
![](https://cdn-avatars.huggingface.co/v1/production/uploads/6317233cc92fd6fee317e030/cJHSvvimr1kqgQfHOjO5n.png)