Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
48.7
TFLOPS
4
22
29
Nicolay Rusnachenko
nicolay-r
Follow
jeffersonqueiroz1's profile picture
SamuelMinouri's profile picture
Samazzy's profile picture
100 followers
Ā·
4 following
https://nicolay-r.github.io/
nicolayr_
nicolay-r
nicolay-r
AI & ML interests
Information Retrievalć»Medical Multimodal NLP (š¼+š) Research Fellow @BU_Researchć»software developer http://arekit.ioć»PhD in NLP
Recent Activity
reacted
to
mmhamdy
's
post
with š
about 10 hours ago
ā Evaluating Long Context #2: SCROLLS and ZeroSCROLLS In this series of posts about tracing the history of long context evaluation, we started with Long Range Arena (LRA). Introduced in 2020, Long Range Arens (LRA) is one of the earliest benchmarks designed to tackle the challenge of long context evaluation. But it wasn't introduced to evaluate LLMs, but rather the transformer architecture in general. š The SCROLLS benchmark, introduced in 2022, addresses this gap in NLP/LLM research. SCROLLS challenges models with tasks that require reasoning over extended sequences (according to 2022 standards). So, what does it offer? 1ļøā£ Long Text Focus: SCROLLS (unlike LRA) focus mainly on text and contain inputs with thousands of words, testing models' ability to synthesize information across lengthy documents. 2ļøā£ Diverse Tasks: Includes summarization, question answering, and natural language inference across domains like literature, science, and business. 3ļøā£ Unified Format: All datasets are available in a text-to-text format, facilitating easy evaluation and comparison of models. Building on SCROLLS, ZeroSCROLLS takes long text evaluation to the next level by focusing on zero-shot learning. Other features include: 1ļøā£ New Tasks: Introduces tasks like sentiment aggregation and sorting book chapter summaries. 2ļøā£ Leaderboard: A live leaderboard encourages continuous improvement and competition among researchers. š” What are some other landmark benchmarks in the history of long context evaluation? Feel free to share your thoughts and suggestions in the comments. - SCROLLS Paper: https://huggingface.co/papers/2201.03533 - ZeroSCROLLS Paper: https://huggingface.co/papers/2305.14196
reacted
to
sequelbox
's
post
with š§
about 10 hours ago
Raiden is here! 63k creative-reasoning and analytic-reasoning prompts answered by DeepSeek's 685b R1 model! - All prompts from https://huggingface.co/datasets/microsoft/orca-agentinstruct-1M-v1 and all responses from https://huggingface.co/deepseek-ai/DeepSeek-R1 - A deep look at R1's reasoning skills! Use as you will. Get it now: https://huggingface.co/datasets/sequelbox/Raiden-DeepSeek-R1 for everyone :)
reacted
to
sequelbox
's
post
with š
about 10 hours ago
Raiden is here! 63k creative-reasoning and analytic-reasoning prompts answered by DeepSeek's 685b R1 model! - All prompts from https://huggingface.co/datasets/microsoft/orca-agentinstruct-1M-v1 and all responses from https://huggingface.co/deepseek-ai/DeepSeek-R1 - A deep look at R1's reasoning skills! Use as you will. Get it now: https://huggingface.co/datasets/sequelbox/Raiden-DeepSeek-R1 for everyone :)
View all activity
Organizations
None yet
nicolay-r
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
liked
2 models
1 day ago
facebook/mgenre-wiki
Text2Text Generation
ā¢
Updated
Jan 24, 2023
ā¢
558
ā¢
28
sapienzanlp/relik-entity-linking-base
Updated
Aug 7, 2024
ā¢
94
ā¢
2
liked
a dataset
7 days ago
open-thoughts/OpenThoughts-114k
Viewer
ā¢
Updated
about 10 hours ago
ā¢
228k
ā¢
47.1k
ā¢
409
liked
a Space
8 days ago
Running
502
502
Qwen2.5 Max Demo
š¢
Send messages for chatbot responses
liked
a model
13 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-7B
Text Generation
ā¢
Updated
3 days ago
ā¢
515k
ā¢
398
liked
a model
16 days ago
deepseek-ai/DeepSeek-R1
Text Generation
ā¢
Updated
3 days ago
ā¢
2.94M
ā¢
ā¢
8.3k
liked
a Space
28 days ago
Running
on
CPU Upgrade
337
337
Open Medical-LLM Leaderboard
š„
Browse and submit LLM evaluations
liked
a model
28 days ago
johnsnowlabs/JSL-MedLlama-3-8B-v2.0
Text Generation
ā¢
Updated
Apr 30, 2024
ā¢
12.1k
ā¢
30
liked
a model
4 months ago
meta-llama/Llama-3.2-3B-Instruct
Text Generation
ā¢
Updated
Oct 24, 2024
ā¢
1.59M
ā¢
ā¢
968
liked
a model
7 months ago
hyy-33/hyy33-WASSA-2024-Track-2
Updated
Jul 9, 2024
ā¢
2
liked
6 models
8 months ago
google/gemma-2-9b-it
Text Generation
ā¢
Updated
Aug 27, 2024
ā¢
552k
ā¢
ā¢
654
google/gemma-2-27b-it
Text Generation
ā¢
Updated
Aug 27, 2024
ā¢
215k
ā¢
ā¢
518
Qwen/Qwen2-7B-Instruct
Text Generation
ā¢
Updated
Aug 21, 2024
ā¢
698k
ā¢
611
mistralai/Mistral-7B-Instruct-v0.3
Text Generation
ā¢
Updated
Aug 21, 2024
ā¢
496k
ā¢
ā¢
1.33k
microsoft/Phi-3-small-8k-instruct
Text Generation
ā¢
Updated
Aug 30, 2024
ā¢
24.5k
ā¢
160
microsoft/Phi-3-mini-4k-instruct
Text Generation
ā¢
Updated
Sep 20, 2024
ā¢
908k
ā¢
ā¢
1.13k
liked
4 models
10 months ago
xtuner/llava-phi-3-mini-hf
Image-to-Text
ā¢
Updated
Apr 25, 2024
ā¢
3.95k
ā¢
49
xtuner/llava-llama-3-8b-v1_1
Image-Text-to-Text
ā¢
Updated
Apr 28, 2024
ā¢
53
ā¢
120
AIRI-Institute/OmniFusion
Updated
Apr 10, 2024
ā¢
56
google-bert/bert-base-uncased
Fill-Mask
ā¢
Updated
Feb 19, 2024
ā¢
85.8M
ā¢
2.1k
Load more