Aymane El Firdoussi

AymaneElfirdo

AI & ML interests

None yet

Recent Activity

Organizations

AtlasIA · Hugging Face Discord Community

AymaneElfirdo's activity

published an article 1 day ago

Darija Chatbot Arena: Making LLMs Compete in the Moroccan Dialect

By atlasia and 2 others
New activity in atlasia/DODa-audio-dataset 7 days ago
reacted to grimjim's post with 👍 5 months ago
I found this paper to be thought-provoking: "Smaller, Weaker, Yet Better: Training LLM Reasoners via Compute-Optimal Sampling" by Bansal, Hosseini, Agarwal, Tran, and Kazemi.
https://arxiv.org/abs/2408.16737
The direct implication is that smaller models could be used to create cost-effective synthetic datasets. And on that note, in the Gemma terms of use, Google explicitly claims no rights on outputs generated from those models, which means one is free to synthgen from the Gemma line. Meta's Llama 3 license, by contrast, forbids using model outputs to improve other models. Relevant Mistral, Qwen, and Yi models released under the Apache 2.0 license are unrestricted for this purpose.
  • 2 replies
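The economics behind the post's claim can be sketched with hypothetical numbers: per-token sampling cost scales roughly with parameter count, so under a fixed compute budget a weaker model yields proportionally more candidate outputs. The model sizes, token count, and budget below are illustrative assumptions, not figures from the paper.

```python
# Illustrative sketch (hypothetical numbers, not from the paper):
# sampling cost is assumed to be ~2 * params FLOPs per generated token.

def samples_per_budget(budget_flops: float, params: float, tokens_per_sample: int = 512) -> int:
    """Approximate number of samples a model can generate within a FLOP budget."""
    cost_per_sample = 2 * params * tokens_per_sample
    return int(budget_flops // cost_per_sample)

budget = 1e18  # fixed sampling budget in FLOPs (hypothetical)
small = samples_per_budget(budget, params=9e9)   # e.g. a ~9B-parameter model
large = samples_per_budget(budget, params=27e9)  # e.g. a ~27B-parameter model

print(small, large)  # the smaller model yields roughly 3x as many samples
```

If the extra samples from the weaker model provide enough coverage and diversity, they can outweigh the per-sample quality advantage of the stronger model, which is the trade-off the cited paper examines.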