CNTXT AI

company

https://www.cntxt.tech/

cntxtai

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

Ahmad-ElShiekh-PhD updated a dataset 1 day ago

CNTXTAI0/CNTXTAI_Medical_Case_Studies

Ahmad-ElShiekh-PhD published a dataset 2 days ago

CNTXTAI0/CNTXTAI_Medical_Case_Studies

Ahmad-ElShiekh-PhD updated a dataset 4 days ago

CNTXTAI0/CNTXTAI_Medical_Review_Papers

View all activity

CNTXTAI0's activity

Ahmad-ElShiekh-PhD

updated a dataset 1 day ago

CNTXTAI0/CNTXTAI_Medical_Case_Studies

Updated 1 day ago • 13

Ahmad-ElShiekh-PhD

published a dataset 2 days ago

CNTXTAI0/CNTXTAI_Medical_Case_Studies

Updated 1 day ago • 13

Ahmad-ElShiekh-PhD

updated a dataset 4 days ago

CNTXTAI0/CNTXTAI_Medical_Review_Papers

Updated 4 days ago • 35

Ahmad-ElShiekh-PhD

published a dataset 5 days ago

CNTXTAI0/CNTXTAI_Medical_Review_Papers

Updated 4 days ago • 35

hasanabusheikh

posted an update 5 days ago

Post

908

🚀 Benchmarking Mistral Saba: Where Does It Stand?
The AI race is evolving rapidly, with new models emerging to cater to regional and domain-specific needs. Mistral Saba, a 24-billion-parameter model optimized for Arabic and South Asian languages, aims to bridge linguistic gaps in AI. But does it deliver? Our latest benchmarking report reveals some critical insights:
🔹 Strengths:
✅ Cost-effective—$0.20 per million input tokens, making it budget-friendly.
✅ High throughput—Processes 150+ tokens per second, ensuring efficiency.

🔻 Major Shortcomings:
❌ Struggles with Arabic dialects—Fails to handle Egyptian, Gulf, and Levantine variations.
❌ Poor performance in Modern Standard Arabic (MSA) languages.
❌ Severe hallucinations—Generates fabricated religious content and incorrect citations.
❌ Weak logical & mathematical reasoning—Falls short in benchmarks like HellaSwag and GSM8K.
❌ Poor factual accuracy—Mistral Saba underperforms against GPT-4o and Claude 3.5 in truthfulness tests.
While regional AI models are much needed, transparency, dataset curation, and ethical oversight remain crucial for their reliability. The industry must focus on community-driven dataset creation, third-party audits, and stakeholder collaboration to develop truly localized AI that serves its target populations accurately.

Solution in 2 words:
Hyper-Localization

💡 What are your thoughts on the need for region-specific AI models? Let’s discuss! 👇

hashtag#AI hashtag#Benchmarking hashtag#LLMs hashtag#MistralSaba hashtag#arabic hashtag#ethical

hashtag#mistralhttps://huggingface.co/mistralai/Mistral-Small-24B-Instruct-2501

Ahmad-ElShiekh-PhD

updated a dataset 8 days ago

CNTXTAI0/Medical-Articles

Updated 8 days ago • 62

Ahmad-ElShiekh-PhD

published a dataset 9 days ago

CNTXTAI0/Medical-Articles

Updated 8 days ago • 62

Ahmad-ElShiekh-PhD

updated a dataset 11 days ago

CNTXTAI0/CNTXTAI-Ranking-Dataset

Viewer • Updated 11 days ago • 50 • 64

Ahmad-ElShiekh-PhD

published a dataset 12 days ago

CNTXTAI0/CNTXTAI-Ranking-Dataset

Viewer • Updated 11 days ago • 50 • 64

hasanabusheikh

posted an update 16 days ago

Post

549

Data is the new currency—garbage in, garbage out. Ensure you have the right data, tailored to your needs, and in the right format for optimal results!

CNTXTAI0/arabic_dialects_question_and_answer

cntxtai

updated a Space 18 days ago

README

🦀

cntxtai

published a Space 19 days ago

README

🦀

AI & ML interests

Recent Activity

Team members 3

CNTXTAI0's activity

README

README