C4AI Community

community

https://cohere.com/research

CohereForAI

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

1024m updated a collection 8 days ago

Machine Generated Text Detection

danylo-boiko authored a paper 9 days ago

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

sarahooker authored a paper 23 days ago

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

View all activity

C4AI-Community's activity

mmhamdy

posted an update about 19 hours ago

Post

1475

⛓ Evaluating Long Context #2: SCROLLS and ZeroSCROLLS

In this series of posts about tracing the history of long context evaluation, we started with Long Range Arena (LRA). Introduced in 2020, Long Range Arens (LRA) is one of the earliest benchmarks designed to tackle the challenge of long context evaluation. But it wasn't introduced to evaluate LLMs, but rather the transformer architecture in general.

📜 The SCROLLS benchmark, introduced in 2022, addresses this gap in NLP/LLM research. SCROLLS challenges models with tasks that require reasoning over extended sequences (according to 2022 standards). So, what does it offer?

1️⃣ Long Text Focus: SCROLLS (unlike LRA) focus mainly on text and contain inputs with thousands of words, testing models' ability to synthesize information across lengthy documents.
2️⃣ Diverse Tasks: Includes summarization, question answering, and natural language inference across domains like literature, science, and business.
3️⃣ Unified Format: All datasets are available in a text-to-text format, facilitating easy evaluation and comparison of models.

Building on SCROLLS, ZeroSCROLLS takes long text evaluation to the next level by focusing on zero-shot learning. Other features include:

1️⃣ New Tasks: Introduces tasks like sentiment aggregation and sorting book chapter summaries.
2️⃣ Leaderboard: A live leaderboard encourages continuous improvement and competition among researchers.

💡 What are some other landmark benchmarks in the history of long context evaluation? Feel free to share your thoughts and suggestions in the comments.

- SCROLLS Paper: SCROLLS: Standardized CompaRison Over Long Language Sequences (2201.03533)
- ZeroSCROLLS Paper: ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding (2305.14196)

prithivMLmods

posted an update 4 days ago

Post

3652

QwQ Edge Gets a Small Update..! 💬
try now: prithivMLmods/QwQ-Edge

🚀Now, you can use the following commands for different tasks:

🖼️ @image 'prompt...' → Generates an image
🔉@tts1 'prompt...' → Generates speech in a female voice
🔉 @tts2 'prompt...' → Generates speech in a male voice
🅰️@text 'prompt...' → Enables textual conversation (If not specified, text-to-text generation is the default mode)

💬Multimodality Support : prithivMLmods/Qwen2-VL-OCR-2B-Instruct
💬For text generation, the FastThink-0.5B model ensures quick and efficient responses, prithivMLmods/FastThink-0.5B-Tiny
💬Image Generation: sdxl lightning model, SG161222/RealVisXL_V4.0_Lightning

Github: https://github.com/PRITHIVSAKTHIUR/QwQ-Edge

graph TD
    A[User Interface] --> B[Chat Logic]
    B --> C{Command Type}
    C -->|Text| D[FastThink-0.5B]
    C -->|Image| E[Qwen2-VL-OCR-2B]
    C -->|@image| F[Stable Diffusion XL]
    C -->|@tts| G[Edge TTS]
    D --> H[Response]
    E --> H
    F --> H
    G --> H

1024m

updated a collection 8 days ago

Machine Generated Text Detection

Collection

datasets and checkpoints specifically for Aya-23-35B and Command-R-plus • 1 item • Updated 8 days ago • 2

Tonic

posted an update 8 days ago

Post

1979

🙋🏻‍♂️hey there folks ,

Goedel's Theorem Prover is now being demo'ed on huggingface : Tonic/Math

give it a try !

danylo-boiko

authored a paper 9 days ago

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Paper • 2411.19799 • Published Nov 29, 2024 • 11

prithivMLmods

posted an update 10 days ago

Post

4734

o3-Mini and Deepseek R1
Worked out with some famous and weird examples.

🔥Blog: https://huggingface.co/blog/prithivMLmods/o3-mini-vs-deepseek-r1

Prompt : Using HTML, CSS, and JavaScript in a single HTML file to create a simulation of the solar system. Pay extreme attention to the UI to make it as intuitive as possible. Ensure that every planet appears as a sphere and is labeled with its corresponding name.

example 1: o3 Mini , example 2: Deepseek R1

Q2 : https://huggingface.co/blog/prithivMLmods/o3-mini-vs-deepseek-r1#q2--web-solar-system-explorer

1 reply

not-lain

posted an update 13 days ago

Post

3444

I have just released a new blogpost about kv caching and its role in inference speedup 🚀
🔗 https://huggingface.co/blog/not-lain/kv-caching/
some takeaways :

4 replies

prithivMLmods

posted an update 14 days ago

Post

5099

Deepswipe by
.
.
.
. Deepseek🐬🗿

Everything is now in recovery. 📉📈

4 replies

Tonic

posted an update 15 days ago

Post

2865

🙋🏻‍♂️ Hey there folks ,

our team made a game during the @mistral-game-jam and we're trying to win the community award !

try our game out and drop us a ❤️ like basically to vote for us !

Mistral-AI-Game-Jam/TextToSurvive

hope you like it !

prithivMLmods

posted an update 22 days ago

Post

3712

Q'n' Sketches ❤️‍🔥

🖼️ Adapters:
- Qs : strangerzonehf/Qs-Sketch
- Qd : strangerzonehf/Qd-Sketch
- Qx : strangerzonehf/Qx-Art
- Qc : strangerzonehf/Qc-Sketch
- Bb : strangerzonehf/Bg-Bag

🐍 Collection : strangerzonehf/q-series-sketch-678e3503bf3a661758429717

🔗Page : https://huggingface.co/strangerzonehf

.
.
.
@prithivMLmods 🤗

sarahooker

authored 6 papers 23 days ago

Multilingual Arbitrage: Optimizing Data Pools to Accelerate Multilingual Progress

Paper • 2408.14960 • Published Aug 27, 2024

Mix Data or Merge Models? Optimizing for Diverse Multi-Task Learning

Paper • 2410.10801 • Published Oct 14, 2024

INCLUDE: Evaluating Multilingual Language Understanding with Regional Knowledge

Paper • 2411.19799 • Published Nov 29, 2024 • 11

Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation

Paper • 2412.03304 • Published Dec 4, 2024 • 17

IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

Paper • 2406.03368 • Published Jun 5, 2024

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 8

not-lain

posted an update 25 days ago

Post

1567

we now have more than 2000 public AI models using ModelHubMixin🤗

prithivMLmods

posted an update 26 days ago

Post

3091

ChemQwen-vL [ Qwen for Chem Vision ] 🧑🏻‍🔬

🧪Model : prithivMLmods/ChemQwen-vL

📝ChemQwen-vL is a vision-language model fine-tuned based on the Qwen2VL-2B Instruct model. It has been trained using the International Chemical Identifier (InChI) format for chemical compounds and is optimized for chemical compound identification. The model excels at generating the InChI and providing descriptions of chemical compounds based on their images. Its architecture operates within a multi-modal framework, combining image-text-text capabilities. It has been fine-tuned using datasets from: https://iupac.org/projects/

📒Colab Demo: https://tinyurl.com/2pn8x6u7, Collection : https://tinyurl.com/2mt5bjju

Inference with the documentation is possible with the help of the ReportLab library. https://pypi.org/project/reportlab/

🤗: @prithivMLmods

1 reply

Tonic

posted an update 26 days ago

Post

1848

🙋🏻‍♂️ Hey there folks ,

Facebook AI just released JASCO models that make music stems .

you can try it out here : Tonic/audiocraft

hope you like it

MarziehFadaee

authored a paper 27 days ago

Towards Best Practices for Open Datasets for LLM Training

Paper • 2501.08365 • Published 28 days ago • 54

AI & ML interests

Recent Activity

Team members 156

C4AI-Community's activity