Dr. Joao Paulo Schwarz Schuler's picture

6 7

Dr. Joao Paulo Schwarz Schuler PRO

schuler

·

https://www.researchgate.net/profile/Joao-Paulo-Schwarz-Schuler

joaopauloschuler

AI & ML interests

artificial intelligence

Recent Activity

updated a model 1 day ago

schuler/experimental-JP47D54C

updated a model 1 day ago

schuler/experimental-JP47D54B

updated a model 1 day ago

schuler/experimental-JP47D54

View all activity

Organizations

None yet

Posts 1

Post

5327

📢 New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.

The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena.

🔑 Key Findings:
• 77% parameter reduction.
• Maintained model capabilities.
• Improved generalization.

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm

spaces 3

Experimental KPhi3 Model - Currently in Training

Experimental KPhi-3 micro 4k instruct gradio autoloader

Experimental KPhi-3 micro 4k instruct gradio autoloader

Experimental KPhi-3 nano 4k instruct gradio autoloader

Experimental KPhi-3 nano 4k instruct gradio autoloader

models 14

schuler/experimental-JP47D54C

Text Generation • Updated 1 day ago • 6

schuler/experimental-JP47D54B

Text Generation • Updated 1 day ago • 12

schuler/experimental-JP47D54

Text Generation • Updated 1 day ago • 10

schuler/experimental-JP47D55C

Text Generation • Updated 1 day ago • 8

schuler/experimental-JP47D55B

Text Generation • Updated 1 day ago • 9

schuler/experimental-JP47D55

Text Generation • Updated 1 day ago • 9

schuler/experimental-JP47D56C

Text Generation • Updated 1 day ago • 8

schuler/experimental-JP47D56B

Text Generation • Updated 1 day ago • 9

schuler/experimental-JP47D56

Text Generation • Updated 1 day ago • 8

schuler/experimental-JP47D21-KPhi-3-micro-4k-instruct

Text Generation • Updated Dec 5, 2024 • 39

datasets 7

schuler/cosmopedia-v2-textbook-and-howto-8.3m

Viewer • Updated Nov 20, 2024 • 8.3M • 42

schuler/cosmopedia-v2-textbook-and-howto-2.3m

Viewer • Updated Nov 17, 2024 • 2.27M • 60 • 1

schuler/open-orca-slimorca-deduped-cleaned-corrected-for-pascal-txt

Viewer • Updated Nov 17, 2024 • 132k • 50

schuler/cosmopedia-v2-textbook-and-howto-4.5m

Viewer • Updated Nov 17, 2024 • 4.46M • 103

schuler/TinyStories4PascalTxt

Viewer • Updated Oct 26, 2024 • 2.12M • 55

schuler/TinyStories4Pascal-Tokenized-v2

Updated Sep 16, 2024 • 66

schuler/TinyStories4Pascal

Preview • Updated Sep 14, 2024 • 73 • 2