Dr. Joao Paulo Schwarz Schuler PRO

schuler

AI & ML interests

artificial intelligence

Recent Activity

updated a model 1 day ago
schuler/experimental-JP47D54C
updated a model 1 day ago
schuler/experimental-JP47D54B
updated a model 1 day ago
schuler/experimental-JP47D54
View all activity

Organizations

None yet

schuler's activity

posted an update 1 day ago
view post
Post
5754
šŸ“¢ New Research Alert: Making Language Models Smaller & Smarter!

Thrilled to share the latest technical report demonstrating how to reduce language model parameters by 77% while maintaining performance.

The secret? Grouped pointwise convolutions. Yes. We brought a method from computer vision to the transformers arena.

šŸ”‘ Key Findings:
ā€¢ 77% parameter reduction.
ā€¢ Maintained model capabilities.
ā€¢ Improved generalization.

Paper: https://www.researchgate.net/publication/388835829_SAVING_77_OF_THE_PARAMETERS_IN_LARGE_LANGUAGE_MODELS_TECHNICAL_REPORT
Code: https://github.com/joaopauloschuler/less-parameters-llm