Vinh Nguyen's picture

48 129

Vinh Nguyen

vinhnx90

·

https://vinhnx.github.io

AI & ML interests

Learn by doing

Recent Activity

upvoted an article 3 days ago

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

upvoted a collection 3 days ago

OLMoE (January 2025)

liked a model 7 days ago

stas/ml-engineering-book

View all activity

Organizations

None yet

vinhnx90's activity

upvoted an article 3 days ago

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

By

•

3 days ago

• 3

upvoted a collection 3 days ago

OLMoE (January 2025)

Improved OLMoE for iOS app. Read more: https://allenai.org/blog/olmoe-app • 10 items • Updated 3 days ago • 9

upvoted an article 9 days ago

Article

🦸🏻#9: Does AI Remember? The Role of Memory in Agentic Workflows

By

•

12 days ago

• 11

upvoted an article 11 days ago

Article

Open-R1: Update #1

By

and 7 others •

13 days ago

• 276

upvoted a paper 15 days ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published 23 days ago • 318

upvoted an article 17 days ago

Article

Open-R1: a fully open reproduction of DeepSeek-R1

18 days ago

• 734

upvoted 2 articles 22 days ago

Article

Fine-tune ModernBERT for RAG with Synthetic Data

By

and 2 others •

25 days ago

• 36

Article

Exploring Synthetic Data Generation with DataDreamer

By

•

24 days ago

• 6

upvoted 2 collections 23 days ago

llama.vim

Recommended models for the llama.vim and llama.vscode plugins • 5 items • Updated 10 days ago • 21

DeepSeek-R1

8 items • Updated 25 days ago • 491

upvoted an article 24 days ago

Article

A Beginner-Friendly PyTorch Tutorial: Build and Train Your First Model

By

•

25 days ago

• 5

upvoted a collection 25 days ago

DeepSeek R1 (All Versions)

DeepSeek R1 - the most powerful reasoning open-source model - available in GGUF, original & 4-bit formats. Includes Llama & Qwen distilled models. • 29 items • Updated 6 days ago • 176

upvoted an article 25 days ago

Article

Simplifying Alignment: From RLHF to Direct Preference Optimization (DPO)

By

•

26 days ago

• 13

upvoted a paper 26 days ago

Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese

Paper • 2408.12480 • Published Aug 22, 2024 • 23

upvoted 6 articles 26 days ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

• 153

Article

Decoding Strategies in Large Language Models

By

•

Oct 29, 2024

• 40

Article

Diving into MiniMax01 405B MoE

By

•

about 1 month ago

• 17

Article

Code a simple RAG from scratch

By

•

Oct 29, 2024

• 23

Article

They Said It Couldn’t Be Done

By

and 2 others •

Dec 5, 2024

• 80

Article

RLHF 101: A Technical Dive into RLHF

By

•

Dec 11, 2024

• 5