Takuya Akiba
iwiwi
AI & ML interests
None yet
Recent Activity
authored
a paper
6 days ago
Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive
Branching Tree Search
authored
a paper
13 days ago
Drop-Upcycling: Training Sparse Mixture of Experts with Partial
Re-initialization
upvoted
a
paper
13 days ago
Drop-Upcycling: Training Sparse Mixture of Experts with Partial
Re-initialization
Organizations
iwiwi's activity
Add a link to SmolSwallow-1.5B-Instruct
#1 opened about 2 months ago
by
iwiwi

Update README.md
#1 opened 11 months ago
by
iwiwi

Update README.md
#1 opened 12 months ago
by
iwiwi

Fixing typo in code example
1
#1 opened over 1 year ago
by
iwiwi

Fix device errors on GPU environments
1
#1 opened over 1 year ago
by
iwiwi

Fix `max_position_embeddings` of text encoder
#1 opened over 1 year ago
by
iwiwi

Update README.md
1
#1 opened over 1 year ago
by
iwiwi
