3 40 199

Kristoffer Rolf Deinoff

gatepoet

AI & ML interests

None yet

Recent Activity

liked a dataset about 20 hours ago

agentica-org/DeepScaleR-Preview-Dataset

liked a model 1 day ago

agentica-org/DeepScaleR-1.5B-Preview

upvoted a collection 1 day ago

Recurrent Models

View all activity

Organizations

None yet

gatepoet's activity

upvoted a collection 1 day ago

Recurrent Models

Collection

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 14 items • Updated 6 days ago • 5

upvoted a paper 7 days ago

Llasa: Scaling Train-Time and Inference-Time Compute for Llama-based Speech Synthesis

Paper • 2502.04128 • Published 10 days ago • 22

upvoted 2 papers 21 days ago

Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass

Paper • 2501.13928 • Published 23 days ago • 17

FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces

Paper • 2501.12909 • Published 25 days ago • 67

upvoted a paper 23 days ago

O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning

Paper • 2501.12570 • Published 25 days ago • 24

upvoted an article 25 days ago

Article

The SOTA Text-to-speech and Zero Shot Voice cloning model that no one knows about...

•

26 days ago

• 60

upvoted a paper 27 days ago

RLHS: Mitigating Misalignment in RLHF with Hindsight Simulation

Paper • 2501.08617 • Published Jan 15 • 10

upvoted 2 papers 3 months ago

DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation

Paper • 2411.16657 • Published Nov 25, 2024 • 17

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 50

upvoted a paper 7 months ago

SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models

Paper • 2407.15841 • Published Jul 22, 2024 • 40

upvoted 2 collections 7 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 648

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated 30 days ago • 162

upvoted 2 papers 9 months ago

Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization

Paper • 2405.15071 • Published May 23, 2024 • 38

ResearchAgent: Iterative Research Idea Generation over Scientific Literature with Large Language Models

Paper • 2404.07738 • Published Apr 11, 2024 • 2

upvoted a paper 10 months ago

InstantFamily: Masked Attention for Zero-shot Multi-ID Image Generation

Paper • 2404.19427 • Published Apr 30, 2024 • 72

upvoted an article 10 months ago

Article

Fine Tuning a LLM Using Kubernetes with Intel® Xeon® Scalable Processors

•

Apr 24, 2024

• 6

upvoted 4 papers 10 months ago

MultiBooth: Towards Generating All Your Concepts in an Image from Text

Paper • 2404.14239 • Published Apr 22, 2024 • 9

The Instruction Hierarchy: Training LLMs to Prioritize Privileged Instructions

Paper • 2404.13208 • Published Apr 19, 2024 • 39

MA-LMM: Memory-Augmented Large Multimodal Model for Long-Term Video Understanding

Paper • 2404.05726 • Published Apr 8, 2024 • 21

Scaling (Down) CLIP: A Comprehensive Analysis of Data, Architecture, and Training Strategies

Paper • 2404.08197 • Published Apr 12, 2024 • 29