Dan Jacobellis's picture

3 52 5

Dan Jacobellis PRO

danjacobellis

·

https://danjacobellis.net

danjacobellis

AI & ML interests

Signal processing, information theory, data compression

Recent Activity

published a model about 18 hours ago

danjacobellis/dance

updated a model 1 day ago

danjacobellis/dance

updated a dataset 6 days ago

danjacobellis/LSDIR_512_f16c12_caption

View all activity

Organizations

None yet

danjacobellis's activity

upvoted 3 papers 9 days ago

SampleMix: A Sample-wise Pre-training Data Mixing Strategey by Coordinating Data Quality and Diversity

Paper • 2503.01506 • Published 10 days ago • 9

DiffRhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Paper • 2503.01183 • Published 10 days ago • 26

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published 10 days ago • 72

upvoted 5 papers 13 days ago

Towards an AI co-scientist

Paper • 2502.18864 • Published 15 days ago • 42

Training Consistency Models with Variational Noise Coupling

Paper • 2502.18197 • Published 16 days ago • 6

FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute

Paper • 2502.20126 • Published 14 days ago • 20

NeoBERT: A Next-Generation BERT

Paper • 2502.19587 • Published 15 days ago • 38

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Paper • 2502.20321 • Published 14 days ago • 29

upvoted a paper 20 days ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published 21 days ago • 129

upvoted 3 papers 22 days ago

HealthGPT: A Medical Large Vision-Language Model for Unifying Comprehension and Generation via Heterogeneous Knowledge Adaptation

Paper • 2502.09838 • Published 28 days ago • 10

Continuous Diffusion Model for Language Modeling

Paper • 2502.11564 • Published 24 days ago • 52

You Do Not Fully Utilize Transformer's Representation Capacity

Paper • 2502.09245 • Published 28 days ago • 34

upvoted a paper 23 days ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published 25 days ago • 142

upvoted 4 papers about 1 month ago

Music2Latent2: Audio Compression with Summary Embeddings and Autoregressive Decoding

Paper • 2501.17578 • Published Jan 29 • 1

Feasible Learning

Paper • 2501.14912 • Published Jan 24 • 5

iFormer: Integrating ConvNet and Transformer for Mobile Application

Paper • 2501.15369 • Published Jan 26 • 12

Baichuan-Omni-1.5 Technical Report

Paper • 2501.15368 • Published Jan 26 • 61

upvoted 3 papers about 2 months ago

MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine

Paper • 2408.02900 • Published Aug 6, 2024 • 28

The Geometry of Tokens in Internal Representations of Large Language Models

Paper • 2501.10573 • Published Jan 17 • 9

The GAN is dead; long live the GAN! A Modern GAN Baseline

Paper • 2501.05441 • Published Jan 9 • 88