SongHye's picture

31 11

SongHye

ADidennn

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

upvoted a paper 5 days ago

Perspectives on the State and Future of Deep Learning -- 2023

upvoted a paper 5 days ago

Faithful Persona-based Conversational Dataset Generation with Large Language Models

View all activity

Organizations

None yet

ADidennn's activity

upvoted 20 papers 5 days ago

The Stochastic Parrot on LLM's Shoulder: A Summative Assessment of Physical Concept Understanding

Paper • 2502.08946 • Published 8 days ago • 180

Perspectives on the State and Future of Deep Learning -- 2023

Paper • 2312.09323 • Published Dec 7, 2023 • 8

Faithful Persona-based Conversational Dataset Generation with Large Language Models

Paper • 2312.10007 • Published Dec 15, 2023 • 9

SlimmeRF: Slimmable Radiance Fields

Paper • 2312.10034 • Published Dec 15, 2023 • 9

Extending Context Window of Large Language Models via Semantic Compression

Paper • 2312.09571 • Published Dec 15, 2023 • 15

Stable Score Distillation for High-Quality 3D Generation

Paper • 2312.09305 • Published Dec 14, 2023 • 10

Challenges with unsupervised LLM knowledge discovery

Paper • 2312.10029 • Published Dec 15, 2023 • 10

DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models

Paper • 2312.09767 • Published Dec 15, 2023 • 27

Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models

Paper • 2312.09608 • Published Dec 15, 2023 • 16

MobileSAMv2: Faster Segment Anything to Everything

Paper • 2312.09579 • Published Dec 15, 2023 • 24

Point Transformer V3: Simpler, Faster, Stronger

Paper • 2312.10035 • Published Dec 15, 2023 • 20

Self-Evaluation Improves Selective Generation in Large Language Models

Paper • 2312.09300 • Published Dec 14, 2023 • 16

Weight subcloning: direct initialization of transformers using larger pretrained ones

Paper • 2312.09299 • Published Dec 14, 2023 • 19

TigerBot: An Open Multilingual Multitask LLM

Paper • 2312.08688 • Published Dec 14, 2023 • 7

SHAP-EDITOR: Instruction-guided Latent 3D Editing in Seconds

Paper • 2312.09246 • Published Dec 14, 2023 • 9

VL-GPT: A Generative Pre-trained Transformer for Vision and Language Understanding and Generation

Paper • 2312.09251 • Published Dec 14, 2023 • 10

Helping or Herding? Reward Model Ensembles Mitigate but do not Eliminate Reward Hacking

Paper • 2312.09244 • Published Dec 14, 2023 • 11

ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks

Paper • 2312.08583 • Published Dec 14, 2023 • 12

UniDream: Unifying Diffusion Priors for Relightable Text-to-3D Generation

Paper • 2312.08754 • Published Dec 14, 2023 • 11

General Object Foundation Model for Images and Videos at Scale

Paper • 2312.09158 • Published Dec 14, 2023 • 12