atayloraerospace's picture

atayloraerospace

Taylor658

·

atayloraerospace

AI & ML interests

Multimodal Gen AI 🤖 | Agentic AI 🧠🤖 | Computer Vision 🔭 | AI in Healthcare 🩺 | AI in Aerospace 🚀

Recent Activity

published a model 3 days ago

Taylor658/Photonics_Distill_Llama_70B

updated a model 4 days ago

Taylor658/Photonics_Distill_Llama_70B

new activity 4 days ago

Taylor658/photonic-integrated-circuit-yield:Upload photonic-integrated-circuit-yield.csv

View all activity

Organizations

Taylor658's activity

upvoted a collection 5 days ago

Hibiki fr-en

Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated 5 days ago • 45

upvoted an article 9 days ago

Article

Open-R1: Update #1

By

and 7 others •

10 days ago

• 268

upvoted a paper 12 days ago

GuardReasoner: Towards Reasoning-based LLM Safeguards

Paper • 2501.18492 • Published 12 days ago • 80

upvoted 9 collections 14 days ago

FuseO1-Preview

System-II Reasoning Fusion of LLMs • 10 items • Updated 12 days ago • 17

Meta's Llama2 models

12 items • Updated Dec 13, 2024 • 65

YuE

YuE: Open Full-song Generation Foundation Model • 9 items • Updated 14 days ago • 19

Eagle 2

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 20 days ago • 31

Deepseek Papers

Deepseek papers collection • 15 items • Updated 8 days ago • 58

Qwen2.5-1M

The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 16 days ago • 99

SmolVLM 256M & 500M

Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 19 days ago • 68

DeepSeek-R1

8 items • Updated 22 days ago • 472

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 3 items • Updated 16 days ago • 337

upvoted 8 papers 14 days ago

Visual Generation Without Guidance

Paper • 2501.15420 • Published 17 days ago • 8

Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models

Paper • 2501.12370 • Published 21 days ago • 10

CodeMonkeys: Scaling Test-Time Compute for Software Engineering

Paper • 2501.14723 • Published 18 days ago • 7

Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity

Paper • 2501.16295 • Published 15 days ago • 7

Are Vision Language Models Texture or Shape Biased and Can We Steer Them?

Paper • 2403.09193 • Published Mar 14, 2024 • 9

iFormer: Integrating ConvNet and Transformer for Mobile Application

Paper • 2501.15369 • Published 17 days ago • 12

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published 16 days ago • 15

Qwen2.5-1M Technical Report

Paper • 2501.15383 • Published 17 days ago • 54