Yida Lu's picture

2 4

Yida Lu

lrxl

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

authored a paper 2 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

upvoted a paper 2 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

View all activity

Organizations

None yet

lrxl's activity

upvoted a paper about 2 months ago

Agent-SafetyBench: Evaluating the Safety of LLM Agents

Paper • 2412.14470 • Published Dec 19, 2024 • 12

authored a paper 2 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 18

upvoted a paper 2 months ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published Dec 16, 2024 • 18

authored a paper 8 months ago

AutoDetect: Towards a Unified Framework for Automated Weakness Detection in Large Language Models

Paper • 2406.16714 • Published Jun 24, 2024 • 10

updated a dataset 8 months ago

lrxl/AutoDetect-results

Viewer • Updated Jun 25, 2024 • 15 • 17 • 1

liked 4 models 12 months ago

thu-coai/ShieldLM-6B-chatglm3

Feature Extraction • Updated Feb 27, 2024 • 19 • 3

thu-coai/ShieldLM-13B-baichuan2

Text Generation • Updated Feb 27, 2024 • 18 • 3

thu-coai/ShieldLM-7B-internlm2

Feature Extraction • Updated Feb 27, 2024 • 82 • 10

thu-coai/ShieldLM-14B-qwen

Text Generation • Updated Feb 27, 2024 • 25 • 13