- Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
  Paper • 2311.08877 • Published • 7
- Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback
  Paper • 2305.14975 • Published • 1
- Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models
  Paper • 2305.13712 • Published • 2
Collections including paper arxiv:2311.08877
- Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
  Paper • 2311.07590 • Published • 17
- A Survey on Language Models for Code
  Paper • 2311.07989 • Published • 22
- Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
  Paper • 2311.08877 • Published • 7
- A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise
  Paper • 2312.12436 • Published • 14

- Contrastive Chain-of-Thought Prompting
  Paper • 2311.09277 • Published • 36
- Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying
  Paper • 2311.09578 • Published • 16
- Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
  Paper • 2311.08877 • Published • 7
- Fusion-Eval: Integrating Evaluators with LLMs
  Paper • 2311.09204 • Published • 6

- Language Models can be Logical Solvers
  Paper • 2311.06158 • Published • 23
- Fusion-Eval: Integrating Evaluators with LLMs
  Paper • 2311.09204 • Published • 6
- Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation
  Paper • 2311.08877 • Published • 7
- Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?"
  Paper • 2311.07587 • Published • 5

- Eureka: Human-Level Reward Design via Coding Large Language Models
  Paper • 2310.12931 • Published • 26
- GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
  Paper • 2311.04901 • Published • 11
- Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems
  Paper • 2311.05884 • Published • 11
- PolyMaX: General Dense Prediction with Mask Transformer
  Paper • 2311.05770 • Published • 11