Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2311.08877

Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7
Just Ask for Calibration: Strategies for Eliciting Calibrated Confidence Scores from Language Models Fine-Tuned with Human Feedback

Paper • 2305.14975 • Published May 24, 2023 • 1
Knowledge of Knowledge: Exploring Known-Unknowns Uncertainty with Large Language Models

Paper • 2305.13712 • Published May 23, 2023 • 2

Talking to Chatbots' papers collection

Collecting papers that I have commented on my website and social media, plus any others that I found interesting.

Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure

Paper • 2311.07590 • Published Nov 9, 2023 • 17
A Survey on Language Models for Code

Paper • 2311.07989 • Published Nov 14, 2023 • 22
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7
A Challenger to GPT-4V? Early Explorations of Gemini in Visual Expertise

Paper • 2312.12436 • Published Dec 19, 2023 • 14

Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7
microsoft/Orca-2-13b

Text Generation • Updated Nov 22, 2023 • 15.2k • 666
openchat/openchat_3.5

Text Generation • Updated May 18, 2024 • 18k • 1.12k

Weekly reading selection

Contrastive Chain-of-Thought Prompting

Paper • 2311.09277 • Published Nov 15, 2023 • 36
Tied-Lora: Enhacing parameter efficiency of LoRA with weight tying

Paper • 2311.09578 • Published Nov 16, 2023 • 16
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7
Fusion-Eval: Integrating Evaluators with LLMs

Paper • 2311.09204 • Published Nov 15, 2023 • 6

Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7

Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7

logical reasoning with llms

Language Models can be Logical Solvers

Paper • 2311.06158 • Published Nov 10, 2023 • 23
Fusion-Eval: Integrating Evaluators with LLMs

Paper • 2311.09204 • Published Nov 15, 2023 • 6
Llamas Know What GPTs Don't Show: Surrogate Models for Confidence Estimation

Paper • 2311.08877 • Published Nov 15, 2023 • 7
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?

Paper • 2311.07587 • Published Nov 8, 2023 • 5

Eureka: Human-Level Reward Design via Coding Large Language Models

Paper • 2310.12931 • Published Oct 19, 2023 • 26
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs

Paper • 2311.04901 • Published Nov 8, 2023 • 11
Hiformer: Heterogeneous Feature Interactions Learning with Transformers for Recommender Systems

Paper • 2311.05884 • Published Nov 10, 2023 • 11
PolyMaX: General Dense Prediction with Mask Transformer

Paper • 2311.05770 • Published Nov 9, 2023 • 11

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs