Update app.py
app.py
CHANGED
@@ -56,7 +56,7 @@ GraInter = gr.Blocks()
 with GraInter:
     gr.HTML("""<h1 align="center">UGI Leaderboard</h1>""")
     gr.Markdown("""
-**UGI: Uncensored General Intelligence**. The average
+**UGI: Uncensored General Intelligence**. The average score from 5 different subjects that LLMs are commonly steered away from. The leaderboard is made of roughly 60 questions/tasks, measuring both "willingness to answer" and "accuracy" in controversial fact-based questions.
 
 **W/10:** A more narrow, 10-point score, solely measuring the LLM's Willingness to answer controversial questions.
 
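
The new description says the UGI score is "the average score from 5 different subjects". The diff does not show how the leaderboard actually aggregates those scores, so the following is only a sketch that assumes a plain unweighted mean. The function name `ugi_score`, the subject names, and the example values are all hypothetical.

```python
from statistics import mean

# The description states that the UGI score averages 5 subjects.
EXPECTED_SUBJECTS = 5


def ugi_score(subject_scores: dict[str, float]) -> float:
    """Return the unweighted mean of the per-subject scores.

    Assumption: the leaderboard uses a simple arithmetic mean; the
    real aggregation is not shown in this diff.
    """
    if len(subject_scores) != EXPECTED_SUBJECTS:
        raise ValueError(
            f"expected {EXPECTED_SUBJECTS} subject scores, got {len(subject_scores)}"
        )
    return mean(list(subject_scores.values()))


# Hypothetical subject names and scores, for illustration only.
example = {
    "subject_a": 40.0,
    "subject_b": 55.0,
    "subject_c": 30.0,
    "subject_d": 65.0,
    "subject_e": 60.0,
}

if __name__ == "__main__":
    print(ugi_score(example))
```

Rejecting inputs that do not have exactly five entries keeps the sketch consistent with the description; a weighted scheme would need a different function.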