Spaces: DontPlanToEnd/UGI-Leaderboard
DontPlanToEnd committed on Sep 30, 2024

Commit d607f0f (verified) · 1 Parent(s): dd73a2f

Update app.py

Files changed (1)

app.py CHANGED

@@ -220,7 +220,7 @@ with GraInter:
             **W/10:** Willingness/10. A more narrow subset of the UGI questions, solely focused on measuring how far a model can be pushed before going against its instructions, refusing to answer, or adding an ethical disclaimer to its response.
             <br>
-            **I/10:** Intelligence/10. The average score of the UGI questions with the highest correlation with parameter size. This metric tries to show how much intrinsic knowledge and reasoning a model has.
+            **I/10:** Intelligence/10. The average score of the UGI questions with the highest correlation with parameter size. This metric tries to show how much intrinsic knowledge and reasoning a model has (when it's willing to answer the questions).
             <br><br>
             A high UGI but low W/10 could mean for example that the model can provide a lot of accurate sensitive information, but will refuse to form the information into something it sees as dangerous. Or that it answers questions correctly, but appends a paragraph to its answer explaining why the question is immoral to ask.
             <br><br>
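
The I/10 description in this hunk (the average score over the questions that correlate most strongly with parameter size) could be computed roughly as in the sketch below. This is an illustration only, not the leaderboard's code: the function names, the `top_k` cutoff, the use of Pearson correlation, and the sample data are all assumptions, and the sketch does not model refusals, which is what the added "(when it's willing to answer the questions)" caveat is about.

```python
from math import sqrt
from statistics import mean


def pearson(xs, ys):
    """Pearson correlation of two equal-length sequences (0.0 if either is constant)."""
    mx, my = mean(xs), mean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_y = sum((y - my) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / sqrt(var_x * var_y)


def intelligence_scores(scores, param_sizes, top_k=2):
    """Hypothetical I/10: average each model's score over the top_k questions
    whose scores correlate most strongly with parameter size.

    scores:      {model_name: [score per question, 0-10]}
    param_sizes: {model_name: parameter count in billions}
    """
    models = sorted(scores)
    sizes = [param_sizes[m] for m in models]
    n_questions = len(scores[models[0]])

    # Correlation between each question's scores and model size, across models.
    corr = [
        pearson([scores[m][q] for m in models], sizes)
        for q in range(n_questions)
    ]

    # Indices of the questions most correlated with size.
    top = sorted(range(n_questions), key=lambda q: corr[q], reverse=True)[:top_k]

    return {m: mean(scores[m][q] for q in top) for m in models}


# Made-up example data: three models, three questions.
scores = {"small": [2, 9, 1], "medium": [5, 3, 4], "large": [9, 6, 8]}
param_sizes = {"small": 7, "medium": 13, "large": 70}
result = intelligence_scores(scores, param_sizes, top_k=2)
```

In this made-up data the second question is negatively correlated with size, so it is excluded and each model's score is the average of its first and third question scores.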