Update app.py
app.py
@@ -220,7 +220,7 @@ with GraInter:
 
 **W/10:** Willingness/10. A more narrow subset of the UGI questions, solely focused on measuring how far a model can be pushed before going against its instructions, refusing to answer, or adding an ethical disclaimer to its response.
 <br>
-**I/10:** Intelligence/10. The average score of the UGI questions with the highest correlation with parameter size. This metric tries to show how much intrinsic knowledge and reasoning the model has. It is still effected by willingness due to the lack of non-uncensoredness-focused questions in the test set that can be used to construct the metric.
+**I/10:** Intelligence/10. The average score of the UGI questions with the highest correlation with parameter size. This metric tries to show how much intrinsic knowledge and reasoning the model has. It is still effected by willingness due to the lack of non-uncensoredness-focused questions in the current test set that can be used to construct the metric.
 <br><br>
 A high UGI but low W/10 could mean for example that the model can provide a lot of accurate sensitive information, but will refuse to form the information into something it sees as dangerous. Or that it answers questions correctly, but appends a paragraph to its answer explaining why the question is immoral to ask.
 <br><br>