Spaces:

dar-tau
/

selfie

Sleeping

dar-tau commited on Apr 7, 2024

Commit

34b25c9

verified ·

1 Parent(s): b4d2f29

Update app.py

Files changed (1) hide show

app.py CHANGED Viewed

@@ -131,10 +131,10 @@ with gr.Blocks(theme=gr.themes.Default(), css=css) as demo:
             gr.Markdown('''
                 # 😎 Self-Interpreting Models 😎
-                👾 **This space follows the emerging trend of models interpreting their _own hidden states_ in free form natural language**!! 👾
                 This idea was explored in the paper **Patchscopes** ([Ghandeharioun et al., 2024](https://arxiv.org/abs/2401.06102)) and was later investigated further in **SelfIE** ([Chen et al., 2024](https://arxiv.org/abs/2403.10949)).
-                An honorary mention for **Speaking Probes** ([Dar, 2023](https://towardsdatascience.com/speaking-probes-self-interpreting-models-7a3dc6cb33d6) -- my post!! 🥳)  which was a less mature approach but with the same idea in mind.
-                We follow the SelfIE implementation in this space for concreteness. Patchscopes are so general that they encompass many other interpretation techniques too!!!
                 👾 **The idea is really simple: models are able to understand their own hidden states by nature!** 👾
                 If I give a model a prompt of the form ``User: [X] Assistant: Sure'll I'll repeat your message`` and replace ``[X]`` *during computation* with the hidden state we want to understand,

             gr.Markdown('''
                 # 😎 Self-Interpreting Models 😎
+                👾 **This space is a simple introduction to the emerging trend of models interpreting their _own hidden states_ in free form natural language**!! 👾
                 This idea was explored in the paper **Patchscopes** ([Ghandeharioun et al., 2024](https://arxiv.org/abs/2401.06102)) and was later investigated further in **SelfIE** ([Chen et al., 2024](https://arxiv.org/abs/2403.10949)).
+                An honorary mention of **Speaking Probes** ([Dar, 2023](https://towardsdatascience.com/speaking-probes-self-interpreting-models-7a3dc6cb33d6) -- my own work!! 🥳) which was a less mature but had the same idea in mind.
+                We will follow the SelfIE implementation in this space for concreteness. Patchscopes are so general that they encompass many other interpretation techniques too!!!
                 👾 **The idea is really simple: models are able to understand their own hidden states by nature!** 👾
                 If I give a model a prompt of the form ``User: [X] Assistant: Sure'll I'll repeat your message`` and replace ``[X]`` *during computation* with the hidden state we want to understand,