Extract attention from model

#15
by kaustabanv - opened
  1. Is there a way to extract the attention values at every BertLayer? It would be useful for interpretability if the attention values could be accessed.
  2. I'm new to transformers. Can the hidden layer outputs be used for explainability the same way attention is?

Thanks for your time!

https://huggingface.co/zhihan1996/DNABERT-2-117M/discussions/24

After reading this, I was finally able to extract the attention values. I hope this helps all of us :D
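For reference, this is how attention and hidden states are exposed through the standard Transformers API. It's a minimal sketch using a stock BERT checkpoint (`bert-base-uncased`, chosen only for illustration, with an arbitrary input string); whether DNABERT-2's custom modeling code honors the same flags out of the box is exactly what the linked discussion covers, so treat this as the general pattern rather than a DNABERT-2-specific recipe.

```python
# Minimal sketch with a stock BERT checkpoint. DNABERT-2 loads custom
# modeling code via trust_remote_code, so it may need the workaround from
# the linked discussion before these flags take effect.
import torch
from transformers import AutoTokenizer, AutoModel

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")
model.eval()

inputs = tokenizer("ACGT ACGT ACGT", return_tensors="pt")  # placeholder input

with torch.no_grad():
    outputs = model(
        **inputs,
        output_attentions=True,      # attention probabilities from every layer
        output_hidden_states=True,   # embeddings output plus every layer's hidden states
    )

# Tuple with one tensor per layer: (batch, num_heads, seq_len, seq_len)
attentions = outputs.attentions
# Tuple with the embeddings output plus one tensor per layer: (batch, seq_len, hidden_size)
hidden_states = outputs.hidden_states

print(len(attentions), attentions[0].shape)
print(len(hidden_states), hidden_states[0].shape)
```

On question 2: the attention tensors tell you which tokens each head attends to, while the hidden states are the token representations themselves; both are commonly used for interpretability, but they answer different questions.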
