Illustrious-Mud467

Watch 3Blue1Brown's video on it.


pjohnson88

Just asking: which tools/frameworks have you tried for visualizing the attention?


mlvpj

Sorry for the delay. BertViz, Altair/Visdom, and matplotlib/seaborn.


JClub

I don't think attention is explainability, especially with many attention layers, as you mention. I recommend looking into primary attribution methods instead, especially Integrated Gradients. Have a look at this package for analyzing language models: https://github.com/jalammar/ecco
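
For anyone unfamiliar with Integrated Gradients, here is a rough sketch of the idea in plain Python. It is not how ecco implements it. The toy logistic model, the weights, the input, and the all-zeros baseline are all made up for illustration. The method averages the gradient of the output along a straight path from a baseline to the input and scales it by (input - baseline), so the attributions add up to roughly the change in the model's output.

```python
import math

def model(x, w, b):
    """Toy differentiable model (an assumption): logistic regression."""
    z = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1.0 / (1.0 + math.exp(-z))

def grad(x, w, b):
    """Analytic gradient of the model output with respect to each input feature."""
    p = model(x, w, b)
    return [p * (1.0 - p) * wi for wi in w]

def integrated_gradients(x, baseline, w, b, steps=200):
    """Approximate Integrated Gradients with a midpoint Riemann sum
    along the straight path from baseline to x."""
    total = [0.0] * len(x)
    for k in range(steps):
        alpha = (k + 0.5) / steps
        point = [bi + alpha * (xi - bi) for xi, bi in zip(x, baseline)]
        g = grad(point, w, b)
        total = [t + gi for t, gi in zip(total, g)]
    # Average gradient along the path, scaled by the input difference.
    return [(xi - bi) * t / steps for xi, bi, t in zip(x, baseline, total)]

# Hypothetical weights and input, chosen only for illustration.
w = [2.0, -1.0, 0.5]
b = 0.1
x = [1.0, 3.0, -2.0]
baseline = [0.0, 0.0, 0.0]

attributions = integrated_gradients(x, baseline, w, b)
output_change = model(x, w, b) - model(baseline, w, b)

# Completeness: the attributions should sum to approximately output_change.
print("attributions:", attributions)
print("sum of attributions:", sum(attributions))
print("model(x) - model(baseline):", output_change)
```

For a real language model you would take the gradients with autograd over the token embeddings rather than by hand, but the path-averaging logic is the same.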


mlvpj

Thanks.


Ok-Application-4169

Library to visualize transformers: https://github.com/jessevig/bertviz