Implementing Compositional Attention

rishit_dagli · June 24, 2022, 5:24am

Here is my TF/Keras implementation of the recent Compositional Attention paper by MILA, which disentangles the search and retrieval components of the attention mechanism. It can be used as a drop-in replacement for standard multi-head attention and outperforms it on some tasks.
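
For anyone wondering what "disentangling search and retrieval" looks like in practice, here is a rough NumPy sketch based on my reading of the paper. It is not the code from the implementation above. As I understand it, each of S "searches" (query-key pairs) produces an attention pattern, each of R "retrievals" (value projections) is applied under every pattern, and a second softmax lets each search pick which retrieval to use. All names here (`init_params`, `compositional_attention`, the weight keys, the sizes) are hypothetical, and batching, masking, biases and dropout are left out.

```python
import numpy as np


def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)


def init_params(rng, d_model, n_search, n_retrieval, d_head, d_r):
    # Hypothetical parameter layout, chosen for this sketch only.
    def w(*shape):
        return rng.normal(scale=shape[-2] ** -0.5, size=shape)

    return {
        "Wq": w(n_search, d_model, d_head),     # search queries
        "Wk": w(n_search, d_model, d_head),     # search keys
        "Wv": w(n_retrieval, d_model, d_head),  # retrieval values
        "Wq_r": w(n_search, d_model, d_r),      # retrieval-selection queries
        "Wk_r": w(d_head, d_r),                 # retrieval-selection keys
        "Wo": w(n_search * d_head, d_model),    # output projection
    }


def compositional_attention(x, p):
    """x: (tokens, d_model). Returns (output, retrieval selection weights)."""
    d_head = p["Wq"].shape[-1]
    d_r = p["Wk_r"].shape[-1]

    # Search: S independent query-key attention patterns over the tokens.
    q = np.einsum("nd,sdh->snh", x, p["Wq"])
    k = np.einsum("nd,sdh->snh", x, p["Wk"])
    attn = softmax(np.einsum("snh,smh->snm", q, k) / np.sqrt(d_head))

    # Retrieval: R independent value projections, shared by all searches.
    v = np.einsum("nd,rdh->rnh", x, p["Wv"])

    # Pair every search with every retrieval: shape (S, N, R, d_head).
    o = np.einsum("snm,rmh->snrh", attn, v)

    # Each search softly selects among the R retrieved candidates per token.
    q_r = np.einsum("nd,sde->sne", x, p["Wq_r"])
    k_r = np.einsum("snrh,he->snre", o, p["Wk_r"])
    sel = softmax(np.einsum("sne,snre->snr", q_r, k_r) / np.sqrt(d_r))
    mixed = np.einsum("snr,snrh->snh", sel, o)

    # Concatenate the S search outputs and project back to d_model.
    concat = mixed.transpose(1, 0, 2).reshape(x.shape[0], -1)
    return concat @ p["Wo"], sel


rng = np.random.default_rng(0)
params = init_params(rng, d_model=16, n_search=4, n_retrieval=2, d_head=8, d_r=4)
x = rng.normal(size=(5, 16))
out, sel = compositional_attention(x, params)  # out: (5, 16), sel: (4, 5, 2)
```

In standard multi-head attention each head has its own fixed value projection. In this sketch the number of searches and retrievals can differ, and the pairing between them is chosen per token by the `sel` weights.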

lgusm · June 27, 2022, 9:52am

Nice work! Congrats!