Tag
attention
5 verified claims carrying this tag. Each has 2+ primary sources and an HMAC-SHA256 signature.
Transformer architecture introduced in paper: Attention Is All You Need (Vaswani et al., 2017).
ad17e76a8baad7a1 · 3 sources · 100% confidence
Grouped-Query Attention (GQA) introduced in paper: GQA: Training Generalized Multi-Query Transformer Models from Multi-Head Checkpoints (Ainslie et al., 2023).
3e9122ba60a3fe99 · 3 sources · 92% confidence
FlashAttention-2 introduced in paper: FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning (Dao, 2023).
786f534a9f79a3be · 3 sources · 92% confidence
Additive (Bahdanau) attention introduced in paper: Neural Machine Translation by Jointly Learning to Align and Translate (Bahdanau et al., 2014).
bbf65d37f2df1971 · 2 sources · 82% confidence
Pointer Networks introduced in paper: Pointer Networks (Vinyals et al., 2015).
42bcab03ff502de2 · 2 sources · 82% confidence