numberOfAttentionHeads
P130209
predicate
Indicates the number of distinct attention heads used within an attention mechanism or layer in a model.
Observed surface forms (1)
- numAttentionHeads ×2