This video shows how the Transformer Encoder Layer Normalization works. This is the layer immediately after the Attention Layer and the Positional Encoding Layer.
0:00 Transformer Layer Normalization Equation
1:56 Expected Value
2:46 Variance
torch version - 1.10.0
Download
0 formats
No download links available.
torch.nn.TransformerEncoderLayer - Part 5 - Transformer Encoder Second Layer Normalization | NatokHD