Debugging Issues With Autograd | Multi Head Attention Crashed
Watch me live fixing issues with multi head attention and tensor softmax forward functions. Looks like I need to build this MHA from start for building computational graph for backpropagation
https://www.github.com/umairgillani93/miniTorch
#machinelearning #ai #coding #transformers
Download
0 formats
No download links available.
Debugging Issues With Autograd | Multi Head Attention Crashed | NatokHD