Wide or deep? Pros and cons; The vanishing gradients problem; Rectified Linear Units; Different activations: when and how; Loss functions
No download links available.