Second assignment: multilayer perceptron
1, Linear neural network
(1) Linear regression
1. Linear model
A linear model can be viewed as a single-layer neural network.
2. Loss function
The loss function quantifies the difference between the target's actual value and its predicted value.
3. Analytical solution
4. Optimization method: minibatch grad ...
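Items 1 to 3 of the outline above can be illustrated with a short NumPy sketch. It is not taken from the post: the squared-error form of the loss, the synthetic noise-free data, and the names `squared_loss`, `analytic_solution`, `true_w` and `true_b` are all assumptions chosen for illustration.

```python
import numpy as np


def squared_loss(y_hat, y):
    """Mean of 0.5 * (y_hat - y)^2 over all examples (assumed loss form)."""
    return float(np.mean(0.5 * (y_hat - y) ** 2))


def analytic_solution(X, y):
    """Least-squares fit of y ~ X @ w + b.

    A column of ones is appended to X so the bias is learned as the
    last entry of the returned vector.
    """
    X_aug = np.hstack([X, np.ones((X.shape[0], 1))])
    w, *_ = np.linalg.lstsq(X_aug, y, rcond=None)
    return w


# Synthetic, noise-free data (hypothetical values, for illustration only).
rng = np.random.default_rng(0)
true_w = np.array([2.0, -3.4])
true_b = 4.2
X = rng.normal(size=(200, 2))
y = X @ true_w + true_b

# Linear model: one weighted sum per example, i.e. a single-layer network.
w = analytic_solution(X, y)
y_hat = X @ w[:2] + w[2]
loss = squared_loss(y_hat, y)
```

Because the data here contain no noise, the fitted weights should match `true_w` and `true_b` up to floating-point error, and the loss should be close to zero. `np.linalg.lstsq` is used instead of explicitly inverting `X.T @ X`, which is usually the more numerically stable choice.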
Added by cyberlew15 on Mon, 13 Sep 2021 04:38:22 +0300
NIPS 2015 - STN (Spatial Transformer Network): a spatial transformation module for neural networks (including code reproduction)
Paper reading method
Three-pass reading method
First pass
CNNs have excelled in computer vision and have replaced traditional methods in many areas. However, the convolutional neural network architecture lacks spatial invariance. Even if convolution and max pooling opera ...
Added by hacksurfin on Sun, 12 Sep 2021 23:09:02 +0300
NLP Star Sky intelligent dialogue robot series for natural language processing: an in-depth understanding of the Transformer's multi-head attention architecture, part 1
This article starts with the architecture of the Transformer's multi-head attention layer, and then uses a Python coding example to help readers understand the mu ...
Added by R_P on Sat, 11 Sep 2021 04:18:23 +0300