Definition of Neural Networks
Neural networks are function approximators that stack affine transformations followed by non-linear transformations.
1D Input Linear Neural Networks
- Input is 1-D, output is 1-D.
- Data are points on a 2-D plane.
- Model:
y_hat = wx + b
- Loss: mean squared error (MSE), MSE = (1/N) * sum_i (y_i - y_hat_i)^2
- The MSE loss is minimized by partially differentiating it with respect to each parameter.
- Backpropagation computes the partial derivatives of the loss with respect to all parameters via the chain rule.
- Gradient descent updates each individual weight by subtracting its partial derivative scaled by the stepsize, e.g. w <- w - eta * (dL/dw).
- Eta (η) is the stepsize (learning rate). A sketch of this training loop follows below.
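A minimal numpy sketch of this loop, fitting y_hat = w*x + b with gradient descent (the target line y = 2x + 1, the stepsize, and the step count are illustrative assumptions):

```python
import numpy as np

# Toy data: points roughly on the line y = 2x + 1 (assumed for illustration).
rng = np.random.default_rng(0)
x = rng.uniform(-1, 1, size=100)
y = 2.0 * x + 1.0 + 0.1 * rng.normal(size=100)

w, b = 0.0, 0.0   # parameters of the model y_hat = w*x + b
eta = 0.1         # stepsize (learning rate)

for step in range(200):
    y_hat = w * x + b
    # MSE loss: (1/N) * sum_i (y_i - y_hat_i)^2
    loss = np.mean((y - y_hat) ** 2)
    # Partial derivatives of the loss w.r.t. w and b (backpropagation by hand)
    dw = np.mean(-2.0 * (y - y_hat) * x)
    db = np.mean(-2.0 * (y - y_hat))
    # Gradient descent update: parameter <- parameter - eta * gradient
    w -= eta * dw
    b -= eta * db

print(w, b)  # should approach 2.0 and 1.0
```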
Multi-Dimensional Input
- Model:
y = W^T x + b
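For instance, a minimal numpy sketch of this affine map (the 3-D input, 2-D output, and all-ones weights are arbitrary assumptions for illustration):

```python
import numpy as np

# Affine map from 3-D input to 2-D output: y = W^T x + b
W = np.ones((3, 2))   # weight matrix, shape (in_dim, out_dim)
b = np.zeros(2)       # bias vector, shape (out_dim,)
x = np.array([1.0, 2.0, 3.0])

y = W.T @ x + b       # shape (2,)
print(y)              # [6. 6.]
```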
Multi-Layer Perceptron
Stacking layers of matrix (affine) transformations, with a non-linear transformation (activation function) inserted between the stacked layers.
- Model:
y = W2^T ρ(W1^T x + b1) + b2, where ρ (rho) is the non-linear activation function
- Universal Approximation Theorem: there exists a single-hidden-layer feedforward network that approximates any measurable function to any desired degree of accuracy on some compact set K. (The theorem guarantees existence only; it says nothing about how to find such a network.)
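A minimal numpy sketch of a two-layer MLP forward pass (ReLU as the activation ρ and the 3 -> 4 -> 2 layer sizes are assumptions for illustration):

```python
import numpy as np

def relu(z):
    # Non-linear activation ρ, applied element-wise
    return np.maximum(0.0, z)

# Two-layer MLP: y = W2^T ρ(W1^T x + b1) + b2
rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(3, 4)), np.zeros(4)
W2, b2 = rng.normal(size=(4, 2)), np.zeros(2)

x = np.array([1.0, -0.5, 0.25])
h = relu(W1.T @ x + b1)   # hidden layer: affine map, then non-linearity
y = W2.T @ h + b2         # output layer: affine map only
print(y)
```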
Loss Functions
- Regression Task: mean squared error (MSE) loss
- Classification Task: cross-entropy loss
- Probabilistic Task: maximum likelihood estimation (MLE), i.e. minimizing a negative log-likelihood loss
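Minimal numpy sketches of these three losses (the function names and the unit-variance Gaussian used for the probabilistic case are assumptions for illustration):

```python
import numpy as np

# Regression: mean squared error, (1/N) * sum_i (y_i - y_hat_i)^2
def mse(y, y_hat):
    return np.mean((y - y_hat) ** 2)

# Classification: cross entropy between one-hot targets and predicted
# class probabilities (each row of p_pred sums to 1).
def cross_entropy(p_true, p_pred, eps=1e-12):
    return -np.mean(np.sum(p_true * np.log(p_pred + eps), axis=1))

# Probabilistic: negative log-likelihood under a Gaussian with unit
# variance (assumed), which reduces to MSE up to constants.
def gaussian_nll(y, mu):
    return np.mean(0.5 * (y - mu) ** 2 + 0.5 * np.log(2.0 * np.pi))

y, y_hat = np.array([1.0, 2.0]), np.array([1.1, 1.9])
print(mse(y, y_hat))
onehot = np.array([[1.0, 0.0], [0.0, 1.0]])
probs  = np.array([[0.8, 0.2], [0.3, 0.7]])
print(cross_entropy(onehot, probs))
print(gaussian_nll(y, y_hat))
```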