Neural Networks are a common architecture used for non-linear machine learning problems. In the real world, much of the data we encounter is complex and non-linear, and this is where Neural Networks shine.
A simple example of this is the XOR gate problem. The table below lists the inputs and outputs for the XOR gate.
| Input 1 | Input 2 | Output |
|---|---|---|
| 0 | 0 | 0 |
| 1 | 0 | 1 |
| 1 | 1 | 0 |
| 0 | 1 | 1 |
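For reference, the truth table above can be written out as a tiny dataset. This is a minimal sketch assuming NumPy; the names `X` and `y` are my own choice for illustration.

```python
import numpy as np

# XOR truth table: each row of X is (Input 1, Input 2),
# and the matching row of y is the expected output.
X = np.array([[0, 0],
              [1, 0],
              [1, 1],
              [0, 1]])
y = np.array([[0],
              [1],
              [0],
              [1]])
```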
Plotting the input points for our problem makes the difficulty clear.
It is evident that a straight line cannot be drawn to separate (1,0) and (0,1) on one side and (0,0) and (1,1) on the other. This makes XOR a good problem for a Neural Network.
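The plot referenced above can be recreated with a quick sketch; assuming matplotlib is available, something like the following draws the four points with the two output classes marked differently.

```python
import matplotlib.pyplot as plt

# Points grouped by their XOR output, taken from the table above.
ones = [(1, 0), (0, 1)]
zeros = [(0, 0), (1, 1)]

plt.scatter(*zip(*ones), marker="o", label="output = 1")
plt.scatter(*zip(*zeros), marker="x", label="output = 0")
plt.xlabel("Input 1")
plt.ylabel("Input 2")
plt.legend()
plt.title("XOR inputs: no straight line separates the two classes")
plt.show()
```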
Our Neural Network is made up of units referred to as Neurons. Each neuron has a weight for every input it receives; a weight is a value that represents how much of a say that input has in the neuron's output. A larger value indicates more influence, while a smaller value indicates less. At initialization we want these weights to be random values.
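As a sketch, randomly initializing a single neuron's weights and bias might look like the following (the shapes assume the two XOR inputs; the variable names are illustrative).

```python
import numpy as np

rng = np.random.default_rng(42)   # fixed seed so the example is reproducible

n_inputs = 2                                       # two inputs per XOR example
weights = rng.uniform(-1, 1, size=(n_inputs, 1))   # one weight per input
bias = rng.uniform(-1, 1)                          # single bias term
```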
Training data is passed to our neurons. The dot product of the inputs and weights is added to the bias, and the resulting value is passed to the activation function. In this case we are using the sigmoid function as our activation function.
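A minimal sketch of that forward pass, reusing the `X`, `weights`, and `bias` defined above, might look like this.

```python
import numpy as np

def sigmoid(z):
    """Squash any real number into the range (0, 1)."""
    return 1 / (1 + np.exp(-z))

# Dot product of inputs and weights, plus the bias, then the activation.
z = X @ weights + bias     # one raw value per training example
predictions = sigmoid(z)   # one prediction per training example, each in (0, 1)
```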
What is an activation function?
An activation function allows us to introduce non-linearity
into our network. Stacking linear functions only ever produces another linear function; regardless of the number of layers, we would still end up with a linear function.
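To see why, here is a small illustrative check (my own example): two linear layers applied one after the other are exactly equivalent to a single linear layer whose weight matrix is the product of the two.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(size=(4, 2))    # a batch of inputs
W1 = rng.normal(size=(2, 3))   # first layer, no activation
W2 = rng.normal(size=(3, 1))   # second layer, no activation

two_layers = (x @ W1) @ W2     # stacking two linear layers...
one_layer = x @ (W1 @ W2)      # ...collapses into one linear layer

print(np.allclose(two_layers, one_layer))  # True
```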
As indicated in the plot, a straight line is not enough to represent this relation; we need a curved, non-linear function, which we can achieve by applying an activation function. We are using the sigmoid function, which can be represented as the following:

$$\sigma(x) = \frac{1}{1 + e^{-x}}$$
A few reasons Sigmoid makes for a good activation function:

- The output is always between 0 and 1. As $x$ approaches $-\infty$ the function approaches 0, and as $x$ approaches $+\infty$ it approaches 1.
- The function is differentiable, which is important for gradient descent.
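On the second point, the sigmoid's derivative has a convenient closed form, $\sigma'(x) = \sigma(x)\,(1 - \sigma(x))$, which the training step below relies on. A short sketch, reusing the `sigmoid` helper from earlier:

```python
def sigmoid_derivative(z):
    # Slope of the sigmoid at z: sigma(z) * (1 - sigma(z)).
    s = sigmoid(z)
    return s * (1 - s)
```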
To get an idea of how our network is performing we calculate the error; based on this, the weights and bias can be adjusted to get better results from our model. Error is calculated as the difference between the expected output $y$ and the network's prediction $\hat{y}$:

$$\text{error} = y - \hat{y}$$
The error is then multiplied by the slope of the sigmoid at the output. Predictions that are near 0 or 1 can be considered high-confidence predictions; multiplying by the slope essentially means multiplying by 0, as there is not much slope near the top or bottom of the curve. Predictions near the middle, where the slope is steep, keep most of their error, so the adjustment is largest where the network is least confident.
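Putting the error and the slope together, a sketch of that scaling step (reusing the names defined above; the variable name `delta` is my own) could look like this.

```python
# How far off each prediction is.
error = y - predictions

# Scale the error by the sigmoid's slope at each prediction:
# confident predictions (near 0 or 1) get tiny adjustments,
# uncertain predictions (near 0.5) get larger ones.
delta = error * sigmoid_derivative(z)
```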
To train our network we go backwards and update the values of our weights and bias. Gradient Descent is used to find the weights and bias that give the lowest error: we iteratively update them, inching closer and closer to the minimum error value.
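Tying the pieces above together, here is a compact end-to-end sketch of training with gradient descent. The hidden layer size, learning rate, and iteration count are illustrative assumptions (a hidden layer is needed for the network to actually learn XOR), so the original network may be arranged differently.

```python
import numpy as np

def sigmoid(z):
    return 1 / (1 + np.exp(-z))

def sigmoid_derivative(z):
    s = sigmoid(z)
    return s * (1 - s)

# XOR data.
X = np.array([[0, 0], [1, 0], [1, 1], [0, 1]])
y = np.array([[0], [1], [0], [1]])

rng = np.random.default_rng(42)
W1 = rng.uniform(-1, 1, size=(2, 4))   # input -> hidden weights
b1 = rng.uniform(-1, 1, size=(1, 4))   # hidden biases
W2 = rng.uniform(-1, 1, size=(4, 1))   # hidden -> output weights
b2 = rng.uniform(-1, 1, size=(1, 1))   # output bias
learning_rate = 1.0

for step in range(10_000):
    # Forward pass.
    z1 = X @ W1 + b1
    hidden = sigmoid(z1)
    z2 = hidden @ W2 + b2
    output = sigmoid(z2)

    # Error, scaled by the sigmoid slopes (backward pass).
    error = y - output
    delta2 = error * sigmoid_derivative(z2)
    delta1 = (delta2 @ W2.T) * sigmoid_derivative(z1)

    # Gradient descent step: nudge weights and biases to reduce the error.
    W2 += learning_rate * (hidden.T @ delta2)
    b2 += learning_rate * delta2.sum(axis=0, keepdims=True)
    W1 += learning_rate * (X.T @ delta1)
    b1 += learning_rate * delta1.sum(axis=0, keepdims=True)

print(output.round(3))  # should end up close to [[0], [1], [0], [1]]
```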