Implementing a multilayer perceptron from scratch with NumPy as an introduction to artificial neural networks
This project builds a modular multilayer perceptron from scratch, using only the NumPy library, as an introduction to the basics of neural networks. Trained on a dataset describing the characteristics of cell nuclei from breast masses extracted by fine-needle aspiration, the model learns to predict whether a cell is tumoral or not with an accuracy higher than 90%.
Before designing a good model, we need to understand the dataset by visualizing it; this is done with the visualize.py program.
(For legibility, this pairplot does not show the entire dataset.)
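As an illustration, a pairplot of this kind can be produced with pandas and seaborn; the file name and the diagnosis column below are assumptions about the dataset, and visualize.py may do this differently:

import pandas as pd
import seaborn as sns
import matplotlib.pyplot as plt

# Load the dataset; the file name and the "diagnosis" label column
# are placeholders, the real headers may differ.
df = pd.read_csv("data.csv")

# Plot only a handful of features against each other, colored by the
# diagnosis label, to keep the figure legible.
features = [c for c in df.columns if c != "diagnosis"][:5]
sns.pairplot(df, vars=features, hue="diagnosis")
plt.show()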
Before training, the dataset is split into two parts, one for training and one for validation, so that the model's performance can be assessed robustly.
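A minimal sketch of such a split using only NumPy (the 80/20 ratio and the helper name are illustrative, not necessarily the project's code):

import numpy as np

def train_test_split(x, y, train_ratio=0.8, seed=42):
    # Shuffle indices so both splits follow the same distribution
    rng = np.random.default_rng(seed)
    indices = rng.permutation(len(x))
    cut = int(len(x) * train_ratio)
    train_idx, test_idx = indices[:cut], indices[cut:]
    return x[train_idx], y[train_idx], x[test_idx], y[test_idx]

x_train, y_train, x_test, y_test = train_test_split(x, y)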
The training curves are displayed after training to visualize the model's ability to make correct predictions.
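For example, assuming fit() stores the per-epoch metrics in a history dictionary (an assumption, not necessarily the project's actual API), the curves can be drawn with matplotlib:

import matplotlib.pyplot as plt

# 'history' is assumed to map metric names to per-epoch lists.
fig, (ax1, ax2) = plt.subplots(1, 2, figsize=(10, 4))
ax1.plot(history["train_loss"], label="train")
ax1.plot(history["val_loss"], label="validation")
ax1.set_xlabel("epoch")
ax1.set_ylabel("loss")
ax1.legend()
ax2.plot(history["train_acc"], label="train")
ax2.plot(history["val_acc"], label="validation")
ax2.set_xlabel("epoch")
ax2.set_ylabel("accuracy")
ax2.legend()
plt.show()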

To make the implementation modular and robust, several building blocks are available (two implementation sketches follow the list):
- Layer types:
  - Dense Layer
  - Activation Layer
  - Softmax Layer
  - Dropout Layer
- Activation functions:
  - Sigmoid
  - ReLU (Rectified Linear Unit)
  - Tanh
  - ELU (Exponential Linear Unit)
- Optimizers:
  - SGD (Stochastic Gradient Descent)
  - SGDM (Stochastic Gradient Descent with Momentum)
  - Adam (Adaptive Moment Estimation)
  - RMSProp (Root Mean Squared Propagation)
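As a sketch of how the activation functions work: each one needs a forward pass and its derivative for backpropagation. The functions below are illustrative, not the project's exact code:

import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def sigmoid_prime(x):
    s = sigmoid(x)
    return s * (1.0 - s)  # derivative expressed via the forward value

def relu(x):
    return np.maximum(0.0, x)

def relu_prime(x):
    return (x > 0).astype(x.dtype)  # 1 where the input was positive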
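Likewise, each optimizer only differs in how it turns a gradient into a parameter update. A minimal SGDM step could look like this (the class name mirrors AdamOptimizer below but is an assumption):

import numpy as np

class SGDMOptimizer:
    def __init__(self, momentum=0.9):
        self.momentum = momentum
        self.velocity = None

    def update(self, weights, grad, lr):
        # Accumulate an exponentially decaying average of past gradients,
        # then move the weights along that smoothed direction.
        if self.velocity is None:
            self.velocity = np.zeros_like(weights)
        self.velocity = self.momentum * self.velocity - lr * grad
        return weights + self.velocity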
Here is an example of how these building blocks are combined in the code:
# Define the architecture of the model
net = NeuralNetwork(
    [
        DenseLayer(x_train.shape[1], 30),
        ActivationLayer("relu"),
        DropoutLayer(0.1),
        DenseLayer(30, 20),
        ActivationLayer("tanh"),
        DenseLayer(20, 10),
        ActivationLayer("relu"),
        DenseLayer(10, output_shape),
        SoftmaxLayer(),
    ]
)

# Train the model
net.fit(
    train=(x_train, y_train),
    test=(x_test, y_test),
    epochs=500,
    lr=0.001,
    batch_size=16,
    optimizer=AdamOptimizer(),
)

We can print the model's architecture with print(net):
+-----------------+--------------+----------+
| Layer | Function | Shape |
+-----------------+--------------+----------+
| DenseLayer | Weighted Sum | (10, 30) |
| ActivationLayer | Relu | |
| DropoutLayer | Dropout | |
| DenseLayer | Weighted Sum | (30, 20) |
| ActivationLayer | Tanh | |
| DenseLayer | Weighted Sum | (20, 10) |
| ActivationLayer | Relu | |
| DenseLayer | Weighted Sum | (10, 2) |
| SoftmaxLayer | Softmax | |
+-----------------+--------------+----------+
After training a model, we can use it to make predictions on unseen data. To assess the model's performance, we refer to the confusion matrix.
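For example, the confusion matrix can be computed directly with NumPy; the predict method and the one-hot labels below are assumptions about the project's API:

import numpy as np

# Assumed API: net.predict returns one probability vector per sample.
probs = net.predict(x_test)
y_pred = np.argmax(probs, axis=1)
y_true = np.argmax(y_test, axis=1)  # labels assumed one-hot encoded

n_classes = y_test.shape[1]
confusion = np.zeros((n_classes, n_classes), dtype=int)
for t, p in zip(y_true, y_pred):
    confusion[t, p] += 1  # rows: true class, columns: predicted class
print(confusion)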

To compare different architectures and find the most suitable one, we can use the benchmark.py program.
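As a sketch of what such a comparison might look like (the candidate architectures and the accuracy helper are illustrative assumptions, not benchmark.py itself):

# Candidate hidden-layer sizes to compare (illustrative).
candidates = [[30, 20, 10], [64, 32], [16]]

for hidden in candidates:
    layers = []
    in_size = x_train.shape[1]
    for out_size in hidden:
        layers += [DenseLayer(in_size, out_size), ActivationLayer("relu")]
        in_size = out_size
    layers += [DenseLayer(in_size, output_shape), SoftmaxLayer()]
    net = NeuralNetwork(layers)
    net.fit(train=(x_train, y_train), test=(x_test, y_test),
            epochs=500, lr=0.001, batch_size=16, optimizer=AdamOptimizer())
    # Assumed helper: evaluates the trained model on the validation split.
    print(hidden, net.accuracy(x_test, y_test))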
