MPNeuralNetwork 🧠

A fully vectorized Deep Learning framework built from scratch using only NumPy and CuPy.

Philosophy & Goal

In an era of high-level frameworks like PyTorch or TensorFlow, it is easy to treat Neural Networks as "black boxes".

MPNeuralNetwork is an engineering initiative designed to demystify the underlying mathematics of Deep Learning. By rebuilding the engine from the ground up, I aimed to bridge the gap between theoretical equations and production-grade code.

Key Objectives:

Mathematical Rigor: Implementing backpropagation, chain rule derivatives, and loss functions manually.
Performance Optimization: Moving from naive scalar loops to fully vectorized matrix operations and implementing im2col for convolutions.
Software Architecture: Applying SOLID principles for a modular design.

Key Features

MPNeuralNetwork goes beyond basic matrix operations by incorporating an "intelligent" engine.

Fully Vectorized: Optimized for batch processing. Convolutions use im2col for hardware acceleration.
GPU Acceleration: Seamless support for NVIDIA GPUs via CuPy. Switch backends with a single environment variable.
Early Stopping & Checkpointing: Automatically monitors validation loss and restores the best weights.
Smart Initialization: Automatically applies He Init (for ReLU) or Xavier (for Sigmoid/Tanh).
Comprehensive Regularization: Supports Dropout, L1/L2 Weight Decay (AdamW style).
Numerical Stability: Internally handles logits for Softmax/Sigmoid to prevent overflow.
Full Serialization: Save/Load model state to .npz files.

👉 Learn more about the internal engine

Component Inventory

Category	Available Components
Layers	`Dense`, `Convolutional`, `MaxPooling2D`, `AveragePooling2D`, `Dropout`, `BatchNormalization`, `Flatten`
Activations	`ReLU`, `Sigmoid`, `Tanh`, `Softmax`, `PReLU`, `Swish`
Optimizers	`SGD`, `RMSprop`, `AdamW`
Losses	`MSE`, `BinaryCrossEntropy`, `CategoricalCrossEntropy`

Installation

pip install mpneuralnetwork

Quick Start

MNIST Classification

import numpy as np
from sklearn.datasets import fetch_openml
from sklearn.preprocessing import OneHotEncoder
from mpneuralnetwork.layers import Dense, Dropout
from mpneuralnetwork.activations import ReLU
from mpneuralnetwork.losses import CategoricalCrossEntropy
from mpneuralnetwork.optimizers import Adam
from mpneuralnetwork.model import Model

# 1. Load Data (MNIST)
X, y = fetch_openml('mnist_784', version=1, return_X_y=True, as_frame=False)
X = (X / 255.0).astype(np.float32) # Normalize & Float32
y = OneHotEncoder(sparse_output=False).fit_transform(y.reshape(-1, 1))

# 2. Define the Architecture
network = [
    Dense(128, input_size=784), # Auto-He Init
    ReLU(),
    Dropout(0.2),
    Dense(10)                   # Output Logits
]

# 3. Initialize
model = Model(
    layers=network,
    loss=CategoricalCrossEntropy(),
    optimizer=Adam(learning_rate=0.001)
)

# 4. Train (Auto-Validation Split)
model.train(X, y, epochs=5, batch_size=64, auto_evaluation=0.2)

👉 See full tutorials in the User Guide

Architecture & Performance

Vectorization

The training loop handles 3D/2D tensors, replacing slow Python loops with NumPy's BLAS routines. Convolutional layers use the im2col technique, transforming convolutions into efficient Matrix Multiplications (GEMM).

Optimization

The framework enforces Float32 precision globally to halve memory usage and double bandwidth. Recent benchmarks show a 26% speedup and 50% memory reduction compared to the initial implementation.

👉 Read the Optimization & Benchmarking Guide

Roadmap

The roadmap has been moved to our GitHub Project board.

Author

Maxime Pires - AI Engineer | CentraleSupelec

LinkedIn | Portfolio