MPNeuralNetwork ðŸ§
A fully vectorized Deep Learning framework built from scratch using only NumPy and CuPy.
📖 Read the Full Documentation
Philosophy & Goal
In an era of high-level frameworks like PyTorch or TensorFlow, it is easy to treat Neural Networks as "black boxes".
MPNeuralNetwork is an engineering initiative designed to demystify the underlying mathematics of Deep Learning. By rebuilding the engine from the ground up, I aimed to bridge the gap between theoretical equations and production-grade code.
Key Objectives:
- Mathematical Rigor: Implementing backpropagation, chain rule derivatives, and loss functions manually.
- Performance Optimization: Moving from naive scalar loops to fully vectorized matrix operations and implementing
im2colfor convolutions. - Software Architecture: Applying SOLID principles for a modular design.
Key Features
MPNeuralNetwork goes beyond basic matrix operations by incorporating an "intelligent" engine.
- Fully Vectorized: Optimized for batch processing. Convolutions use
im2colfor hardware acceleration. - GPU Acceleration: Seamless support for NVIDIA GPUs via CuPy. Switch backends with a single environment variable.
- Early Stopping & Checkpointing: Automatically monitors validation loss and restores the best weights.
- Smart Initialization: Automatically applies He Init (for ReLU) or Xavier (for Sigmoid/Tanh).
- Comprehensive Regularization: Supports Dropout, L1/L2 Weight Decay (AdamW style).
- Numerical Stability: Internally handles logits for Softmax/Sigmoid to prevent overflow.
- Full Serialization: Save/Load model state to
.npzfiles.
👉 Learn more about the internal engine
Component Inventory
| Category | Available Components |
|---|---|
| Layers | Dense, Convolutional, MaxPooling2D, AveragePooling2D, Dropout, BatchNormalization, Flatten |
| Activations | ReLU, Sigmoid, Tanh, Softmax, PReLU, Swish |
| Optimizers | SGD, RMSprop, AdamW |
| Losses | MSE, BinaryCrossEntropy, CategoricalCrossEntropy |
Installation
pip install mpneuralnetwork
Quick Start
MNIST Classification
import numpy as np
from sklearn.datasets import fetch_openml
from sklearn.preprocessing import OneHotEncoder
from mpneuralnetwork.layers import Dense, Dropout
from mpneuralnetwork.activations import ReLU
from mpneuralnetwork.losses import CategoricalCrossEntropy
from mpneuralnetwork.optimizers import Adam
from mpneuralnetwork.model import Model
# 1. Load Data (MNIST)
X, y = fetch_openml('mnist_784', version=1, return_X_y=True, as_frame=False)
X = (X / 255.0).astype(np.float32) # Normalize & Float32
y = OneHotEncoder(sparse_output=False).fit_transform(y.reshape(-1, 1))
# 2. Define the Architecture
network = [
Dense(128, input_size=784), # Auto-He Init
ReLU(),
Dropout(0.2),
Dense(10) # Output Logits
]
# 3. Initialize
model = Model(
layers=network,
loss=CategoricalCrossEntropy(),
optimizer=Adam(learning_rate=0.001)
)
# 4. Train (Auto-Validation Split)
model.train(X, y, epochs=5, batch_size=64, auto_evaluation=0.2)
👉 See full tutorials in the User Guide
Architecture & Performance
Vectorization
The training loop handles 3D/2D tensors, replacing slow Python loops with NumPy's BLAS routines. Convolutional layers use the im2col technique, transforming convolutions into efficient Matrix Multiplications (GEMM).
Optimization
The framework enforces Float32 precision globally to halve memory usage and double bandwidth. Recent benchmarks show a 26% speedup and 50% memory reduction compared to the initial implementation.
👉 Read the Optimization & Benchmarking Guide
Roadmap
The roadmap has been moved to our GitHub Project board.
Author
Maxime Pires - AI Engineer | CentraleSupelec