Loss Functions API

Loss functions quantify how well the model's predictions match the ground truth. They provide the gradient signal used for optimization.

Base Loss

`mpneuralnetwork.losses.Loss`

Base class for loss functions.

Computes the error between the model's predictions and the true values, as well as the gradient of this error for backpropagation.

Source code in src/mpneuralnetwork/losses.py

class Loss:
    """Base class for loss functions.

    Computes the error between the model's predictions and the true values,
    as well as the gradient of this error for backpropagation.
    """

    @abstractmethod
    def direct(self, output: ArrayType, output_expected: ArrayType) -> float:
        """Computes the scalar loss value.

        Args:
            output (ArrayType): Model predictions.
            output_expected (ArrayType): Ground truth values.

        Returns:
            float: The loss value.
        """
        pass

    @abstractmethod
    def prime(self, output: ArrayType, output_expected: ArrayType) -> ArrayType:
        """Computes the gradient of the loss function w.r.t the input.

        Args:
            output (ArrayType): Model predictions.
            output_expected (ArrayType): Ground truth values.

        Returns:
            ArrayType: Gradient of the loss.
        """
        pass

    def get_config(self) -> dict:
        return {"type": self.__class__.__name__}

`direct(output, output_expected)` `abstractmethod`

Computes the scalar loss value.

Parameters:

Name	Type	Description	Default
`output`	`ArrayType`	Model predictions.	required
`output_expected`	`ArrayType`	Ground truth values.	required

Returns:

Name	Type	Description
`float`	`float`	The loss value.

Source code in src/mpneuralnetwork/losses.py

@abstractmethod
def direct(self, output: ArrayType, output_expected: ArrayType) -> float:
    """Computes the scalar loss value.

    Args:
        output (ArrayType): Model predictions.
        output_expected (ArrayType): Ground truth values.

    Returns:
        float: The loss value.
    """
    pass

`prime(output, output_expected)` `abstractmethod`

Computes the gradient of the loss function w.r.t the input.

Parameters:

Name	Type	Description	Default
`output`	`ArrayType`	Model predictions.	required
`output_expected`	`ArrayType`	Ground truth values.	required

Returns:

Name	Type	Description
`ArrayType`	`ArrayType`	Gradient of the loss.

Source code in src/mpneuralnetwork/losses.py

@abstractmethod
def prime(self, output: ArrayType, output_expected: ArrayType) -> ArrayType:
    """Computes the gradient of the loss function w.r.t the input.

    Args:
        output (ArrayType): Model predictions.
        output_expected (ArrayType): Ground truth values.

    Returns:
        ArrayType: Gradient of the loss.
    """
    pass

Regression

`mpneuralnetwork.losses.MSE`

Bases: Loss

Mean Squared Error (MSE) Loss.

Formula

L = (1/N) * sum((y_pred - y_true)^2)

Derivative

dL/dy_pred = (2/N) * (y_pred - y_true)

Used for regression problems.

Source code in src/mpneuralnetwork/losses.py

class MSE(Loss):
    """Mean Squared Error (MSE) Loss.

    Formula:
        `L = (1/N) * sum((y_pred - y_true)^2)`

    Derivative:
        `dL/dy_pred = (2/N) * (y_pred - y_true)`

    Used for regression problems.
    """

    def direct(self, output: ArrayType, output_expected: ArrayType) -> float:
        res: float = xp.mean(
            xp.sum(xp.square(output_expected - output), axis=1, dtype=DTYPE),
            dtype=DTYPE,
        )
        return res

    def prime(self, output: ArrayType, output_expected: ArrayType) -> ArrayType:
        grad: ArrayType = 2 * (output - output_expected) / output.shape[0]
        return grad

Classification

`mpneuralnetwork.losses.BinaryCrossEntropy`

Bases: Loss

Binary Cross Entropy Loss.

Formula (conceptually): L = - (y * log(p) + (1-y) * log(1-p))

Implementation details

Assumes the input output corresponds to LOGITS (raw scores), not probabilities. The Sigmoid activation is applied internally using the numerically stable log-sum-exp trick.

Used for binary classification problems.

Source code in src/mpneuralnetwork/losses.py

class BinaryCrossEntropy(Loss):
    """Binary Cross Entropy Loss.

    Formula (conceptually):
        `L = - (y * log(p) + (1-y) * log(1-p))`

    Implementation details:
        Assumes the input `output` corresponds to **LOGITS** (raw scores), not probabilities.
        The Sigmoid activation is applied internally using the numerically stable log-sum-exp trick.

    Used for binary classification problems.
    """

    def __init__(self) -> None:
        self.sigmoid = Sigmoid()

    def direct(self, output: ArrayType, output_expected: ArrayType) -> float:
        loss_per_element = xp.maximum(output, 0) - output * output_expected + xp.log(1 + xp.exp(-xp.abs(output), dtype=DTYPE), dtype=DTYPE)
        res: float = xp.mean(xp.sum(loss_per_element, axis=1, dtype=DTYPE), dtype=DTYPE)
        return res

    def prime(self, output: ArrayType, output_expected: ArrayType) -> ArrayType:
        predictions = self.sigmoid.forward(output)
        grad: ArrayType = (predictions - output_expected) / output.shape[0]
        return grad

`mpneuralnetwork.losses.CategoricalCrossEntropy`

Bases: Loss

Categorical Cross Entropy Loss.

Formula (conceptually): L = - sum(y_i * log(p_i))

Implementation details

Assumes the input output corresponds to LOGITS. The Softmax activation is applied internally.

Used for multi-class classification problems (one-hot encoded targets).

Source code in src/mpneuralnetwork/losses.py

class CategoricalCrossEntropy(Loss):
    """Categorical Cross Entropy Loss.

    Formula (conceptually):
        `L = - sum(y_i * log(p_i))`

    Implementation details:
        Assumes the input `output` corresponds to **LOGITS**.
        The Softmax activation is applied internally.

    Used for multi-class classification problems (one-hot encoded targets).
    """

    def __init__(self, temperature: float = 1.0, epsilon: float = 1e-8) -> None:
        """Initializes the Categorical Cross Entropy loss.

        Args:
            temperature (float, optional): Temperature parameter for the softmax. Defaults to 1.0.
            epsilon (float, optional): Small float added to denominator to avoid dividing by zero. Defaults to 1e-8.
        """
        self.temperature = temperature
        self.epsilon = epsilon
        self.softmax = Softmax(temperature=temperature, epsilon=epsilon)

    def direct(self, output: ArrayType, output_expected: ArrayType) -> float:
        predictions = self.softmax.forward(output)
        res: float = xp.mean(
            -xp.sum(
                output_expected * xp.log(predictions + self.epsilon, dtype=DTYPE),
                axis=1,
                dtype=DTYPE,
            ),
            dtype=DTYPE,
        )
        return res

    def prime(self, output: ArrayType, output_expected: ArrayType) -> ArrayType:
        predictions = self.softmax.forward(output)
        grad: ArrayType = ((predictions - output_expected) / (self.temperature + self.epsilon)) / output.shape[0]
        return grad

`init(temperature=1.0, epsilon=1e-08)`

Initializes the Categorical Cross Entropy loss.

Parameters:

Name	Type	Description	Default
`temperature`	`float`	Temperature parameter for the softmax. Defaults to 1.0.	`1.0`
`epsilon`	`float`	Small float added to denominator to avoid dividing by zero. Defaults to 1e-8.	`1e-08`

Source code in src/mpneuralnetwork/losses.py

def __init__(self, temperature: float = 1.0, epsilon: float = 1e-8) -> None:
    """Initializes the Categorical Cross Entropy loss.

    Args:
        temperature (float, optional): Temperature parameter for the softmax. Defaults to 1.0.
        epsilon (float, optional): Small float added to denominator to avoid dividing by zero. Defaults to 1e-8.
    """
    self.temperature = temperature
    self.epsilon = epsilon
    self.softmax = Softmax(temperature=temperature, epsilon=epsilon)

Loss Functions API

Base Loss

mpneuralnetwork.losses.Loss

direct(output, output_expected) abstractmethod

prime(output, output_expected) abstractmethod

Regression

mpneuralnetwork.losses.MSE

Classification

mpneuralnetwork.losses.BinaryCrossEntropy

mpneuralnetwork.losses.CategoricalCrossEntropy

__init__(temperature=1.0, epsilon=1e-08)

`mpneuralnetwork.losses.Loss`

`direct(output, output_expected)` `abstractmethod`

`prime(output, output_expected)` `abstractmethod`

`mpneuralnetwork.losses.MSE`

`mpneuralnetwork.losses.BinaryCrossEntropy`

`mpneuralnetwork.losses.CategoricalCrossEntropy`

`init(temperature=1.0, epsilon=1e-08)`