LeNet-5 Architecture

LeNet-5 is a classic convolutional neural network (CNN) architecture, which was proposed in the late 1980s. It played an important role in the early development of CNNs and laid the foundation for the subsequent development of deep learning.

Overview

Layer Structure: LeNet-5 consists of two convolutional layers, two pooling layers, and three fully connected layers.
Input Size: The input image size is 32x32 pixels.
Output: The output layer has 10 neurons, corresponding to the 10 possible digit classes (0-9).

Architecture Details

Convolutional Layers

Conv1: 6 filters, 5x5 size, stride 1, padding 2.
Conv2: 16 filters, 5x5 size, stride 1, padding 2.

Pooling Layers

Pool1: 2x2 pooling with stride 2.
Pool2: 2x2 pooling with stride 2.

Fully Connected Layers

FC1: 120 neurons, using the sigmoid activation function.
FC2: 84 neurons, using the sigmoid activation function.
FC3: 10 neurons, using the softmax activation function.

Example

Here is an example of the LeNet-5 architecture:

[Input] -> [Conv1] -> [Pool1] -> [Conv2] -> [Pool2] -> [FC1] -> [FC2] -> [FC3] -> [Output]

Image

For more information about CNNs, please refer to our CNN Tutorial.