by Academic (multiple contributors) • Founded 1980
A Convolutional Neural Network (CNN or ConvNet) is a class of deep neural networks designed to automatically and adaptively learn spatial hierarchies of features from input images and other grid-like data. By using convolutional layers, pooling, and non-linear activations, CNNs excel at recognizing patterns such as edges, textures, and objects with far fewer parameters than fully connected networks. They are foundational to modern computer vision and have enabled breakthroughs in image classification, detection, segmentation, and many other perception tasks.
Uses transformer self-attention instead of convolutions to model global relationships in images.
Replaces convolutions with multi-layer perceptrons that mix spatial and channel information.
Models part-whole relationships using capsules and dynamic routing instead of pooling.
Use hand-crafted features and classical machine learning instead of learned convolutions.