Convolutional Neural Network

CNN does support an extensive amount of use cases, not only computer vision one. Nevertheless, it underperforms with variations e.g. rotations.

The convolutional layers in front of the fully connected layer act as filter, as dot-product with a field e.g., 3×3 of features. The filter overcome the usual sparsity of natural data. Sparsity describes the “not existence” of information e.g., with LIDAR data one some points have a value (> or < 0).

Source: https://www.davidsbatista.net/blog/2018/03/31/SentenceClassificationConvNets/

But stop, often not take under consideration are the pooling layer (combining features), but these layer are highly important. They perform the downsampling e.g., dimensional reduction from 3D (height, width, color) to 2D (height, width, black/white) or reduce the number of feature from 800×600 to 400×300.

Source: https://www.davidsbatista.net/blog/2018/03/31/SentenceClassificationConvNets/

One reason why CNN are so powerful is the fact, that the usual feature engineering is “not” required. The filter and pooling identifies orders e.g., vertical structures.

A nice evolution overview:

Source: https://link.springer.com/article/10.1007/s10462-020-09825-6

And a bit of taxonomy

Source: https://link.springer.com/article/10.1007/s10462-020-09825-6

This entry was posted in Uncategorized. Bookmark the permalink.

Leave a Reply

Your email address will not be published. Required fields are marked *