A overlappartially such that they cover the

A relatively recent innovation, CNNs are a class of deep, neural networks that have been suc-cessfully applied to computer vision problems with state of the art results, best demonstratedin the ILSVRC ImageNet competition (Russakovsky et al. 2015)3. The neuron connection pat-terns are modelled to approximate the structure/organisation of the animal visual cortex. Asin animal models, in order to better identify local patterns, individual cortical neurons onlyrespond to stimuli in a confined spatial region, the receptive field; these receptive fields overlappartially such that they cover the entire visual input, and hence can process a picture in ag-gregate (Wikipedia 2017). Much like in MLPs each layers purpose within a CNN is to identifypatterns within the data. The key difference being that because of the filter, convolution andpooling architecture these neuron clusters have the ability to recognise contextual data, suchas adjacent pixels within an image (or adjacent words within a sentence).The first layers within a CNN detect simple features that can be recognised and interpretedrelatively easy, e.g. an edge of an object. Subsequent layers detect features of features, e.g.a corner, and finally are aggregated to form a representation of the object as a whole e.g. atable. The precise location of a feature is of no consequence as filters are designed to sweep(or convolve) over the image until its entirety has been examined. Pooling serves a relatedfunction, combining the outputs of filters/neuron clusters from one layer into a single neuroninput for the subsequent layer. Max pooling, for example, identifies the maximum value withineach convolution and then presents it to the next neuron layer, sometimes as part of a furtherconvolution and filter set. Finally, a set of dense layers, of the same basic construction as amulti-layer perceptron, are used to make final predictions (typically classifications) of an input.A significant benefit of CNNs is that, when compared to other image classification algorithms,they require relatively little pre-processing. This independence from human intervention, fea-ture design and engineering is of substantial practical and theoretical benefit. Given this thesis’subject matter, and time constraints, this capability was a particularly attractive.

Go Top
x

Hi!
I'm Eleanor!

Would you like to get a custom essay? How about receiving a customized one?

Check it out