Deep Learning

Author: Shubham Shrivastava

Deep learning is a subset of AI and machine learning that uses multi-layered artificial neural networks to deliver state-of-the-art accuracy in tasks such as object detection, speech recognition, language translation and others [1].

Deep learning models differ from traditional machine learning techniques in that they can automatically learn representations from data such as images, video, or text, without hand-coded rules or human domain knowledge. Their highly flexible architectures can learn directly from raw data, and their predictive accuracy tends to improve as more data is provided.

In deep learning, each level learns to transform its input data into a slightly more abstract and composite representation. In an image recognition application, the raw input may be a matrix of pixels; the first representational layer may abstract the pixels and encode edges; the second layer may compose and encode arrangements of edges; the third layer may encode a nose and eyes; and the fourth layer may recognize that the image contains a face. Importantly, a deep learning process can learn which features to optimally place in which level on its own.
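The layer-by-layer transformation described above can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions: the layer sizes, the random (untrained) weights, and the helper names `dense`, `make_layer`, and `forward` are hypothetical and are not taken from any of the projects linked on this page. A trained network would learn its weights from data instead of drawing them at random.

```python
import random


def relu(values):
    """Element-wise ReLU non-linearity."""
    return [max(0.0, v) for v in values]


def dense(inputs, weights, biases):
    """Fully connected layer: out[j] = sum_i inputs[i] * weights[i][j] + biases[j]."""
    return [
        sum(x * row[j] for x, row in zip(inputs, weights)) + biases[j]
        for j in range(len(biases))
    ]


def make_layer(n_in, n_out, rng):
    """Create an untrained layer with random weights (assumption: illustration only)."""
    weights = [[rng.uniform(-1.0, 1.0) for _ in range(n_out)] for _ in range(n_in)]
    biases = [0.0] * n_out
    return weights, biases


def forward(pixels, layers):
    """Pass the input through every layer and keep each intermediate representation."""
    representations = []
    activation = pixels
    for weights, biases in layers:
        activation = relu(dense(activation, weights, biases))
        representations.append(activation)
    return representations


rng = random.Random(0)
layer_sizes = [16, 8, 4, 2]  # hypothetical: 16 "pixels" -> 8 -> 4 -> 2 values
layers = [make_layer(n_in, n_out, rng) for n_in, n_out in zip(layer_sizes, layer_sizes[1:])]
pixels = [rng.random() for _ in range(layer_sizes[0])]

representations = forward(pixels, layers)
for depth, rep in enumerate(representations, start=1):
    print(f"layer {depth}: {len(rep)} values")
```

Each entry of `representations` is the output of one layer and the input to the next, which is the sense in which deeper layers work on progressively more abstract encodings of the raw pixels.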

Deep learning plays a major role in autonomous driving. It is closely linked with computer vision and is used for applications such as Road-Scene Segmentation, Object Detection, Driver Monitoring, and even End-to-End Self-Driving. Some of the terms used most frequently in deep learning are:

DL = NN (Deep Learning = Neural Nets)
DL is a subset of ML (Machine Learning)
MLP: Multilayer Perceptron
DNN: Deep Neural Networks
RNN: Recurrent Neural Networks
LSTM: Long Short-Term Memory
CNN: Convolutional Neural Networks
DBN: Deep Belief Networks
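
To make the "convolutional" part of CNN concrete, here is a minimal sketch of the sliding-window operation a convolutional layer applies to an image. It rests on assumptions: the function name `conv2d` is hypothetical, the kernel is a hand-picked Sobel filter, and the 4x6 image is a toy example. In an actual CNN the kernel values are learned during training, and a library implementation would be used instead of nested Python loops.

```python
def conv2d(image, kernel):
    """Valid (no padding, stride 1) 2D cross-correlation, the operation CNN layers apply."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[r + i][c + j] * kernel[i][j] for i in range(kh) for j in range(kw))
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


# Toy 4x6 grayscale image: dark (0) on the left, bright (1) on the right.
image = [[0, 0, 0, 1, 1, 1] for _ in range(4)]

# Hand-picked Sobel kernel that responds to vertical edges (assumption: not learned).
sobel_x = [
    [-1, 0, 1],
    [-2, 0, 2],
    [-1, 0, 1],
]

response = conv2d(image, sobel_x)
for row in response:
    print(row)  # each row is [0, 4, 4, 0]
```

The response is non-zero only where the window straddles the dark-to-bright boundary, which is how an early layer can come to encode edges.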

A good read: Deep Learning, by Ian Goodfellow, Yoshua Bengio, and Aaron Courville

Some of the applications of deep learning in self-driving are presented here in detail, along with code for reference. Please contact Shubham Shrivastava with any questions or concerns.


Object Detection, Classification, and Localization


SSNet for Semantic Segmentation


Traffic Sign Classification using CNN


References

[1] https://developer.nvidia.com/deep-learning
[2] https://arxiv.org/pdf/1511.00561.pdf