Deep Learning

Understanding and using deep learning networks

What’s new in 19b: Deep Learning Examples

Posted by Johanna Pingel,

Not to be outdone by Heather with her latest features in MATLAB post, Shounak Mitra, Product Manager for Deep Learning Toolbox, offered to post about new deep learning examples. Enjoy!
There are quite a few new deep learning features for 19b, since this was a major release for Deep Learning. Instead of listing all the new features, I'm listing the new examples which do a great job of highlighting the new features. There are plenty of new examples listed below, plus I'll highlight the ones I think are the most exciting in each category.

General Deep Learning

One of the new things which deserves attention is Custom Training Loops and Auto Differentiation (Autodiff). This allows for new features such as GANs and Siamese Networks to be available in MATLAB! There are new examples to explore these more advanced concepts:
This example shows how to train a generative adversarial network (GAN) to generate images. This example shows how to train a Siamese network to identify similar images of handwritten characters.
 

Visualization

More and more, people want to understand why a deep learning model is making a certain prediction. These new examples offer new ways to gain more insight into your model.
If you can only check out one example, choose Grad-CAM. This is a highly visual and easy to implement way to visualize network predictions.

Computer Vision & Image Processing

19b was a good release for data preprocessing for deep learning. Here is a nice reference example showing the specifics of data preprocessing for many different applications: Semantic Segmentation, Object Detection, Image Classification, and more.
For more image processing and computer vision, here are examples broken into categories.

Deep Learning on Large Images

More Augmentation

While these can be thought of as more advanced features, augmenting images and bounding boxes are great way to increase the robustness of your dataset and potentially increase the accuracy of your model.  

Deep Learning Classification of Large Images, is especially useful Medical Imaging, where images can easily be so large as to not fit in memory.

 

Code Generation

Every release, keep an eye on new features and functions supported with Code Generation. A complete list of features can be found here, but of note this release is CUDA code generation of deep learning networks such as MobileNet-v2 and DeepLab-v3+. There are new functions from Image Processing and Computer Vision, and you can generate code for LSTM networks. There are a few more examples than this, but here are some highlights:

Reinforcement Learning

Reinforcement learning is a hot topic right now, especially in the research community. Reinforcement Learning Toolbox, first released in 19a, gives you access to the complete RL workflow: from creating an environment, to training and deployment. In R2019b we added some new exciting examples:
   
  • Imitate MPC Controller for Lane Keep Assist - If you already have a working decision-making system that you want to build on, why reinvent the wheel? This example shows how you can exploit an existing controller for lane keep assist to warm-start reinforcement learning training.
  • Create Agent Using Deep Network Designer and Train Using Image Observations -If you are looking for a simple example to get started, then check out this one. While this isn't technically new this release, it’s based on a simple pendulum environment and shows how you can put together neural network architectures interactively with Deep Network Designer.
   

Audio and Signal Processing

Signal:

A new highlight of this release in signal is automated labeling. You can create your own labeling algorithm and bring it into the labeling apps. These two examples will show details on implementing this:
Label QRS Complexes and R Peaks of ECG Signals Using Deep Network Label Spoken Words in Audio Signals Using External API

Audio:

For those developing audio and speech applications we've introduced new functionality for data augmentation and feature extraction. You can now set up pipelines of randomly-parametrized audio effects for augmenting data, including pitch and time stretching.

Wavelets:

There are also two new examples for wavelets: Wavelet Time Scattering for Music Genre Classification & Spoken Digit Recognition. You can automatically extract features using wavelet time scattering on GPUs. This release includes these new examples that demonstrate how to use this new capability for signal classification.

Data Synthesis for Training

We also added some new examples that show how you can synthesize radar and communications data to train your networks.
You can synthesize Micro-Doppler signatures for pedestrians, bicycles, and pedestrians to train your networks.
That's it! I hope you found one or more of these examples useful. Of course, you can check out the release notes for each product if you want to dive into more details. For any questions on these examples or new features, leave me a comment below!
594 views (last 30 days)  | |

Comments

To leave a comment, please click here to sign in to your MathWorks Account or create a new one.