Artificial Intelligence (https://blogs.mathworks.com/deep-learning)
Johanna specializes in deep learning and computer vision. Her goal is to give insight into deep learning through code examples, developer Q&As, and tips and tricks using MATLAB.

Incremental Learning: Adaptive and real-time machine learning

Incremental learning is a machine learning approach that addresses the challenge of adaptively fitting models to new incoming data. The incremental learning approach is particularly useful to engineers who need to model streaming data. Often, engineers and other AI practitioners deploy machine learning models to target devices, and incremental learning ensures that the models continue to work as intended if the data changes.
In this blog post, we are going to explain what incremental learning is, why it is useful, and how to implement incremental learning with MATLAB tools and Simulink blocks.
 

What is Incremental Learning?

Incremental learning is a machine learning approach that enables machine learning models (and deep learning models) to continuously learn by processing incoming non-stationary data from a data stream. With incremental learning, you can create artificial intelligence (AI) systems that continuously update to integrate new knowledge while maintaining previous knowledge.
Diagram of the incremental learning workflow showing how a machine learning model learns with streaming data while maintaining previous knowledge.
Figure: Incremental learning workflow.
 

Incremental Learning vs Traditional Machine Learning

A traditional machine learning model is trained on a batch of data and generalization to new data (that is, avoiding overfitting or underfitting) is ensured by methods like cross-validation, regularization, and hyperparameter tuning.
On the other hand, incremental learning adapts to new data in real time, and therefore it provides certain benefits compared to traditional machine learning. Incremental learning is flexible, quick, and adaptive to new data. An incremental learning model fits to data quickly and efficiently, which means it can adapt in real time to changes (or drifts) in the data distribution. It is also more efficient when little information is known about the training data. For example, class names might not be known until after the model processes observations.
Additionally, incremental learning has these benefits:
  • Protecting the privacy of end-user data.
  • Allowing devices to learn even with limited or no internet connectivity.
  • Allowing the design of advanced devices with personalization and smart features.

Challenges in Incremental Learning

Incremental learning is not without its inherent challenges, a couple of which are data storage and catastrophic forgetting.
Data storage – Data arrives in a stream and the sample size is unknown and possibly large, which makes data storage difficult. Therefore, the incremental learning algorithm must process the data while it is available, before it is discarded.
Catastrophic forgetting – An incremental learning model can’t access previous data while learning on new data. The model can overfit to the new data and forget what it learned from earlier data, which results in poor overall performance.
 

Incremental Anomaly Detection

Incremental anomaly detection is a branch of machine learning that, similarly to incremental learning, involves processing incoming data from a data stream. In incremental anomaly detection, instead of fitting a machine learning model, the algorithm computes anomaly scores in real time.

Learn More About Incremental Learning

To learn more about what incremental learning is and get started with an example, see:
Showing the cumulative and windowed classification error decreasing for incremental learning.
Figure: Cumulative and window classification error for an incremental learning model, with performance metrics updated using the flexible workflow.
 

Why Is Incremental Learning Useful?

To solve real-world problems, machine learning models must leave the desktop and go into production. When a machine learning model is operating on its target device, such as on the cloud or an edge device, the model is likely to receive non-stationary streaming data. This is when incremental learning is particularly useful.

Applications of Incremental Learning

Lithium-ion batteries are everywhere today, from wearable electronics, mobile phones, and laptops to electric vehicles and smart grids.  Let’s say you are designing a virtual sensor using AI to estimate the battery’s State-Of-Charge (SOC). An SOC virtual sensor is a key component of a battery management system (BMS) that ensures the safe and efficient operation of a battery. The virtual sensor receives voltage, current, and temperature measurements from other sensors. These measurements are likely to change over time and the model that you have deployed should adapt to these changes.
Diagram of a virtual sensor with inputs voltage, current, and temperature measurements, and output the State of Charge of a battery.
Figure: Designing a virtual sensor for battery State-Of-Charge (SOC) estimation using AI.
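To make this concrete, here is a minimal sketch of how such a virtual sensor could keep adapting on the device. All measurements below are synthetic and the variable names are illustrative; a deployed BMS would stream real voltage, current, and temperature readings.
socModel = incrementalRegressionLinear(EstimationPeriod=0); % fit from the first observation
for k = 1:100                                     % 100 streaming observations
    x = [3.7 + 0.1*randn, 1.5*randn, 25 + randn]; % synthetic [voltage, current, temperature]
    socTrue = min(max(0.8 + 0.05*randn, 0), 1);   % synthetic SOC ground truth in [0, 1]
    socModel = updateMetricsAndFit(socModel, x, socTrue);
end
socEstimate = predict(socModel, [3.8, 0.2, 26])   % estimate SOC for a new reading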
 
The design of virtual sensors is just one potential application of incremental learning. Other applications include:
  • Signal Processing
  • Predictive Maintenance
  • Wireless Communications
An example from my personal experience is using incremental learning in the design of implantable brain-machine interfaces (BMIs). During my PhD research, I developed algorithms and designed chips for implantable BMIs. The algorithms aimed to model very noisy brain signals and cluster brain activity to identify which neuron fired and when. Because all the preprocessing and machine learning must happen on an ultra-low power and tiny chip, the algorithms must be computationally efficient, have a small footprint, and process data in real time.
As part of my work, I developed an incremental learning algorithm that clustered the incoming neural signals in real time, while retaining information (like the cluster centers and statistical dependencies) of previously clustered activity. I wish MATLAB had offered built-in algorithms for incremental learning ten years ago; more on the tools now available in MATLAB for incremental learning in the next section.

Incremental Learning and MLOps

MLOps is a set of practices that automate the process of taking machine learning models to production and managing the models once they are in production. As part of MLOps, machine learning models in production are constantly monitored. By using incremental learning algorithms, the machine learning models can be updated on the fly, which potentially reduces errors.
MLOps loop showing steps for machine learning and operations.
Figure: The MLOps lifecycle.
 

Consider that in real-world applications, data is often dynamic and always changing, so drift can be a big issue for machine learning models. Data drift can happen for many reasons, such as changes in the distribution of the input data over time or changes in the relationship between the input and the desired output.

With incremental learning, the model is updated when the input changes.


Video: What is MLOps?

 

How to Implement Incremental Learning

Now that you understand what incremental learning is and how useful it is for modeling streaming data, we will describe MATLAB and Simulink tools so that you can easily implement incremental learning in your application.

Incremental Learning with MATLAB

Using algorithms from Statistics and Machine Learning Toolbox, you can create flexible, efficient, and adaptive incremental learning models for classification and regression, such as linear support vector machine (SVM), logistic regression, and naive Bayes classifiers, as well as least-squares and linear SVM regression models. Alternatively, you can convert a traditionally trained model to an incremental learning model by using the incrementalLearner function. To learn more about these incremental learning models, see the documentation topic Incremental Learning Overview.
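As a hedged sketch of this conversion workflow (all data below is synthetic and the variable names are illustrative):
X = randn(1000,4);                        % 1000 observations, 4 features
Y = X(:,1) + X(:,2) > 0;                  % synthetic binary labels
batchMdl = fitclinear(X(1:500,:), Y(1:500), Learner="svm"); % traditional batch training
incMdl = incrementalLearner(batchMdl);    % convert to an incremental model
for j = 1:10                              % stream the remaining data in 50-observation chunks
    idx = 500 + (j-1)*50 + (1:50);
    incMdl = updateMetricsAndFit(incMdl, X(idx,:), Y(idx));
end
incMdl.Metrics                            % cumulative and window classification error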
With Statistics and Machine Learning Toolbox, you can detect concept drift for incremental learning models, that is, detect when the data has changed so that the model is no longer valid. Also, you can automatically generate C/C++ code for incremental learning models. To learn more, see the example Code Generation for Incremental Learning.
Graph with concept drift detection for incremental learning showing stable, warning, and drift status for different observations.
Figure: Concept drift detection for incremental learning with MATLAB.
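As a sketch of what drift detection can look like in code (the 0/1 loss stream below is simulated; in practice you would feed the per-observation losses of your deployed model):
isMisclassified = [rand(1,500) < 0.1, rand(1,500) < 0.4]; % error rate jumps at observation 501
detector = incrementalConceptDriftDetector("hddma");      % Hoeffding's bound, moving average
for k = 1:numel(isMisclassified)
    detector = detectdrift(detector, isMisclassified(k));
    if detector.DriftStatus == "Drift"
        fprintf("Drift detected at observation %d\n", k)
        break
    end
end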
 

Incremental Learning with Simulink

Using Simulink blocks provided in Statistics and Machine Learning Toolbox, you can integrate incremental learning into the design, simulation, and test of complex AI engineered systems, such as in the design of virtual sensors. To learn more, see the following examples:
  • Incremental Learning in Simulink for Classification (a Simulink system with the IncrementalClassificationLinear Fit and IncrementalClassificationLinear Predict blocks)
  • Incremental Learning in Simulink for Regression (a Simulink system with the IncrementalRegressionLinear Fit and IncrementalRegressionLinear Predict blocks)
 

Takeaways

  • Incremental learning addresses the challenge of fitting machine learning models adaptively to incoming streaming data.
  • Incremental learning can reduce errors when machine learning models are operating in production.
  • MATLAB and Simulink provide tools, functions, and blocks to create incremental learning models, integrate them into system-level design, and deploy them to hardware.
Data-Driven Control with MATLAB and Simulink

The following blog post is from Melda Ulusoy, Technical Marketing Manager at MathWorks.
One of artificial intelligence (AI)’s first big successes was solving image classification problems with deep learning. AI has since been used in many other areas, including control systems. In this blog post, we will present an overview of AI for controls, highlight advantages of using MATLAB and Simulink for data-driven control, and point you to a recent webinar on the topic.
 

AI for Control Algorithms

Feedback control algorithms are used in advanced robots, electric motors, batteries, power converters, power grids, and autonomous vehicles that drive, fly, and sail. Traditionally, feedback control algorithms relied on linear models of the machines and devices, for which a control system engineer needed to develop a control algorithm.
As control engineers strive to improve the performance of control algorithms, they are increasingly turning to techniques that can enhance performance by considering the nonlinear dynamics of the systems to control.  AI techniques are great for creating accurate nonlinear models from data. Additionally, AI is very useful when control algorithms do not rely on a model of a system, but instead learn directly from data. So, control engineers are increasingly interested in applying AI to their work.
In this post, we cover a wide set of techniques for data-driven control. These techniques use system data to either learn a model of the system or directly learn control system parameters from data.  Some of the data-driven control techniques are AI-based algorithms, while others use non-AI-based algorithms to take advantage of system data.
 

Why MATLAB and Simulink for Data-Driven Control?

With MATLAB and Simulink, you can design and implement a variety of data-driven controllers including extremum seeking control (ESC), active disturbance rejection control (ADRC), model reference adaptive control (MRAC), data-driven model predictive control (MPC), and reinforcement learning (RL). This is not an exhaustive list, but rather a sample of somewhat recent capabilities that have been added to the data-driven control area.
Application areas of AI and data-driven control include ESC, ADRC, MRAC, MPC, and RL.
Figure: Sample of recent capabilities in the data-driven control area.
 
Using MATLAB and Simulink for data-driven control comes with several advantages. Engineers can:
  • Design, test, and compare a variety of control algorithms including both traditional and data-driven techniques in a single environment.
  • Implement and test data-driven control algorithms in Simulink using pre-built Simulink blocks.
  • Automatically generate code from the control algorithm for deployment to embedded hardware.
  • Use reference examples for flight control, robotics, energy management, and other applications to quickly get started with the implementation of data-driven control algorithms.

Watch Recent Webinar

This has been a short intro to data-driven control. For a deeper dive, watch the recent webinar on Data-Driven Control with MATLAB and Simulink.
In the webinar, you will learn the basics of ADRC, MPC, and RL, and see the following demonstrations:
  • Demo 1: Permanent magnet synchronous motor (PMSM) control using ADRC (learn more in this video)
  • Demo 2: House heating system control using data-driven MPC (learn more in this video)
  • Demo 3: Rotary inverted pendulum control using RL
Representations of demos presented in the Data-Driven Control webinar
Figure: Demos presented in the Data-Driven Control webinar.
Verification and Validation for AI: Learning process verification

The following post is from Lucas García, Product Manager for Deep Learning Toolbox. 
This is the third post in a 4-post series on Verification and Validation (V&V) for AI.
The series began with an overview of V&V’s importance and the W-shaped development process, followed by a practical walkthrough in the second post, detailing the journey from defining AI requirements to training a robust pneumonia detection model.
This post is dedicated to learning process verification. We will show you how to ensure that specific verification techniques are in place to guarantee that the pneumonia detection model trained in the previous blog post meets the identified model requirements.
Steps in W-shaped development process with highlighted the step of Learning Process Verification
Figure 1: W-shaped development process, highlighting the stage covered in this post. Credit: EASA, Daedalean.
 

Testing and Understanding Model Performance

The model was trained using fast gradient sign method (FGSM) adversarial training, which is a method for training networks so that they are robust to adversarial examples. After training the model, particularly following adversarial training, it is crucial to assess its accuracy using an independent test set.
The model we developed achieved an accuracy exceeding 90%, which not only meets our predefined requirement but also surpasses the benchmarks reported in the foundational research for comparable neural networks. To gain a more nuanced understanding of the model’s performance, we examine the confusion matrix, which sheds light on the types of errors the model makes.
Confusion chart for adversarially-trained model showing accuracy of 90.71%, and true and predicted classes
Figure 2: Confusion chart for the adversarially-trained model.
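As a hedged sketch of this evaluation (assuming net is the trained network and XTest and TTest form the independent test set; for a dlnetwork you would call predict and decode the scores instead of classify):
YPred = classify(net, XTest);   % predicted class labels
accuracy = mean(YPred == TTest) % fraction of correct predictions
confusionchart(TTest, YPred);   % visualize the types of errors the model makes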
Explainability techniques like Grad-CAM offer a visual understanding of the influential regions in the input image that drive the model’s predictions, enhancing interpretability and trust in the AI model’s decision-making process. Grad-CAM highlights the regions of the input image that contributed most to the final prediction.
Two images of lungs with pneumonia. The left image is showing the ground truth and the right image is showing the prediction with Grad-CAM.
Figure 3: Understanding network predictions using Gradient-weighted Class Activation Mapping (Grad-CAM).
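A minimal Grad-CAM sketch using the gradCAM function from Deep Learning Toolbox (the network net, image X, and the class label are assumptions):
scoreMap = gradCAM(net, X, "pneumonia"); % map of the regions driving the prediction
imshow(X)
hold on
imagesc(scoreMap, AlphaData=0.5)         % overlay the heat map on the X-ray
colormap jet
hold off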
 

Verify Robustness of Deep Learning Models

Adversarial Examples

Robustness of the AI model is one of the main concerns when deploying neural networks in safety-critical situations. It has been shown that neural networks can misclassify inputs due to small imperceptible changes.
Consider the case of an X-ray image that a model correctly identifies as indicative of pneumonia. When a subtle perturbation is applied to this image (that is, a small change is applied to each pixel of the image), the model’s output shifts, erroneously classifying the X-ray as normal.
Effect of input perturbation to lung image with pneumonia. The classifier misclassifies the image as normal.
Figure 4: Adversarial examples: effect of input perturbation to image classification.
 

L-infinity norm

To understand and quantify these perturbations, we turn to the concept of the l-infinity norm.
Imagine you have a chest X-ray image. A perturbation with an l-infinity norm of, say, 5 means that no pixel value changes by more than 5. In one scenario, you might add 5 to every pixel within a specific image region. Alternatively, you could adjust various pixels by different values within the range of -5 to 5, or alter just a single pixel.
Examples of input perturbations of a pixel of a lung image.
Figure 5: L-infinity norm: examples of possible input perturbations.
However, the challenge is that we need to account for all possible combinations of perturbations within the -5 to 5 range, which essentially presents us with an infinite number of scenarios to test. To navigate this complexity, we employ formal verification methods, which provide a systematic approach to testing and ensuring the robustness of our neural network against a vast landscape of potential adversarial examples.
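In code, the l-infinity norm of a perturbation is simply the largest absolute change to any single pixel. A sketch, where X and Xp are an assumed original and perturbed image:
delta = Xp - X;               % elementwise pixel changes
linfNorm = max(abs(delta(:))) % largest single-pixel change; equivalently norm(delta(:),Inf)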

Formal verification

Given one of the images in the test set, we can choose a perturbation that defines a collection of perturbed images for this specific image. It is important to note that this collection of images is extremely large (the images depicted in the volume in Figure 5 are just a representative sample), and it is not practical to test each perturbed image individually.
Deep Learning Toolbox Verification Library allows you to verify and test the robustness of deep learning networks using formal verification methods, such as abstract interpretation. The library enables you to verify whether the network you have trained is adversarially robust with respect to the class label, given an input perturbation.
Abstract interpretation applied to a lung image. The classification results can be interpreted as verified, unproven, or violated.
Figure 6: Formal verification using abstract interpretation.
Formal verification methods offer a mathematical approach for formally proving the correctness of a system. They allow us to conduct rigorous tests across the entire volume of perturbed images to see if the network’s output is affected. There are three potential outcomes for each of the images:
  • Verified - The output label remains consistent.
  • Violated - The output label changes.
  • Unproven - Further verification efforts or model improvement is needed.
Let’s set up the perturbation for our specific problem. The image values in our test set (XTest) range from 0 to 1. We set the perturbation to 1%, up or down. We set the perturbation bounds by using XLower and XUpper and define a collection of images (i.e., the volume in Figure 5). This means that we will test all possible perturbations of images that fall within these bounds.
Before running the verification test, we must convert the data to a dlarray object. The data format for the dlarray object must have the dimensions “SSCB” (spatial, spatial, channel, batch) to represent 2-D image inputs. Note that XTest is not just a single image but a batch of images to verify. So, we have a volume to verify for each of the images in the test set.
% Define 1% perturbation bounds around each test image
perturbation = 0.01;
XLower = XTest - perturbation;
XUpper = XTest + perturbation;
% Convert the bounds to dlarray objects with the "SSCB" format
XLower = dlarray(XLower,"SSCB");
XUpper = dlarray(XUpper,"SSCB");
We are now ready to use the verifyNetworkRobustness function. We specify the trained network, the lower and upper bounds, and the ground truth labels for the images.
result = verifyNetworkRobustness(net,XLower,XUpper,TTest); 
summary(result) 

verified     402
violated      13
unproven     209

The outcome reveals over 400 images verified, 13 violations, and more than 200 unproven results. We’ll have to go back to those images where the robustness test returned violated or unproven results and see if there is anything we can learn. But for over 400 images, we were able to formally prove that no adversarial example within a 1% perturbation range alters the network’s output—and that’s a significant assurance of robustness.
Another question that we can answer with formal verification is whether adversarial training contributed to network robustness. In the second post of the series, we began with a reference model and investigated various training techniques, ultimately adopting an adversarially trained model. Had we used the original network, we would have faced unproven results for nearly all images. And in a safety-critical context, you’ll likely need to treat unproven results as violations. While data augmentation contributed to verification success, adversarial training enabled the verification of substantially more images, leading to a much more robust network that satisfies our robustness requirements.
Bar graph showing number of observations for each verification result (verified, violated, and unproven) for original network, data-augmented network, and robust network.
Figure 7: Comparing verification results from various trained networks.
 

Out-of-Distribution Detection

A trustworthy AI system should produce accurate predictions in a known context. Still, it should also be able to identify examples that are unknown to the model and reject them or defer them to a human expert for safe handling. Deep Learning Toolbox Verification Library also includes functionality for out-of-distribution (OOD) detection.
Consider a sample image from our test set. To evaluate the model’s ability to handle OOD data, we can derive new test sets by applying meaningful transformations to the original images, as shown in the following figure.
Deriving datasets by adding speckle noise, FlipLR transformation, and contrast transformation to a lung image.
Figure 8: Derived datasets to explore out-of-distribution detection.
Using this library, you can create an out-of-distribution data discriminator to assign confidence to network predictions by computing a distribution confidence score for each observation. It also provides a threshold for separating the in-distribution from the out-of-distribution data.
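A hedged sketch of this workflow with the networkDistributionDiscriminator function (the energy method and the data variables are assumptions; XTrain is the in-distribution training data and XShifted is a transformed test set):
discriminator = networkDistributionDiscriminator(net, XTrain, [], "energy");
scores = distributionScores(discriminator, XShifted); % distribution confidence per observation
tf = isInDistribution(discriminator, XShifted);       % true where inputs look in-distribution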
In the following chart, we observe the network distribution scores for the training data, represented in blue, which constitutes the in-distribution dataset. We also see scores for the various transformations applied to the test set.
Bar graph of relative percentage versus distribution confidence scores for training data, speckle noise, FlipLR, and contrast.
Figure 9: Distribution of confidence scores for the original and derived datasets.
By using the distribution discriminator and the obtained threshold, when the model has to classify transformed images at test time, we can tell whether the images would be considered in- or out-of-distribution. For example, the images with speckle noise (see Figure 8) would be in-distribution, so we could trust the network output. On the contrary, the distribution discriminator considers the images with the FlipLR and contrast transformations (also see Figure 8) as out-of-distribution, so we shouldn’t trust the network output in those situations.
 

What’s Next?

Stay tuned for our fourth and final blog post, where we will navigate the right-hand side of the W-diagram, focusing on deploying and integrating our robust pneumonia detection model into its operational environment. We will show how to bridge the gap between a well-trained model and a fully functional AI system that can be trusted in a clinical setting.
Large Language Models with MATLAB


How to connect MATLAB to the OpenAI™ API to boost your NLP tasks.

Have you heard of ChatGPT™, Generative AI, and large language models (LLMs)? This is a rhetorical question at this point. But did you know you can combine these transformative technologies with MATLAB? In addition to the MATLAB AI Chat Playground (learn more by reading this blog post), you can now connect MATLAB to the OpenAI™ Chat Completions API (which powers ChatGPT).
In this blog post, we are talking about the technology behind LLMs and how to connect MATLAB to the OpenAI API. We also show you how to perform natural language processing (NLP) tasks, such as sentiment analysis and building a chatbot, by taking advantage of LLMs and tools from Text Analytics Toolbox.
 

What are LLMs?

Large language models (LLMs) are based on transformer models (a special case of deep learning models). Transformers are designed to track relationships in sequential data. They rely on a self-attention mechanism to capture global dependencies between input and output. LLMs have revolutionized NLP, because they can capture complex relationships between words and nuances present in human language.
Well-known transformer models include BERT and GPT models, both of which you can use with MATLAB. If you want to use a pretrained BERT model included with MATLAB, you can use the bert function. In this blog post, we are focusing on GPT models.
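As a hedged one-liner (assuming Text Analytics Toolbox R2023b or later), loading the pretrained model might look like this:
[net, tokenizer] = bert; % pretrained BERT network plus the tokenizer for preparing text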
 

LLMs Repository

The code you need to access and interact with LLMs using MATLAB is in the LLMs repository. By using the code in the repository, you can interface with the ChatGPT API from your MATLAB environment. Some of the supported models are gpt-3.5-turbo and gpt-4.
GitHub repository for large language models with MATLAB
 

Set Up

To interface with the ChatGPT API, you must obtain an OpenAI API key. To learn more about how to obtain the API key and about charges for using the OpenAI API, see OpenAI API. It’s good practice to save the API key in a file in your current folder, so that you have it handy.
How to save OpenAI API key for reuse
Animated Figure: Save the OpenAI API key in your current folder.
 

Getting Started

To initialize the OpenAI Chat object and get started with using LLMs with MATLAB, type just one line of code.
chat = openAIChat(systemPrompt,ApiKey=my_key);
In the following sections of this blog, I will show you how to specify the system prompt for different use cases and how to enhance the functionality of the OpenAI Chat object with optional name-value arguments.

Getting Started in MATLAB Online

You might want to work with LLMs in MATLAB Online. GitHub repositories with MATLAB code have an “Open in MATLAB Online” button. By clicking on the button, the repository opens directly in MATLAB Online. Watch the following video to see how to open and get started with the LLMs repository in MATLAB Online in less than 30 seconds.
Open large language models repository in MATLAB Online
Animated Figure: Use LLMs in MATLAB Online.
 

Use Cases

In this section, I am going to present use cases for LLMs with MATLAB and link to relevant examples. The use cases include sentiment analysis, building a chatbot, and retrieval augmented generation. You can use tools from Text Analytics Toolbox to preprocess, analyze, and ‘meaningfully’ display text. We are going to mention a few of these functions here but check the linked examples to learn more.

Sentiment Analysis

Let’s start with a simple example on how to perform sentiment analysis. Sentiment analysis deals with the classification of opinions or emotions in text. The emotional tone of the text can be classified as positive, negative, or neutral.
Sentiment analysis with large language models
Figure: Creating a sentiment analysis classifier.
 
Specify the system prompt. The system prompt tells the assistant how to behave, in this case, as a sentiment analyzer. It also provides the system with simple examples on how to perform sentiment analysis.
systemPrompt = "You are a sentiment analyser. You will look at a sentence and output"+...
    " a single word that classifies that sentence as either 'positive' or 'negative'."+....
    "Examples: \n"+...
    "The project was a complete failure. \n"+...
    "negative \n\n"+...  
    "The team successfully completed the project ahead of schedule."+...
    "positive \n\n"+...
    "His attitude was terribly discouraging to the team. \n"+...
    "negative \n\n";
Initialize the OpenAI Chat object by passing a system prompt.
chat = openAIChat(systemPrompt,ApiKey=my_key);
Generate a response by passing a new sentence for classification.
text = generate(chat,"The team is feeling very motivated.")

  text = "positive"

The text is correctly classified as having a positive sentiment.

Build Chatbot

A chatbot is software that simulates human conversation. In simple words, the user types a query and the chatbot generates a response in a natural human language.
Building a chatbot with large language models
Figure: Building a chatbot.
 
Chatbots started out template-based. Have you tried querying a template-based chatbot? Well, I have, and almost every chat ended with me frantically typing “talk to human”. By following the Example: Build ChatBot in the LLMs repository, I was able to build a helpful chatbot in minutes.
The first two steps in building a chatbot are to (1) create an instance of openAIChat to perform the chat and (2) use the openAIMessages function to store the conversation history.
chat = openAIChat("You are a helpful assistant. You reply in a very concise way, keeping answers "+...
    "limited to short sentences.",ModelName=modelName,ApiKey=my_key);
messages = openAIMessages;
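Each conversational turn then follows the repository's message pattern. Here is a hedged sketch of one turn (the user prompt is illustrative):
messages = addUserMessage(messages, "What should I see in the Yucatan Peninsula?");
[reply, response] = generate(chat, messages);      % generate the assistant's answer
messages = addResponseMessage(messages, response); % keep the history for future turns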
After a few more lines of code, I built a chatbot that helped me plan my Mexico vacations. In addition to the example code, I used other MATLAB functions (e.g., extractBetween) to format the chatbot responses. The following figure shows my brief (but helpful) chat with the chatbot. Notice that the chatbot retains information from previous queries. I don’t have to repeat “Yucatan Peninsula” in my questions.
Conversation with chatbot built with large language model
Figure: User queries and chatbot responses for planning a Mexico vacation.
 

Retrieval Augmented Generation

Retrieval-augmented generation (RAG) is a technique for enhancing the results achieved by an LLM. Both accuracy and reliability can be augmented by retrieving information from external sources. For example, the prompt fed to the LLM can be enhanced with more up-to-date or technical information.
Workflow for retrieval augmented generation (RAG) with large language model
Figure: Workflow for retrieval-augmented generation (RAG).
 
The Example: Retrieval-Augmented Generation shows how to retrieve information from technical reports on power systems to enhance ChatGPT for technical queries. I am not going to show all the example details here, but I will highlight key steps.
  1. Use MATLAB tools (e.g., websave and fileDatastore) for retrieving and managing online documents.
  2. Use Text Analytics Toolbox functions (e.g., splitParagraphs, tokenizedDocument, and bm25Similarity) for preparing the text from the retrieved documents.
  3. When the retrieved text is ready for the task, initialize the chatbot with the specified context and API key.
    chat = openAIChat("You are a helpful assistant. You will get a " + ...
        "context for each question, but only use the information " + ...
        "in the context if that makes sense to answer the question. " + ...
        "Let's think step-by-step, explaining how you reached the answer.",ApiKey=my_key);
    
  4. Define the query. Then, retrieve and filter the relevant documents based on the query.
    query = "What technical criteria can be used to streamline new approvals for grid-friendly DPV?";
    selectedDocs = retrieveAndFilterRelevantDocs(allDocs,query);
    
  5. Define the prompt for the chatbot and generate a response.
    prompt = "Context:" ...
        + join(selectedDocs, " ") + newline +"Answer the following question: "+ query;
    response = generate(chat,prompt);
    
    Wrap the text for easier visualization.
    wrapText(response)
    

    ans =

    "The technical criteria that can be used to streamline new approvals for grid-friendly DPV can include prudent screening criteria for systems that meet certain specifications. These criteria can be based on factors such as DPV capacity penetration relative to minimum feeder daytime load. Additionally, hosting capacity calculations can be used to estimate the point where DPV would induce technical impacts on system operations. These screening criteria are commonly used in countries like India and the United States."

Other Use Cases

The use cases presented above are just a sample of what you can achieve with LLMs. Other notable use cases (with examples in the LLMs repository) include text summarization and function calling. You can also use LLMs for many other NLP tasks like machine translation.
Text summarization is automatically creating a short, accurate, and legible summary of a longer text document. In the Example: Text Summarization, you can see how to incrementally summarize a large text by breaking it into smaller chunks and summarizing each chunk step by step.
Function calling is a powerful tool that allows you to combine the NLP capabilities of LLMs with any functions that you define. But remember that ChatGPT can hallucinate function names, so avoid executing any arbitrary generated functions and only allow the execution of functions that you have defined. For an example on how to use function calling for automatically analyzing scientific papers from the arXiv API, see Function Calling with LLMs.
What will you use the MATLAB LLMs repository for? Leave comments below and links to your GitHub repository.
 

Key Takeaways

  1. There is a new GitHub repository that enables you to use GPT models with MATLAB for natural language processing tasks. Find the repository here.
  2. The repository includes example code and use cases that achieve many tasks like sentiment analysis and building a chatbot.
  3. Take advantage of MATLAB tools, and more specifically Text Analytics Toolbox functions, to enhance the LLM functionality, such as retrieving, managing, and preparing text.
Lidar Code-Along

Lidar (light detection and ranging) is a remote sensing technology. Lidar sensors emit laser pulses that reflect off objects, allowing them to perceive the structure of their surroundings. The sensors record the reflected light energy to determine the distances to objects and create 2-D or 3-D representations of the surroundings.
Lidar technology produces a point cloud: a collection of data points plotted in 3-D space, where each point represents the X-, Y-, and Z-coordinates of a location on a real-world object’s surface, and the points collectively map the entire surface. Lidar is used in many industries including forestry, infrastructure analysis, and mining.
Deep learning adds precision and speed to the processing of point cloud data. Watch the following five videos that walk you through the steps of a lidar processing workflow with deep learning. You can also find the videos here.
For more examples, see lidar with deep learning examples. To learn more about how MATLAB users can use deep learning, computer vision, and image processing for addressing lidar challenges, check out this story: Spacesium Creates Deep Learning System to Segment Large Lidar Point Clouds with MATLAB.
 

Load Point Cloud Data

Load point cloud data as a data store using the pcread function and load the bounding box labels using the boxLabelDatastore function.
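A hedged sketch of this loading step (the file name, folder, and label table are assumptions):
ptCloud = pcread("drivingScene.pcd");                   % read one scan into a pointCloud object
lidarDs = fileDatastore("lidarData/", ReadFcn=@pcread); % datastore over a folder of scans
boxDs = boxLabelDatastore(boxLabelTable);               % bounding box labels, one column per class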
 

Preprocess Data

Split the data into training and testing sets. Synthetically increase the size of the training data set by randomly adding a fixed number of car and truck class objects to every point cloud and by using augmentation techniques such as flipping, scaling, rotation, and translation. To learn more about typical data augmentation techniques for 3-D object detection workflows with lidar data, see Data Augmentations for Lidar Object Detection Using Deep Learning.
 

Define Deep Neural Network

Define network parameters, such as anchor boxes and pillars, for the PointPillars network. Then, define the PointPillars detection network using the pointPillarsObjectDetector function.
 

Train Network

To train a network, you must specify training options. Because training a network can be time consuming, a pretrained PointPillars model is used for this workflow.
 

Object Detection

Test the PointPillars network on a test dataset and display the detected output point cloud with bounding boxes.
NeurIPS Highlights

We are back from the NeurIPS 2023 conference. It was truly inspiring to engage with the AI community and learn more about the current developments and future of AI. We also enjoyed the opportunity to share how you can use MATLAB to integrate AI into cyber-physical systems. In this blog post, I am going to present some highlights from the MATLAB AI team’s presence at the conference.
MATLAB AI team at MathWorks booth at NeurIPS 2023
Figure: Our team at the MathWorks booth
Left to right (front): Ashvant Ram Selvam, Lucas García, Brenda Zhuang, Sivylla Paraskevopoulou, Anoush Najarian, Maitreyi Chitale, and Jon Cherrie
Left to right (back): Mary Ann Freeman, Antoni Woss, Naren Srivaths Raman, Conor Daly, Jianghao Wang, Kevin Holly, Elvira Osuna Highley, and Abhijit Bhattacharjee
 

3 Talks and a Poster

Here, you can find an overview of the talks and poster presented by MathWorks at NeurIPS, including links to slides.
MathWorks talks and poster at NeurIPS with large number of attendees
Figure: Lucas García and Naren Srivaths Raman presenting talks on Incorporating ML Models into Safety-Critical Systems and Reinforcement Learning: Trends, Applications, and Challenges, and Maitreyi Chitale presenting a poster on Farm-To-Plate-AI.
 

Talk 1

From Theory to Practice: Incorporating ML Models into Safety-Critical Systems
Lucas García
In this talk, Lucas delved into the complexities of certifying neural networks for safety-critical applications, with a particular focus on the aviation industry. The presentation introduced AI Certification in the aviation industry, and a case study for a runway sign classification system in the cockpit designed to improve pilot navigation and situational awareness. By walking the audience through each step of the workflow—from requirement definition to model training and system validation—the talk provided a comprehensive blueprint for integrating neural networks into safety-critical systems for low criticality applications, paving the way for future advancements in industry-specific certification standards.
Slides
 

Talk 2

Using AI to Improve Control Design Workflows
Naren Srivaths Raman
Control systems are ubiquitous and enable the safe and predictable operation of airplanes, cars, and energy systems. Just as in other engineering disciplines, control engineers are interested in the new possibilities AI offers to enhance traditional solutions. The talk covered several areas where AI is gaining interest and adoption among control engineers and researchers. In this talk, Naren (1) explored AI for modeling a system controlled (plant) with techniques such as nonlinear system identification and reduced order modeling (ROM), (2) discussed the use of AI for virtual sensor modeling and control algorithm design, in particular, the design of nonlinear model predictive control (MPC) using neural state-space (NSS) models, and (3) touched upon how reinforcement learning (RL) can be used as a tool for controller tuning or can replace a traditional controller altogether. Finally, Naren introduced some best practices in using AI for controls to address challenges, such as lack of performance guarantees.
Slides
 

Talk 3

Reinforcement Learning: Trends, Applications, and Challenges
Naren Srivaths Raman
Reinforcement learning (RL) has been gaining attention as a machine learning technique that can automatically learn complex behaviors and realize high performance. RL applications span various domains, including control design, robotics, automated driving, communications, and more. However, reinforcement learning comes with several challenges. These include the need for large amounts of training data, difficulties in tuning hyperparameters, and verification of deep neural network policies. In this talk, Naren discussed trends, applications, and challenges, and introduced ideas, tools, and best practices on how to address these challenges.
Slides
 

Poster

Farm-to-Plate-AI: Enhancing Freshness and Reducing Waste using Computer Vision and Robotics
Maitreyi Chitale, Brenda Zhuang, et al.
Addressing global food security and reducing waste in the food supply chain are crucial challenges in today's world. The poster highlights the transformative potential of integrating emerging technologies, such as autonomous navigation, artificial intelligence (AI), and computer vision, to optimize farming practices, automate harvesting and grading, and monitor food quality during transportation. By leveraging these technologies, we can effectively address the challenges posed by conflicts, pandemics, climate change, inequality, and the projected 70% increase in food demand due to population growth. We built a workshop with exercises to simulate the pipeline, which included simulating a UAV through different farm configurations, utilizing YOLOv4 and performing transfer learning on it with a smaller dataset to perform object counting of mangoes pre-harvest, and utilizing machine learning on hyperspectral images to gauge the ripeness of mangoes. The workshop was presented at GHC23 to 750+ in-person participants, as well as at virtual WiDS workshop series and beyond. Our poster, which is based on the workshop we created, proposes an end-to-end pipeline, from farm to plate and leverages the combined gains of individual AI and robotics algorithms to optimize the global food supply chain and achieve significant waste reduction.
 

Demos

As promised (in a previous blog post), we had lots of goodies at our booth and cool demos showing the AI Chat Playground (the new GenAI tool for generating MATLAB code), real-time instance segmentation on a GPU, and the crowd favorite: a robot playing Tic-Tac-Toe. An RL agent was trained to play the game, and the robot used computer vision to recognize the progress of the Tic-Tac-Toe game on the board.
Robotic arm using reinforcement learning and computer vision to play Tic-Tac-Toe
Figure: (left) The robot arm and Naren, the robot programmer. (right) Elvira playing Tic-Tac-Toe against the robot.
Watch the video below to see the robot in action.
Video (4x): A robot using reinforcement learning and computer vision plays Tic-Tac-Toe.
 

See You Next Year

We are looking forward to joining NeurIPS again next year for more sharing, events, and booth interactions. Until then, leave your comment here to talk about AI, your workflows, and challenges.
MathWorks team at WiML event
Figure: At the WiML (Women in Machine Learning) event at NeurIPS.
Podcast Alert: Deploying Edge and Embedded AI Systems

The following blog post is from Daniel Prieve, Digital Marketing Manager.
Last month, Heather Gorr was interviewed for the TWIML AI Podcast (hosted by Sam Charrington). Heather shared knowledge she has gained as a MATLAB Product Manager on how to prepare and test AI models before deploying them to edge devices and embedded systems.
You can find the podcast on “Deploying Edge and Embedded AI Systems”, here:
Podcast on deploying edge and embedded AI systems
In this blog post, we highlight a few key points from the TWIML podcast on Edge AI. But you will certainly learn a lot more by listening to the full podcast.
Data Preparation: When you prepare data for training and testing an AI model that will later be deployed to the edge, you must take into consideration hardware limitations and how they will impact the quality and streaming process of the data. This is particularly important when streaming data captured by sensors.
Model Preparation: Research models might perform well on the desktop, but AI practitioners should consider additional steps before deploying their models to the edge. They need to consider (1) compressing the models (for example, with quantization) to ensure the models will fit on the target device, (2) applying explainability techniques to add transparency to AI decisions, and (3) verifying the models’ robustness with testing and validation (for example, against adversarial examples). A hedged sketch of the quantization step appears after this list.
Simulation: Simulating an AI model in a system-wide context before deployment tests how well the model integrates with other parts of the system. By simulating a physical system, you can generate synthetic data when enough data is not available for training an AI model.
Collaborative Effort: Embedding AI models into hardware systems requires collaboration across many teams: hardware experts, data scientists, and domain-focused engineers. These teams, which might be working on different platforms and have different skill sets, must be given the right tools for successful communication, collaboration, and sharing outcomes.
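As promised above, here is a hedged sketch of the quantization step with dlquantizer (the network net and calibration datastore calData are assumptions):
quantObj = dlquantizer(net, ExecutionEnvironment="GPU");
calibrate(quantObj, calData); % exercise the network to collect dynamic ranges
qNet = quantize(quantObj);    % quantized version of the network for simulation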
Is deploying AI models to the edge part of your workflow? And what do you spend most of your time on: data preparation, model preparation, or system simulation? What have you learned in the podcast that you can apply back into your work? Share your comments and thoughts below.
MATLAB Is Coming to NeurIPS

NeurIPS 2023 is starting soon; this Sunday December 10th. I am very excited to join the conference along with other members of the MATLAB AI team and reach three personal firsts (not necessarily in order of importance):
  1. Visit New Orleans and have authentic beignets.
  2. Attend the biggest machine learning conference.
  3. Be at a conference as a non-academic.
Come see me and the rest of the AI team at the MathWorks booth to talk about AI and how to integrate AI in engineered systems, check out our demos, and get free goodies.
And of course, you shouldn’t miss the talks by my colleagues on incorporating machine learning models into safety-critical systems, reinforcement learning, and using AI for control design. All three talks will be presented on Sunday the 10th.
To get a sneak peek at the topic of safety-critical systems, check out our previous blog posts: The Road to AI Certification: The importance of Verification and Validation in AI and Verification and Validation for AI: From requirements to robust modeling.
Hope to see you at NeurIPS and if you are not joining, you can always leave a comment below to talk about AI and how you integrate it in your workflows.
Visual Inspection Code-Along

Visual inspection is the image-based inspection of parts where a camera scans the part under test for failures and quality defects. By using deep learning and computer vision techniques, visual inspection can be automated for detecting manufacturing flaws in many industries such as biotech, automotive, and semiconductors. For example, with visual inspection, flaws can be detected in semiconductor wafers and pills.
Watch the following video that walks you through the steps of a visual inspection workflow using a ResNet convolutional neural network. MATLAB provides access to many pretrained deep learning models that you can use for visual inspection. For more visual inspection examples, see Automated Visual Inspection.
PyTorch Models and Git in MATLAB Online

The following post is from Yann Debray, Product Manager for MATLAB Online, and Sivylla Paraskevopoulou, Product Marketing Manager for AI.
Do you use MATLAB Online to create AI workflows? Do you want to write trackable and shareable code?
This blog post shows you how to (1) import models from PyTorch® using MATLAB Online, (2) use Git™ source control in MATLAB Online, and (3) open MATLAB Online directly from GitHub®. For each of the steps of the workflow, we have included a short animation to help you replicate the step.
Import PyTorch models in MATLAB Online and integrate with GitHub
 

Create and Clone GitHub Repository

We first created a new GitHub repository with a README file. We cloned this repository in MATLAB Online, which copied all the files from the repository to a folder in our working directory.
Animated Figure: Create a GitHub repository and clone it with MATLAB Online.
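If you prefer to script this step, recent MATLAB releases also offer a programmatic route; a hedged sketch (R2023b or later, with an illustrative repository URL):
repo = gitclone("https://github.com/username/my-ai-project.git"); % clone into the current folder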
We generated a personal access token to use in place of a password. We are going to need the token to authenticate with GitHub when pushing our new or updated files to the GitHub repository.
Animated Figure: Generate personal access token.
 

Import Model from PyTorch in MATLAB Online

Starting in MATLAB R2023b, you can import models from PyTorch in MATLAB Online. To show this functionality, we reused the code from the doc example Import Network from PyTorch and Classify Image. Once we confirmed that the code runs as expected (that is, the PyTorch model is successfully imported and the test image is correctly classified), in three quick steps we uploaded the code to our GitHub repository.
  1. We added the live script to Source Control.
  2. We committed the script.
  3. We pushed the script to the GitHub repository.
Observe how fast and intuitive using Git source control in MATLAB Online is!
Animated Figure: Push live script (importing PyTorch model) from MATLAB Online to your GitHub repository.
Then, we did a quick update to the repository’s README from MATLAB Online and pushed the changes to the repository.
Animated Figure: Push updated README.
 

Add MATLAB Online Button to Repository

Finally, we added an “Open in MATLAB Online” button to our GitHub repository. By clicking the button, the GitHub repository opens directly in MATLAB Online (independently of the Git-tracked folder that we previously created). This makes it easy to share your code with your team or community. To learn more, see Open in MATLAB Online from GitHub.
Animated Figure: Open code from GitHub repository with MATLAB Online.