The MATLAB Blog

Practical Advice for People on the Leading Edge

How we made a better backslash in MATLAB R2024a

Few things say 'MATLAB' better than the backslash. So good, we put it on a T-shirt.
As most people will be aware, the backslash operator solves a system of linear equations. That is, it finds the x that solves A*x = b. For example
A = [2 1 1;
-1 1 -1;
1 2 3];
 
b = [2;3;-10];
 
x = A\b
x = 3×1
3 1 -5
So simple and yet there is a lot of numerical linear algebra behind backslash. You see, exactly how one writes an algorithm to solve a linear system of equations depends very much on the structure and type of the matrix A. Is A Hermitian? If so, is its diagonal all positive or all negative and so on? Then you have to ask is A single, double or complex? Is the matrix full or sparse? The answers to these questions determine which algorithm is best to use. Get it right and you get the correct answer very quickly, making maximal use of both your computer hardware and decades of numerical linear algebra research.
In the older MATLAB R2023b, the logic that backslash uses for full (i.e. not sparse) matrices is as follows
In the latest version of MATLAB, R2024a, we rearranged this flow-chart a little and added a test for tridiagonal matrices. This isn't a particularly expensive test since we are already some of the way there by doing the upper Hessenberg check.
As such, the backslash flow chart for full matrices is as follows (These flowcharts are from the documentation for mldivide by the way)
The practical upshot is that backslash is now significantly faster when the input matrix is dense and tridiagonal. In general, you are better off using sparse matrices when dealing with tridiagonal matrices (see later in this post) but sometimes you happen to have a dense one (e.g. from the output of ldl) and we wanted to ensure that backslash would deal with it efficiently.
Tridiagonal matrices are zero everywhere apart from the main diagonal and the two diagonals either side of it (the so-called superdiagonal and subdiagonal). Here's an example generated using MATLAB's gallery function
full(gallery("tridiag",8))
ans = 8×8
2 -1 0 0 0 0 0 0 -1 2 -1 0 0 0 0 0 0 -1 2 -1 0 0 0 0 0 0 -1 2 -1 0 0 0 0 0 0 -1 2 -1 0 0 0 0 0 0 -1 2 -1 0 0 0 0 0 0 -1 2 -1 0 0 0 0 0 0 -1 2
Tridiagonal matrices are particularly important because they appear in many areas of science and engineering including quantum mechanics, partial differential equations and splines to name a few.
To see how much faster we've made things, let's construct a problem involving a random tridiagonal matrix
n = 1e4;
mainDiag = rand(n, 1);
upperDiag = rand(n-1, 1);
lowerDiag = rand(n-1, 1);
triDiag = diag(mainDiag) + diag(upperDiag, 1) + diag(lowerDiag, -1);
Just to make sure that I haven't made a mistake while generating triDiag, I turn to the isbanded function. The lower and upper bandwidths of a tridiagonal matrix are both 1 so I can test as follows
isbanded(triDiag,1,1) % If this returns true, D is tridiagonal
ans = logical
1
OK, so now I'm convinced that D is tridiagonal and that I can use it to time this update to backslash
b = rand(n,1); % Construct the RHS of the equation
 
f = @() triDiag\b;
runtime = timeit(f)
runtime = 0.0716
On the same machine, R2023b did this in 0.5637 seconds so we have
  • R2024a: 0.0716 seconds
  • R2023b: 0.5637 seconds
Almost 7.9x speed-up! This is a little more than the 6.5x speed-up discussed in the release notes. The author of that section of the release notes used a Windows 11, AMD EPYC 74F3 24-Core Processor @ 3.19 GHz test system whereas my system is rather more modest.
cpuinfo
ans = struct with fields:
CPUName: '11th Gen Intel(R) Core(TM) i7-11700 @ 2.50GHz' Clock: '2501 MHz' Cache: 4194304 TotalMemory: 3.4010e+10 NumCPUs: 1 TotalCores: 8 OSType: 'Windows' OSVersion: 'Microsoft Windows 11 Pro' Hostname: 'DESKTOP-N3C7C6Q'
Clearly, how much faster the new version of backslash is depends on your system but whatever machine you use, it will definitely be faster in R2024a.
Going sparse for maximum speed
In linear algebra, the path to speed often relies on taking advantage of matrix structure and there is one aspect of the structure of tridiagonal matrices that is important -- the fact they are very sparse. We haven't sped this up in 24a but it's worth pointing out that if you can use sparse matrices for these problems then you really should. It's so much faster, even taking into account the speed-ups shown above. The memory benefits are huge too!
Let's convert triDiag to a sparse matrix to see
spTriDiag = sparse(triDiag);
f = @() spTriDiag\b;
runtime = timeit(f)
runtime = 7.2862e-04
That's almost 100x faster than the full-matrix version in R2024a and almost 800x faster than the full-matrix version in R2023b.
More on backslash
  • Backslash - A historical post where Cleve tells us how backslash came to represent the solution of linear systems in MATLAB
  • What is A\A? - The answer: A\A is always I, except when it isn't.
|
  • print

Comments

To leave a comment, please click here to sign in to your MathWorks Account or create a new one.