Too much information about the size function

Posted by Steve Eddins, March 22, 2011

11 views (last 30 days) | 0 Likes | 16 comments

Today's post is proof that this old MATLAB dog can learn new tricks. The subject is this: Given an image array, how can you most easily determine the number of rows and columns it has? (In code, that is; I don't mean by looking at the Workspace Browser!)

I'm going to take an indirect path to get there:

1. I'll explain a general principle of MATLAB function behavior.

2. I'll show the that the size function violates this principle.

3. I'll show how a common use of size can lead you astray.

4. I'll show the code I used to use.

5. Finally I'll show how to do it a new "trick" (that Loren taught me) which uses a fairly recent addition to the MATLAB language.

There's nothing profound here, but there are some subtle issues about using size with multidimensional arrays that image processing users should probably be aware of. (I'll warn you here --- if you're already bored with this topic, the rest of the post doesn't get better. You should probably just go read one of the other MATLAB Central blogs instead.)

The general principle of MATLAB function behavior is about functions with multiple output arguments. For example, the Image Processing Toolbox function bwlabel has two output arguments. A sample call using both output arguments is:

  [L,n] = bwlabel(BW);

The first output argument is the label matrix, and the second argument is the number of labeled objects.

For most MATLAB functions, the meaning of each output argument does not depend on how many output arguments are specified when the function is called. In the example above, if you call bwlabel with only one output argument, that output is still the label matrix. The second output argument is silently dropped by MATLAB. And if you call the function with no output argument, the first output is still the label matrix and is used to set the automatic workspace variable ans.

I can think of three kinds of exceptions to this behavior. In the first case, common to many graphics functions, a function returns something (like a handle) if called with an output argument, but it returns nothing if called without an output argument. Examples in this category include plot and bar.

The second kind of exception is for functions that "pretty-print" something to the Command Window if called without an output argument and return a value if called with an output argument. Examples include dir and whos.

The third category is (unfortunately) the "everything else" exceptions. And size is in this group.

You can see the exceptional behavior with your plain ol' two-dimensional matrices. Consider the following two calls to size:

 A = rand(3,4);  % A is 3-by-4
 m = size(A);
 [m,n] = size(A);

m is not the same in both cases. In the first case, size returns the size vector, [3 4]. In the second case, m is the number of rows (3) and n is the number of columns (4).

For higher dimensional arrays it gets more interesting. The rule is this: If you call size with two or more output arguments, the last output argument is the number of elements in that array dimension and all the remaining dimensions of the array.

It's probably easier to just show you. Let's look at a 3-by-4-by-5 array and calls to size with a different number of output arguments.

B = rand(3,4,5);
[m,n,p] = size(B)

Not surprising. Now just call it with just two output arguments:

[m,n] = size(B)

Now the second output is 20 instead of 4! That's because 4*5 is 20. For fun, call size with 4 outputs:

[m,n,p,q] = size(B)

MATLAB is perfectly happy to treat B as a four-dimensional array whose last dimension has size 1. That's actually pretty useful, even essential, for writing reasonable-looking multidimensional code in MATLAB.

Now let me connect this back to the problem I posed at the very beginning: find the number of rows and columns in an image array. Let's make it a concrete example:

neb = imread('ngc6543a.jpg');
imshow(neb, 'InitialMagnification', 50)
title('Obligatory image')
xlabel('This blog is supposed to be about image processing after all')

Here's the naive attempt to ask for the number of rows and columns in our image:

[nrows,ncols] = size(neb)

nrows =

   650


ncols =

        1800

But unfortunately that's not right because neb is a 3-D array. Therefore the second output is the number of columns times 3, the number of color planes. There are two ways I've always used to make sure I get the right answer. The first is to call size with three output arguments, even though I only need two:

[nrows,ncols,junk] = size(neb)

nrows =

   650


ncols =

   600


junk =

     3

I always found this vaguely unsatisfactory because I don't like unused variables cluttering up my code. The second way requires two separate calls to size:

nrows = size(neb,1)

nrows =

   650

ncols = size(neb,2)

ncols =

   600

Loren pointed out to me recently that a feature added to R2009b, using the tilde character in input and output argument lists, can be used here. Used in an output argument list, the tilde character says to MATLAB, "Hey, I'm not going to need this output argument and don't want to even give it a variable name, so please just clean it up automatically for me." Loren and I both posted about this feature (hers and mine) if you want to go read more about it. As I recall, not everyone was a fan. Oh, well. I like it.

Anyway, here's how you can use it in the call to size:

[nrows,ncols,~] = size(neb)

nrows =

   650


ncols =

   600

Maybe this was more than you really wanted to know about size. However, I think that if you regularly write code that handles multidimensional arrays, eventually you'll be happy that you made it all the way to the end.

Published with MATLAB® 7.11