Doug's MATLAB Video Tutorials

January 29th, 2010

Advanced: Speeding up an enumerated string search in a data mining application

This week, I am going to cover a technique that I used to speed up a data mining application that had a search through enumerated strings looking for all the matches. Using this technique of substituting numeric values for the strings, I was able to see a twenty fold speed increase.
Also, my colleague Scott thought that people interested in this video should check out categorical arrays in the statistics toolbox.

3 Responses to “Advanced: Speeding up an enumerated string search in a data mining application”

  1. sja replied on :

    Thanks, would be great with more of this type of tricks.

  2. Claus replied on :

    LOVE the tricks. Even stuff that you think is mundane may be a revelation to us

    Thanks,

    Claus
    Cambridge, Mass
    Stats, Optim, Finance tool boxes

  3. dhull replied on :

    @Claus,

    Glad to hear the kind words. I was doing this data mining project for a totally different reason and thought this would be a good example.

    So I know exactly what you are enjoying, what did you take away from this video?

    Doug

Leave a Reply

Wrap code fragments inside <pre> tags, like this:

<pre class="code">
a = magic(3);
sum(a)
</pre>

If you have a "<" character in your code, either follow it with a space or replace it with "&lt;" (including the semicolon).


MathWorks

Doug Hull is a proud MathWorker who is on a mission to help you with MATLAB.

Doug's picture

These postings are the author's and don't necessarily represent the opinions of The MathWorks.