Today, I want to analyze the sitemaps on mathworks.com. These XML files, in standard locations, provide search engines with lists of the pages on the site. I want to compare these lists with the list I compiled by crawling the site. My goal is to find areas of the site that my crawl missed, as well as areas that I crawl but that are not covered by the sitemaps.
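To make the comparison concrete, here is a minimal sketch of the idea, not the code from the video. The sitemap text and the crawled URL list are made-up placeholders so the snippet runs offline; a real sitemap could instead be fetched as text with webread. The variable names are my own.

```matlab
% Hypothetical sitemap content (an assumption for illustration only).
xmlText = "<urlset>" + ...
    "<url><loc>https://www.mathworks.com/products.html</loc></url>" + ...
    "<url><loc>https://www.mathworks.com/support.html</loc></url>" + ...
    "</urlset>";

% Pull out the text between each <loc> ... </loc> pair.
sitemapUrls = extractBetween(xmlText, "<loc>", "</loc>");

% Hypothetical list of pages found by crawling.
crawledUrls = ["https://www.mathworks.com/products.html"; ...
               "https://www.mathworks.com/company.html"];

% Pages listed in the sitemap that the crawl never reached.
missedByCrawl = setdiff(sitemapUrls, crawledUrls);

% Pages the crawl reached that the sitemap does not list.
missingFromSitemap = setdiff(crawledUrls, sitemapUrls);
```

Using setdiff in both directions gives the two lists I am after: what the crawl missed and what the sitemaps leave out.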
Features covered in this code-along-style video include:
- webread, readtable
- contains, startsWith, extract, extractBetween, and the new pattern object
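As a rough illustration of the text functions listed above, here is a small sketch on a few example URLs. The URLs and variable names are placeholders I chose for the example, and the pattern functions assume a MATLAB release that includes the pattern object (R2020b or later).

```matlab
% Example URLs (placeholders chosen for illustration).
urls = ["https://www.mathworks.com/help/matlab/ref/webread.html"; ...
        "https://www.mathworks.com/products/matlab.html"; ...
        "https://blogs.mathworks.com/videos/"];

% Logical masks: which URLs are documentation pages, and which
% live on the main www site.
isHelp = contains(urls, "/help/");
isMain = startsWith(urls, "https://www.mathworks.com");

% Build a pattern: a run of letters followed by the literal ".html".
pagePattern = lettersPattern + ".html";

% Extract the page file name from the first URL.
pageName = extract(urls(1), pagePattern);
```

Patterns compose with the + operator, which tends to read more clearly than an equivalent regular expression for simple cases like this one.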
Follow me (@stuartmcgarrity) if you want to be notified via Twitter when I post.
Play the video in full screen mode for a better viewing experience.