Stuart’s MATLAB Videos

Watch and Learn

Scraping Links from a Set of MATLAB Documentation Pages

My colleague Sam asked if I could help him understand how a set of documentation pages is linked together, and perhaps visualize those links as a graph.

Now, I do have a good idea of all the pages on our website and all the links between them, but he is only interested in the links in the body of each page, i.e., not the links in the menu, the footer, or the navigation on the left. So I need to find a way to extract just those links. After using webread to read the content, I think I can use a combination of functions from the Text Analytics Toolbox to process the HTML tags.
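To make the idea concrete, here is a minimal sketch of that approach; it is not the code from the video. It assumes the Text Analytics Toolbox is installed and that the body of the page sits in a div with the class "content". That class name, the sample HTML, and the page paths are all made up for illustration, so inspect the real pages to find the right selector. The HTML is inlined so the example runs without a network connection; for a real page you would get it from webread instead.

```matlab
% Stand-in for a real page. In practice: html = webread(url);
% The "nav", "content" and "footer" class names are hypothetical.
html = "<html><body>" + ...
    "<div class=""nav""><a href=""/menu.html"">Menu</a></div>" + ...
    "<div class=""content"">" + ...
    "<a href=""/help/a.html"">A</a> <a href=""/help/b.html"">B</a>" + ...
    "</div>" + ...
    "<div class=""footer""><a href=""/legal.html"">Legal</a></div>" + ...
    "</body></html>";

% Parse the HTML into a tree (Text Analytics Toolbox).
tree = htmlTree(html);

% Keep only anchors that are descendants of the body div, which
% skips the menu and footer links.
anchors = findElement(tree, "DIV.content A");

% Pull the link target out of each anchor.
links = getAttribute(anchors, "href");

% Build a directed graph with one edge from the page to each link.
% The source page path is hypothetical.
src = repmat("/help/index.html", size(links));
G = digraph(src, links);
plot(G)
```

Repeating this for every page in the set and concatenating the source and target lists before calling digraph would give the full link graph.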

Features covered in this code-along style video include:

Follow me (@stuartmcgarrity) if you want to be notified via Twitter when I post.


