Stuart’s MATLAB Videos

Watch and Learn

Scraping Links from a List of Web Pages

My colleague Sherrie just asked me if I could extract the links from a specific set of pages, then store them in a spreadsheet. She just needs the links from the area in the center of the page and not on the top or bottom “boilerplate” area. I also plan to put the results for each page into a different worksheet of the spreadsheet. I will use a bunch of functions from the Text Analytics Toolbox that I have used before.

Features covered in this code-along style video include:

  • webread
  • htmlTree, extractHTMLText, findElement, getAttribute

Follow me (@stuartmcgarrity) if you want to be notified via Twitter when I post.

Play the video in full screen mode for a better viewing experience. 

|
  • print
  • send email

Comments

To leave a comment, please click here to sign in to your MathWorks Account or create a new one.