Finding the Outgoing Links From a Set of Web Pages Using Crawl Data
Here I try to find all the unique outgoing links from a set of web pages, that is, the links that remain once the links common to all the pages are excluded. I don’t need to read the pages themselves to find the links, because I already have a network graph of our website from a previous crawl.
Features covered in this code-along style video include:
- table, readtable, and detectImportOptions
- setdiff, intersect, and vertcat
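As a rough illustration of how these functions can fit together, here is a minimal sketch. It is not the code from the video. The table layout (one row per link, with hypothetical `Source` and `Target` variables), the page names, and the reading of "common to all the pages" as "present on every crawled page" are all assumptions made for the example. With real crawl data, the table would more likely be loaded from a file with `detectImportOptions` and `readtable` than built by hand.

```matlab
% Hypothetical crawl data: one row per link (graph edge) found by the crawler.
% In practice this table would probably come from a file, for example via
% opts = detectImportOptions(filename); edges = readtable(filename, opts);
Source = ["/a"; "/a"; "/a"; "/b"; "/b"; "/b"; "/c"; "/c"];
Target = ["/home"; "/x"; "/y"; "/home"; "/y"; "/z"; "/home"; "/w"];
edges = table(Source, Target);

% Pages whose outgoing links we want (assumed names).
pages = ["/a"; "/b"];

% Find the links that appear on every crawled page (e.g. navigation links)
% by intersecting the target lists of all source pages.
allPages = unique(edges.Source);
common = edges.Target(edges.Source == allPages(1));
for k = 2:numel(allPages)
    common = intersect(common, edges.Target(edges.Source == allPages(k)));
end

% For each page of interest, drop the common links with setdiff and
% stack what is left with vertcat.
outgoing = strings(0, 1);
for k = 1:numel(pages)
    links = edges.Target(edges.Source == pages(k));
    outgoing = vertcat(outgoing, setdiff(links, common));
end

% Remove duplicates that occur because several pages share a link.
uniqueLinks = unique(outgoing);
disp(uniqueLinks)
```

For this toy data the only link shared by every page is "/home", so the result is "/x", "/y", and "/z". If the crawl is stored as a `digraph` object rather than an edge table, the edge list is available through its `Edges` property, although the variable names will differ from the ones assumed here.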
Play the video in full screen mode for a better viewing experience.