Finding and Removing HTML Elements from a Web Page
My script that looks for translation errors on our website is generating too many false positives. I need to ignore an area at the bottom of all pages, called the country selector which is a list of country names that are not translated. I first try to remove the country name strings after I have extracted the text from the web page, then I realize it would be better to find and remove specific HTML elements before extracting the text.
Features covered in this code-along style video include:
- Text Analytics Toolbox: getAttribute, extractHTML
- arrayfun
Play the video in full screen mode for a better viewing experience.


댓글
댓글을 남기려면 링크 를 클릭하여 MathWorks 계정에 로그인하거나 계정을 새로 만드십시오.