Parsed HTML document
Optional baseUrl: string | URL | null
Optional base URL for resolving relative URLs
Assets metadata object with categorized URLs
Extracts all external assets referenced in the document, organized by type. All URLs are normalized to absolute format based on the document's base URL.
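The normalization step can be pictured with the standard WHATWG `URL` constructor. The sketch below is only an illustration of resolving a reference against a base URL; `toAbsoluteUrl` is a hypothetical helper name, not part of the documented API, and the extractor's real behavior for unresolvable URLs may differ.

```typescript
// Hypothetical helper (assumption): resolves one raw reference against an
// optional base URL and returns null when no absolute URL can be formed.
function toAbsoluteUrl(
  raw: string,
  baseUrl?: string | URL | null,
): string | null {
  const trimmed = raw.trim();
  if (trimmed === "") return null;
  try {
    // With a base, relative and protocol-relative references resolve against it.
    // Without one, only references that are already absolute can be parsed.
    return baseUrl ? new URL(trimmed, baseUrl).href : new URL(trimmed).href;
  } catch {
    // The URL constructor throws a TypeError on unparseable input.
    return null;
  }
}

const base = "https://example.com/blog/post.html";
console.log(toAbsoluteUrl("img/logo.png", base));
console.log(toAbsoluteUrl("//cdn.example.com/app.js", base));
console.log(toAbsoluteUrl("img/logo.png"));
```

Returning `null` rather than throwing is a design choice of this sketch: it lets a caller skip malformed references without aborting the rest of the extraction.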
The extractor finds assets from:
- <img>, <picture>, srcset, OpenGraph meta tags
- <link rel="stylesheet">
- <script src>
- @font-face and url() with font extensions
- <video>, <audio>, <source>, <track>
- <link rel="manifest">
- <link rel="preload"> and <link rel="prefetch">
- <link rel="dns-prefetch"> and <link rel="preconnect">
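Several of the sources above are `<link>` elements that differ only in their `rel` value. As a rough sketch of how such links might be sorted into groups, the function below maps a `rel` attribute to a category. The category names (`stylesheet`, `manifest`, `resourceHint`, `connectionHint`) and the function `classifyLinkRel` are assumptions made for illustration; the source does not state the field names of the returned metadata object.

```typescript
// Hypothetical category names (assumption): the real result object may differ.
type LinkCategory =
  | "stylesheet"
  | "manifest"
  | "resourceHint"
  | "connectionHint";

// A Map avoids accidental hits on inherited object keys such as "constructor".
const REL_TO_CATEGORY = new Map<string, LinkCategory>([
  ["stylesheet", "stylesheet"],
  ["manifest", "manifest"],
  ["preload", "resourceHint"],
  ["prefetch", "resourceHint"],
  ["dns-prefetch", "connectionHint"],
  ["preconnect", "connectionHint"],
]);

// rel is a space-separated, case-insensitive token list; this sketch returns
// the category of the first token it recognizes, or null if none match.
function classifyLinkRel(rel: string): LinkCategory | null {
  for (const token of rel.toLowerCase().split(/\s+/)) {
    const category = REL_TO_CATEGORY.get(token);
    if (category !== undefined) return category;
  }
  return null;
}

console.log(classifyLinkRel("stylesheet"));
console.log(classifyLinkRel("DNS-Prefetch"));
console.log(classifyLinkRel("icon"));
```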
Extracts asset metadata from a parsed HTML document.