Authoritative URL for the page.
Discovered feed URLs (RSS, Atom, JSON Feed) as URL objects.
title (optional): Page title (cleaned, from best available source).
description (optional): Page description (from best available source).
image (optional): Page key visual/image URL (from best available source).
icon (optional): Best available icon/favicon for the site.
language (optional): Primary language code (ISO 639-1).
region (optional): Region code (ISO 3166-1 alpha-2).
Raw HTML content of the page (UTF-8).
Plain text content extracted from the HTML.
Internal links found on the page (same domain, excluding current URL).
External links found on the page (different domains).
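The internal/external split described above could be sketched roughly as follows. This is only an illustration, not the library's actual logic: the function name `partitionLinks` is hypothetical, and "same domain" is assumed here to mean an exact hostname match, which may differ from how the real implementation treats subdomains.

```typescript
// Hypothetical helper (not part of the documented API): splits raw hrefs into
// internal and external links relative to the current page URL.
function partitionLinks(
  current: URL,
  hrefs: string[],
): { internal: URL[]; external: URL[] } {
  const internal: URL[] = [];
  const external: URL[] = [];
  for (const href of hrefs) {
    let link: URL;
    try {
      // Resolve relative hrefs against the current page.
      link = new URL(href, current);
    } catch {
      continue; // skip hrefs that cannot be parsed as URLs
    }
    if (link.hostname !== current.hostname) {
      // Assumption: a different hostname counts as a different domain.
      external.push(link);
    } else if (link.href !== current.href) {
      // Same hostname, and not the current page itself.
      internal.push(link);
    }
  }
  return { internal, external };
}

const page = new URL("https://example.com/a");
const links = partitionLinks(page, [
  "/b",
  "https://example.com/a",
  "https://other.org/x",
]);
```

Here `/b` resolves to an internal link, the link back to the current page is dropped, and `https://other.org/x` is classified as external.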
Gathered website data.
Remarks
This interface represents the complete gathered data from a website, including the authoritative URL and all extracted metadata. It will be extended incrementally with more properties.
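As a rough sketch of the shape this interface might take: the interface name `WebsiteData` is hypothetical, the names `title`, `description`, `image`, `icon`, `language`, and `region` come from the property listing, and every other property name and all types shown are assumptions rather than the documented signatures.

```typescript
// Hypothetical interface name; see the lead-in for which names are assumed.
interface WebsiteData {
  url: URL;              // authoritative URL (name and type assumed)
  feeds: URL[];          // discovered feeds as URL objects (name assumed)
  title?: string;        // cleaned page title
  description?: string;  // page description
  image?: string;        // key visual/image URL (type assumed)
  icon?: string;         // icon/favicon (type assumed)
  language?: string;     // ISO 639-1 code, e.g. "en"
  region?: string;       // ISO 3166-1 alpha-2 code, e.g. "US"
  html: string;          // raw HTML, UTF-8 (name assumed)
  text: string;          // extracted plain text (name assumed)
  internalLinks: URL[];  // same-domain links (name assumed)
  externalLinks: URL[];  // other-domain links (name assumed)
}

// Hypothetical consumer: builds a display label and falls back gracefully
// when the optional properties are absent.
function displayLabel(data: WebsiteData): string {
  const title = data.title ?? data.url.hostname;
  const locale = [data.language, data.region].filter(Boolean).join("-");
  return locale ? `${title} [${locale}]` : title;
}

const sample: WebsiteData = {
  url: new URL("https://example.com/"),
  feeds: [new URL("https://example.com/feed.xml")],
  language: "en",
  html: "<html><body>Hello</body></html>",
  text: "Hello",
  internalLinks: [],
  externalLinks: [],
};
```

Because six of the properties are optional, consumers should guard each access, as `displayLabel` does with `??` and `filter(Boolean)`.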