Heading & Meta Extractor

Audit the structure of any webpage.

What This Tool Does

The Heading and Meta Extractor scans any publicly accessible webpage and extracts its heading hierarchy (H1 through H6), core meta tags (title, description, robots, canonical), and any JSON-LD structured data blocks. It gives you a quick on-page SEO audit without needing browser extensions or manual source code inspection.

Inputs

  • Page URL: Enter the full URL of the page you want to scan and click Scan Page.

How It Works

The tool sends the URL to a server-side proxy, which fetches the raw HTML. The response is parsed client-side using the DOMParser API to extract the title element, meta description, robots directives, canonical link, all heading elements in document order, and any <script type="application/ld+json"> blocks. Results are displayed in organized sections.

Understanding the Results

  • Basic Meta: Shows the title, description, robots tag, and canonical URL. Missing elements are labeled clearly.
  • Header Outline: Displays all headings in a tree-like indented format, making it easy to spot hierarchy issues such as missing H1 tags or skipped heading levels.
  • Structured Data: Lists any JSON-LD schema blocks found on the page with a preview of each block's content.

Page Structure

Basic Meta

    Header Outline

    Structured Data (JSON-LD)

    Step-by-Step Example

    1. Enter a URL such as https://example.com and click Scan Page.
    2. Review the Basic Meta section. Confirm the title is present and within 60 characters.
    3. Check that a meta description exists and is between 120 and 160 characters.
    4. Verify the canonical URL points to the correct page.
    5. Review the Header Outline. Ensure there is exactly one H1 and headings follow a logical order.
    6. Check the Structured Data section for valid JSON-LD blocks.

    Use Cases

    • Quick on-page SEO audits before publishing new content.
    • Verifying heading hierarchy across site templates.
    • Checking that canonical tags are correctly implemented after a URL migration.
    • Confirming JSON-LD structured data is present on key pages.
    • Competitive analysis of how other sites structure their headings and meta tags.

    Limitations and Notes

    • Only publicly accessible pages can be scanned. Login-protected or IP-restricted pages are not supported.
    • JavaScript-rendered content may not be captured since the tool parses raw HTML.
    • The tool shows the first 200 characters of each JSON-LD block. Use a dedicated validator for full schema inspection.
    • Dynamic meta tags set by JavaScript frameworks may not appear in the parsed HTML.

    Frequently Asked Questions

    How many H1 tags should a page have?

    One H1 per page is the safest rule — it tells search engines and screen readers what the page is about. HTML5 technically allows multiple H1s inside section elements, but in practice it confuses crawlers and accessibility tools. Exceptions: some CMS templates put the site name in an H1 in the header — that's a template problem worth fixing. To verify, run the page through our Heading & Meta Extractor; it lists every H1 and H2 in document order so you can spot duplicates fast.

    Why is heading hierarchy important for SEO?

    Headings give Google a quick map of your content. A clean structure — H1, then H2s for main sections, then H3s under those — helps the crawler understand topic flow and can win you featured snippets. It also helps screen readers and skim readers, which keeps users on the page longer. Compare these: a page with one H1 and four H2s reads like a clear article; a page with three H1s and skipped H3 levels reads like a mess. Run a check after every long article.

    What meta tags are important for SEO?

    Five really matter: the title tag, meta description, robots meta, canonical, and viewport. Open Graph and Twitter cards aren't ranking factors but they control how the page looks when shared on social. The hreflang tag matters if you serve multiple languages or regions. The 'keywords' meta tag is dead — Google ignored it years ago, so skip it. To audit, paste any URL into our Heading & Meta Extractor; it pulls all the tags so you can see what's there and what's missing.

    How to find H1 and H2 tags on a page?

    Three easy options. One — open the page, right-click, and 'View Page Source', then search for h1 and h2. Two — install a browser extension like SEO Meta in 1 Click. Three (fastest for a quick audit) — paste the URL into our Heading & Meta Extractor and it returns every heading from H1 down to H6 in order, plus all the meta tags. The third one wins when you're auditing twenty pages in a row.

    What is JSON-LD structured data?

    JSON-LD is a way to feed structured data to search engines using a small block of JavaScript Object Notation in the page head. It tells Google exactly what the page is — an article, product, FAQ, recipe, event — so it can show rich results like star ratings or FAQ accordions. Example: a Product schema with price and availability can make your listing stand out in shopping results. It doesn't directly affect rankings, but the visual upgrade often lifts CTR. Validate yours after every change.

    What is robots meta tag?

    The robots meta tag is a single line in the page head that tells search engines what to do with the page. Common values: 'index, follow' (default — crawl and index), 'noindex' (don't show in search results), 'nofollow' (don't pass link credit through outbound links), and 'noarchive' (don't cache the page). Use noindex for thank-you pages, internal search results, and staging URLs. One misplaced 'noindex' on a money page can drop you out of search overnight, so always verify after deployments.

    How to check meta tags of a website?

    Two quick paths. For a single page — paste the URL into our Heading & Meta Extractor; it returns title, description, robots, canonical, viewport, OG tags, and Twitter cards in one view. For a whole site, run a crawl with Screaming Frog or Sitebulb and export the meta data to a spreadsheet. Once you have it in a sheet, sort for empty cells and duplicate values — that gives you the priority fix list. Re-run the audit monthly to catch regressions.

    Sources and References

    Related SEO Tools