Can AI see it

Know what AI sees. Measure what it's worth.

What is Siteimprove Crawl?

Direct Answer: Siteimprove Crawl is a bot operated by Siteimprove for SEO and content suite analysis.

Operator: Siteimprove
Type: SEO Tool
Purpose: SEO and content suite analysis

The Siteimprove Crawl bot serves the Siteimprove content suite, which includes Quality Assurance, Accessibility, Policy, and SEO. It crawls websites on port 80 for HTTP and port 443 for HTTPS. The bot identifies itself through specific IP addresses and user-agent strings.

User-Agent Identification

The following user-agent strings identify Siteimprove Crawl in your live traffic data:

  • Mozilla/5.0 (compatible; MSIE 10.0; Windows NT 6.1; Trident/6.0) SiteCheck-sitecrawl
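As a rough illustration of how the string above might be matched in practice, the following Python sketch checks a User-Agent header for the distinctive token. Matching on the `SiteCheck-sitecrawl` suffix alone (rather than the full string) is an assumption made here, and the function name is hypothetical.

```python
import re

# Token taken from the user-agent string listed above. Matching only this
# suffix, case-insensitively, is an assumption of this sketch.
SITEIMPROVE_TOKEN = re.compile(r"SiteCheck-sitecrawl", re.IGNORECASE)


def is_siteimprove_crawl(user_agent: str) -> bool:
    """Return True if the User-Agent value contains the Siteimprove Crawl token."""
    return bool(SITEIMPROVE_TOKEN.search(user_agent or ""))
```

A user-agent match only shows what a client claims to be; it does not prove the request came from Siteimprove.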

robots.txt Rules for Siteimprove Crawl

Respects robots.txt: No

The following robots.txt rules target Siteimprove Crawl, although the bot does not commit to honoring them. Use one block or the other, not both:

# Block Siteimprove Crawl
User-agent: SiteimproveBot-Crawler
Disallow: /

# Allow Siteimprove Crawl
User-agent: SiteimproveBot-Crawler
Allow: /

This bot does not commit to following robots.txt

Siteimprove Crawl does not officially follow robots.txt directives. The only reliable way to control access is through server-side blocking (IP filtering, user-agent rules in your web server config) combined with log monitoring to verify effectiveness.
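One way server-side user-agent blocking could look at the application layer is a small WSGI middleware, sketched below. The class name is hypothetical, the token comes from the user-agent string on this page, and the choice of a 403 response is an assumption; a web server or firewall rule would achieve the same effect earlier in the request path.

```python
from typing import Callable, Iterable

# Lower-cased token from the user-agent string listed on this page.
BLOCKED_TOKEN = "sitecheck-sitecrawl"


class BlockSiteimproveCrawl:
    """WSGI middleware that answers 403 when the User-Agent contains the token."""

    def __init__(self, app: Callable):
        self.app = app

    def __call__(self, environ: dict, start_response: Callable) -> Iterable[bytes]:
        user_agent = environ.get("HTTP_USER_AGENT", "").lower()
        if BLOCKED_TOKEN in user_agent:
            body = b"Forbidden"
            start_response("403 Forbidden", [
                ("Content-Type", "text/plain"),
                ("Content-Length", str(len(body))),
            ])
            return [body]
        # Every other request passes through to the wrapped application.
        return self.app(environ, start_response)
```

Because user-agent strings are trivially changed, a rule like this is best paired with IP filtering and with log checks that confirm the block is actually taking effect.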

Need continuous verification across 500+ bots? Can AI See It automates this.

Crawl Behavior

Frequency: Continuous

Request Pattern: Crawls run on port 80 for HTTP and port 443 for HTTPS

Official Documentation Quotes

"This article provides details for the IP addresses and user-agent strings used by Siteimprove on your website."

Crawl Activity Index

Relative crawl activity for Siteimprove Crawl over the past 28 days. Higher values indicate increased crawling intensity compared to the period baseline.

Recent activity data (last 7 days):

Date          Activity Index
Mar 26, 2026  88.0
Mar 27, 2026  82.7
Mar 28, 2026  83.1
Mar 29, 2026  81.8
Mar 30, 2026  87.3
Mar 31, 2026  90.2
Apr 1, 2026   88.8

Source: Cloudflare Radar

Why track Siteimprove Crawl traffic?

Control third-party crawl impact on your server. Siteimprove Crawl crawls your site to build Siteimprove's SEO database. Although that data is useful for competitive analysis, crawlers of this kind can consume significant server resources on large sites.

Identify who's analyzing your site. Siteimprove Crawl visits reveal when competitors or agencies are running audits on your domain.

Manage crawl priority. If Siteimprove Crawl is consuming server capacity you'd rather reserve for search engine crawlers, you can throttle or block it based on measured volume.

Surface 4XX and 5XX errors before search engines find them. If Siteimprove Crawl reports broken pages or server errors in its crawl data, you can fix those issues before search engine crawlers encounter them and your rankings suffer.

Is Siteimprove Crawl worth the server resources?

Unlike search engine crawlers, Siteimprove Crawl doesn't send you any referral traffic; it feeds a third-party tool.

That's not necessarily a problem, since Siteimprove's data may power tools you use yourself. The question is whether Siteimprove Crawl's crawl volume is proportionate to its value.

What Can AI See It measures

Crawl volume

Requests per day and bandwidth consumed by Siteimprove Crawl

Resource share

Percentage of your total bot traffic that comes from Siteimprove Crawl

Fake bot detection

Scrapers spoofing Siteimprove Crawl's user-agent string
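To make the first two measurements concrete, here is a minimal Python sketch that computes requests and bytes per day for Siteimprove Crawl, plus its share of all bot requests. It is not a description of how Can AI See It works internally. The `Hit` record, its `is_bot` flag, and the function name are hypothetical; the sketch assumes log lines have already been parsed and classified by some other step.

```python
from collections import defaultdict
from dataclasses import dataclass

TOKEN = "sitecheck-sitecrawl"  # lower-cased token from the user-agent string


@dataclass
class Hit:
    day: str          # e.g. "2026-04-01"
    user_agent: str
    bytes_sent: int
    is_bot: bool      # hypothetical flag set by whatever bot classifier you use


def crawl_metrics(hits):
    """Return (per-day requests and bytes for Siteimprove Crawl, % share of bot requests)."""
    per_day = defaultdict(lambda: {"requests": 0, "bytes": 0})
    bot_total = 0
    crawler_total = 0
    for hit in hits:
        if not hit.is_bot:
            continue  # human traffic is excluded from the share calculation
        bot_total += 1
        if TOKEN in hit.user_agent.lower():
            crawler_total += 1
            per_day[hit.day]["requests"] += 1
            per_day[hit.day]["bytes"] += hit.bytes_sent
    share = 100.0 * crawler_total / bot_total if bot_total else 0.0
    return dict(per_day), share
```

The share is computed over requests rather than bytes; dividing by bandwidth instead would be an equally reasonable choice for sites where response sizes vary widely.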

Log Verification

To verify Siteimprove Crawl traffic in your live traffic data:

  1. Search access logs for the user-agent strings listed above
  2. Check whether the source IP addresses fall within the ranges Siteimprove publishes
  3. Verify the crawl pattern matches documented behavior
  4. Use reverse DNS lookup for additional verification if available
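The first two steps above can be sketched in Python roughly as follows. The sketch assumes access logs in the common "combined" format, and the `published_ranges` argument stands in for whatever ranges Siteimprove actually publishes; none are reproduced here. The function name is hypothetical.

```python
import ipaddress
import re

# Pattern for the "combined" access log format (an assumption about your server).
LOG_LINE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[[^\]]+\] "[^"]*" \d{3} \S+ "[^"]*" "(?P<ua>[^"]*)"'
)
TOKEN = "sitecheck-sitecrawl"  # lower-cased token from the user-agent string


def classify_hits(lines, published_ranges):
    """Split Siteimprove-labelled requests into verified and suspect source IPs."""
    networks = [ipaddress.ip_network(r) for r in published_ranges]
    verified, suspect = [], []
    for line in lines:
        match = LOG_LINE.match(line)
        if not match or TOKEN not in match["ua"].lower():
            continue  # unparseable line or a different client
        try:
            ip = ipaddress.ip_address(match["ip"])
        except ValueError:
            continue  # first field is a hostname, not an IP address
        if any(ip in net for net in networks):
            verified.append(str(ip))
        else:
            suspect.append(str(ip))
    return verified, suspect
```

Requests in the suspect list carry the Siteimprove user-agent but come from outside the supplied ranges, which is what a spoofing scraper would look like. For the reverse DNS step, Python's `socket.gethostbyaddr` can be applied to those addresses, keeping in mind that it performs a network lookup and may fail for addresses without a PTR record.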

IP Verification: Siteimprove publishes its official IP ranges for verification. A text file containing all IP addresses is available for download.
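A downloaded address list could be turned into something checkable with a short Python helper like the one below. The file's exact layout is not described here, so the sketch assumes one IP address or CIDR range per line, with blank lines and `#` comments ignored; the function name is hypothetical.

```python
import ipaddress


def parse_ip_list(text):
    """Parse one address or CIDR range per line into ip_network objects."""
    networks = []
    for raw in text.splitlines():
        entry = raw.split("#", 1)[0].strip()  # drop trailing comments
        if not entry:
            continue
        # strict=False lets an entry with host bits set normalise to its network.
        networks.append(ipaddress.ip_network(entry, strict=False))
    return networks
```

A single address such as `192.0.2.10` becomes a one-host network, so membership tests work the same way for individual addresses and for ranges.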

Note: Observed behavior in production environments may differ from official documentation. Live traffic monitoring provides the only reliable verification of actual bot behavior.

Undocumented Information

The following information is not officially documented for Siteimprove Crawl:

  • crawl frequency details
  • JavaScript rendering details

See which SEO tools are crawling your site, and how much they cost you

  • Identify third-party crawlers consuming your server resources
  • Separate SEO tool traffic from search engine crawls
  • Detect fake bots spoofing Siteimprove Crawl's user-agent

Information sourced from official documentation. Content generated with AI assistance.