Skip to main content
Can AI see it

Know what AI sees. Measure what it's worth.

What is Meta-ExternalAgent?

Direct Answer: Meta-ExternalAgent is a bot operated by Meta for AI training purposes, specifically for training AI models or improving products by indexing content directly.

Operator: Meta Type: AI Training Crawler Purpose: Training AI models or improving products by indexing content directly AI Training

The Meta-ExternalAgent is a bot used for purposes such as training AI models or improving products by indexing content directly. It is operated by Meta and falls under the category of ai-training. The bot is known to follow robots.txt instructions.

User-Agent Identification

The following user-agent strings identify Meta-ExternalAgent in your live traffic data:

  • meta-externalagent/1.1
  • meta-externalagent/1.1 (+https://developers.facebook.com/docs/sharing/webmasters/crawler)

robots.txt Rules for Meta-ExternalAgent

Respects robots.txt: Yes

Use the following robots.txt rules to control Meta-ExternalAgent access:

# Block Meta-ExternalAgent
User-agent: meta-externalagent/1.1
Disallow: /

# Allow Meta-ExternalAgent
User-agent: meta-externalagent/1.1
Allow: /

Robots.txt is a directive, not a barrier

Meta states that Meta-ExternalAgent respects robots.txt. However, configuration mistakes, caching delays, and edge cases mean your directives may not always be followed as expected. Live traffic verification confirms whether Meta-ExternalAgent actually obeys your rules in practice.

Need continuous verification across 500+ bots? Can AI See It automates this.

Crawl Behavior

Frequency:Not Documented

Request Pattern:Not Documented

Crawl Activity Index

Relative crawl activity for Meta-ExternalAgent over the past 28 days. Higher values indicate increased crawling intensity compared to the period baseline.

View recent activity data (last 7 days)
Date Activity Index
Mar 28, 2026 53.0
Mar 29, 2026 50.0
Mar 30, 2026 52.2
Mar 31, 2026 63.9
Apr 1, 2026 66.5
Apr 2, 2026 65.5
Apr 3, 2026 67.6

Source: Cloudflare Radar

Why track Meta-ExternalAgent traffic?

Measure what Meta gives back. Meta-ExternalAgent takes your content for AI training — but does Meta send any traffic in return through other products? Track whether the trade-off is worth it before deciding to block.

Understand what content is being collected for AI training. Meta-ExternalAgent crawls your site to gather data that may train AI models. Tracking its activity reveals which pages are selected — and which are skipped.

Make an informed block-or-allow decision. Blocking Meta-ExternalAgent prevents your content from being used in future model training. But first, measure the volume: how many pages does it fetch, how often, and does Meta send any referral traffic through other products?

Detect content harvesting patterns. If Meta-ExternalAgent is systematically crawling your highest-value content (product pages, proprietary research, premium articles), you may want to restrict access using robots.txt or server-side rules.

What does Meta-ExternalAgent crawling actually cost you?

AI training bots like Meta-ExternalAgent collect your content to improve future AI models. Unlike AI search bots, there's no direct referral pipeline — Meta-ExternalAgent doesn't cite sources or send traffic back to your site.

What you give

  • Server resources for every crawl request
  • Your content, expertise, and original research
  • Data that improves a competing AI product

What you get back

  • No direct referral traffic from Meta-ExternalAgent
  • No attribution in AI model outputs
  • No revenue share from model usage

This doesn't automatically mean you should block Meta-ExternalAgent. But you need to measure the real cost before deciding. Meta may send traffic through other products (Meta's AI products) — blocking the training bot might not affect referrals at all, or it might. Only log data tells you.

What Can AI See It measures for AI training bots

Crawl volume

How many pages Meta-ExternalAgent fetches from your site

Content targeting

Which pages and sections Meta-ExternalAgent prioritizes

Cross-platform CRR

Do Meta's OTHER products send you traffic?

Compliance check

Does Meta-ExternalAgent actually respect your robots.txt?

How is this different from prompt testing tools? Prompt testing checks if AI mentions your brand in simulated queries. Can AI See It measures what actually happens: real crawls, real referrals, real conversions — from your live traffic data.

Read: Why live traffic monitoring beats prompt testing →

Log Verification

To verify Meta-ExternalAgent traffic in your live traffic data:

  1. Search access logs for the user-agent strings listed above
  2. Check if the IP addresses match documented ranges (if provided by Meta)
  3. Verify the crawl pattern matches documented behavior
  4. Use reverse DNS lookup for additional verification if available

Note: Observed behavior in production environments may differ from official documentation. Live traffic monitoring provides the only reliable verification of actual bot behavior.

Undocumented Information

The following information is not officially documented for Meta-ExternalAgent:

  • crawl frequency
  • request pattern
  • IP verification
  • JavaScript rendering

Measure your Crawl-to-Referral Ratio for Meta-ExternalAgent

See how much traffic Meta actually sends back to your site relative to how much content Meta-ExternalAgent takes.

  • Connect Meta-ExternalAgent crawls in your logs with referral sessions in analytics
  • Calculate your CRR — the metric prompt testing tools can't provide
  • Make data-driven block/allow decisions for every AI bot

Measure business impact from Meta-ExternalAgent

The question isn't just whether to block Meta-ExternalAgent — it's what you lose or gain from its crawling activity.

  • Crawl volume: how many pages Meta-ExternalAgent collects from your site
  • Content value: which content categories are targeted most
  • Cross-platform CRR: does Meta send traffic through other products?
  • Referral tracking: Meta-ExternalAgent takes — measure what Meta gives back. Track actual visits arriving from Meta's products to your site.
Audit Meta-ExternalAgent crawl activity on your site →

Based on your live traffic data and analytics — not synthetic prompt tests.

Official Documentation

View Official Meta-ExternalAgent Documentation →

Information sourced from official documentation. Content generated with AI assistance.