What is Google NotebookLM?
Direct Answer: Google NotebookLM bot operated by Google for AI training.
Google NotebookLM is a bot used by notebooklm.google for AI training purposes. It is operated by Google and has a known user-agent string 'Google-NotebookLM'. The bot does not follow robots.txt instructions from the database.
User-Agent Identification
The following user-agent strings identify Google NotebookLM in your live traffic data:
Google-NotebookLM
robots.txt Rules for Google NotebookLM
Respects robots.txt: No
This bot does not commit to following robots.txt
Google NotebookLM does not officially follow robots.txt directives. The only reliable way to control access is through server-side blocking (IP filtering, user-agent rules in your web server config) combined with log monitoring to verify effectiveness.
Need continuous verification across 500+ bots? Can AI See It automates this.
Crawl Behavior
Frequency:Not Documented
Request Pattern:Not Documented
Crawl Activity Index
Relative crawl activity for Google NotebookLM over the past 28 days. Higher values indicate increased crawling intensity compared to the period baseline.
View recent activity data (last 7 days)
| Date | Activity Index |
|---|---|
| Mar 26, 2026 | 88.0 |
| Mar 27, 2026 | 82.7 |
| Mar 28, 2026 | 83.1 |
| Mar 29, 2026 | 81.8 |
| Mar 30, 2026 | 87.3 |
| Mar 31, 2026 | 90.2 |
| Apr 1, 2026 | 88.9 |
Source: Cloudflare Radar
Why track Google NotebookLM traffic?
Measure what Google gives back. Google NotebookLM takes your content for AI training — but does Google send any traffic in return through other products? Track whether the trade-off is worth it before deciding to block.
Understand what content is being collected for AI training. Google NotebookLM crawls your site to gather data that may train AI models. Tracking its activity reveals which pages are selected — and which are skipped.
Make an informed block-or-allow decision. Blocking Google NotebookLM prevents your content from being used in future model training. But first, measure the volume: how many pages does it fetch, how often, and does Google send any referral traffic through other products?
Detect content harvesting patterns. If Google NotebookLM is systematically crawling your highest-value content (product pages, proprietary research, premium articles), you may want to restrict access using robots.txt or server-side rules.
What does Google NotebookLM crawling actually cost you?
AI training bots like Google NotebookLM collect your content to improve future AI models. Unlike AI search bots, there's no direct referral pipeline — Google NotebookLM doesn't cite sources or send traffic back to your site.
What you give
- Server resources for every crawl request
- Your content, expertise, and original research
- Data that improves a competing AI product
What you get back
- No direct referral traffic from Google NotebookLM
- No attribution in AI model outputs
- No revenue share from model usage
This doesn't automatically mean you should block Google NotebookLM. But you need to measure the real cost before deciding. Google may send traffic through other products (Google AI Overviews and Gemini) — blocking the training bot might not affect referrals at all, or it might. Only log data tells you.
What Can AI See It measures for AI training bots
How many pages Google NotebookLM fetches from your site
Which pages and sections Google NotebookLM prioritizes
Do Google's OTHER products send you traffic?
Does Google NotebookLM actually respect your robots.txt?
How is this different from prompt testing tools? Prompt testing checks if AI mentions your brand in simulated queries. Can AI See It measures what actually happens: real crawls, real referrals, real conversions — from your live traffic data.
Read: Why live traffic monitoring beats prompt testing →Log Verification
To verify Google NotebookLM traffic in your live traffic data:
- Search access logs for the user-agent strings listed above
- Check if the IP addresses match documented ranges (if provided by Google)
- Verify the crawl pattern matches documented behavior
- Use reverse DNS lookup for additional verification if available
Note: Observed behavior in production environments may differ from official documentation. Live traffic monitoring provides the only reliable verification of actual bot behavior.
Undocumented Information
The following information is not officially documented for Google NotebookLM:
- crawl frequency
- request pattern
- IP verification
- JavaScript rendering
Measure your Crawl-to-Referral Ratio for Google NotebookLM
See how much traffic Google actually sends back to your site relative to how much content Google NotebookLM takes.
- Connect Google NotebookLM crawls in your logs with referral sessions in analytics
- Calculate your CRR — the metric prompt testing tools can't provide
- Make data-driven block/allow decisions for every AI bot
Measure business impact from Google NotebookLM
The question isn't just whether to block Google NotebookLM — it's what you lose or gain from its crawling activity.
- Crawl volume: how many pages Google NotebookLM collects from your site
- Content value: which content categories are targeted most
- Cross-platform CRR: does Google send traffic through other products?
- Referral tracking: Google NotebookLM takes — measure what Google gives back. Track actual visits arriving from Google's products to your site.
Based on your live traffic data and analytics — not synthetic prompt tests.
Official Documentation
View Official Google NotebookLM Documentation →
Information sourced from official documentation. Content generated with AI assistance.