Rate Limits & Quotas
X21 has several limits to ensure optimal performance and manage costs. Understanding these helps you work within constraints.
Token Limits
Conversation Limit
200,000 tokens per conversation
Includes:
- All your messages
- All AI responses
- Tool definitions (background)
- System messages
- Attached files (~100KB per PDF page estimate)
Per-Response Limits
- Output: 32,000 tokens reserved
- Thinking: 1,600 tokens reserved
- Combined: Up to 33,600 tokens per AI response
What Are Tokens?
Tokens are units of text:
- ~4 characters = 1 token
- “Hello” = 1 token
- “spreadsheet” = 2 tokens
- The full conversation history counts toward the limit
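The ~4 characters per token rule of thumb can be turned into a quick estimate. The sketch below is only a heuristic: the function name `estimate_tokens` is made up for illustration, and rounding down is an assumption chosen because it matches the two examples above. Real tokenizers split text differently, so treat the result as a rough guide rather than the count X21 reports.

```python
def estimate_tokens(text: str) -> int:
    """Rough token estimate using the ~4 characters per token rule of thumb."""
    if not text:
        return 0
    # Integer division rounds down; any non-empty text counts as at least one token.
    return max(1, len(text) // 4)


print(estimate_tokens("Hello"))        # 5 characters  -> 1
print(estimate_tokens("spreadsheet"))  # 11 characters -> 2
```

For exact figures, rely on the token counter in the status bar.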
Token Counter
The status bar shows real-time token usage.
When Limits Are Reached
At 200,000 tokens, X21 automatically:
- Compacts conversation (summarizes old messages)
- Preserves recent context
- Continues without interruption
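To make the idea of compaction concrete, here is a minimal sketch of one way it could work. This is not X21's implementation: the `Message` type, the `compact` function, the `keep_recent` parameter, and the 500-token summary size are all hypothetical, and a real system would generate the summary with the model rather than a placeholder string.

```python
from dataclasses import dataclass

CONVERSATION_LIMIT = 200_000  # tokens per conversation, from the limit above


@dataclass
class Message:
    text: str
    tokens: int


def compact(messages: list[Message], keep_recent: int = 4) -> list[Message]:
    """Replace older messages with one summary once the limit is reached."""
    total = sum(m.tokens for m in messages)
    split = len(messages) - keep_recent
    if total < CONVERSATION_LIMIT or split <= 0:
        return messages  # under the limit, or nothing old enough to summarize
    old, recent = messages[:split], messages[split:]
    # Hypothetical placeholder; the 500-token size is an assumed figure.
    summary = Message(text=f"[summary of {len(old)} earlier messages]", tokens=500)
    return [summary] + recent
```

The recent messages are returned unchanged, which mirrors the "preserves recent context" behavior described above.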
File Limits
PDF Attachments
100 pages total per request
Examples:
- 1 file × 100 pages = OK
- 2 files × 50 pages each = OK
- 5 files × 30 pages each = Exceeds limit
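The page limit applies to the sum across all attached PDFs, not to each file. A small check, with the hypothetical helper name `within_pdf_limit`, illustrates the arithmetic behind the examples above:

```python
MAX_PDF_PAGES_PER_REQUEST = 100  # total across all PDFs in one request


def within_pdf_limit(page_counts: list[int]) -> bool:
    """Return True if the combined page count fits in a single request."""
    return sum(page_counts) <= MAX_PDF_PAGES_PER_REQUEST


print(within_pdf_limit([100]))     # 1 file × 100 pages  -> True
print(within_pdf_limit([50, 50]))  # 2 files × 50 pages  -> True
print(within_pdf_limit([30] * 5))  # 5 files × 30 pages = 150 -> False
```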
Image Files
No explicit limit, but:
- Large images increase processing time
- Multiple images count toward token usage
- Recommended: Compress large images
File Types
Supported:
- PDFs (up to 100 pages total)
- PNG, JPG, JPEG, GIF, WEBM
- Excel files (use file operations instead)
- Word documents
- Other formats
Query Limits
Recent Chats
Max 50 conversations per history query
Configurable range: 1-50
Search Results
Max 100 results per search query
Configurable range: 1-100
Messages Per Conversation
No hard limit, but:
- Very long conversations may compact
- Performance may degrade beyond 200k tokens
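For the configurable ranges on recent chats (1-50) and search results (1-100), a caller can keep a requested value inside the allowed range before issuing a query. The sketch below assumes clamping is the desired behavior; this page does not say whether X21 clamps or rejects out-of-range values, and the names used here are illustrative only.

```python
RECENT_CHATS_RANGE = (1, 50)     # conversations per history query
SEARCH_RESULTS_RANGE = (1, 100)  # results per search query


def clamp_limit(requested: int, allowed: tuple[int, int]) -> int:
    """Pull a requested limit into the allowed (low, high) range."""
    low, high = allowed
    return max(low, min(requested, high))


print(clamp_limit(75, RECENT_CHATS_RANGE))    # -> 50
print(clamp_limit(0, SEARCH_RESULTS_RANGE))   # -> 1
print(clamp_limit(20, SEARCH_RESULTS_RANGE))  # -> 20
```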
Rate Limiting
Anthropic API Limits
X21 uses the Claude API, which has rate limits.
If rate limited:
- An error message appears
- A wait time is indicated (typically 30-60 seconds)
- Retry automatically or manually
Common causes:
- Multiple rapid requests
- Large file processing
- Peak usage times
Recovery
- Wait the indicated time
- Retry your request
- Contact support if persistent
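The recovery steps above (wait, retry, escalate if the problem persists) follow a common retry pattern. The sketch below is a generic illustration, not X21's or the Claude API's client code: `RateLimitError`, its `wait_seconds` attribute, and `retry_after_wait` are hypothetical names, and the default of three attempts is an assumption.

```python
import time
from typing import Callable, TypeVar

T = TypeVar("T")


class RateLimitError(Exception):
    """Hypothetical error carrying the wait time shown to the user."""

    def __init__(self, wait_seconds: float):
        super().__init__(f"rate limited, retry in {wait_seconds}s")
        self.wait_seconds = wait_seconds


def retry_after_wait(
    request: Callable[[], T],
    max_attempts: int = 3,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call request(), waiting the indicated time after each rate-limit error."""
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except RateLimitError as err:
            if attempt == max_attempts:
                raise  # still failing: surface the error (time to contact support)
            sleep(err.wait_seconds)
    raise ValueError("max_attempts must be at least 1")
```

Passing `sleep` as a parameter keeps the function easy to test without real delays.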
Best Practices
Managing Tokens
Start fresh:
- New conversation for new tasks
- Don’t mix unrelated work
- Clear separation reduces token usage
Monitor usage:
- Watch the token counter
- Compact before hitting 200k
- Use concise prompts when possible
File Attachments
Optimize PDFs:
- Extract relevant pages only
- Compress before attaching
- Split large documents
Optimize images:
- Resize to reasonable dimensions
- Compress without losing quality
- Screenshot only necessary portions
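For the "split large documents" tip, the page ranges for each request can be worked out ahead of time. The helper below is a sketch with a made-up name, `split_page_ranges`; it only computes the ranges and leaves the actual PDF splitting to whatever tool you prefer.

```python
def split_page_ranges(total_pages: int, max_pages: int = 100) -> list[tuple[int, int]]:
    """Return 1-based inclusive (first, last) page ranges of at most max_pages each."""
    if total_pages < 0 or max_pages < 1:
        raise ValueError("total_pages must be >= 0 and max_pages must be >= 1")
    ranges = []
    for first in range(1, total_pages + 1, max_pages):
        last = min(first + max_pages - 1, total_pages)
        ranges.append((first, last))
    return ranges


print(split_page_ranges(250))  # -> [(1, 100), (101, 200), (201, 250)]
```

Each range can then be attached in its own request so that no single request exceeds the 100-page limit.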
Avoiding Rate Limits
Pace requests:
- Don’t rapid-fire multiple requests
- Let responses complete
- Batch operations when possible
Time your requests:
- Fewer rate limits during off-peak hours
- Plan large operations accordingly
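When requests are sent from a script, pacing can be enforced with a minimum gap between calls. The sketch below is one simple approach; the `Pacer` class is hypothetical, and the interval is something you would choose yourself, since this page does not state a recommended value.

```python
import time
from typing import Callable, Optional


class Pacer:
    """Enforce a minimum interval between consecutive requests."""

    def __init__(
        self,
        min_interval: float,
        clock: Callable[[], float] = time.monotonic,
        sleep: Callable[[float], None] = time.sleep,
    ):
        self._min_interval = min_interval
        self._clock = clock
        self._sleep = sleep
        self._last: Optional[float] = None  # time the previous request was released

    def wait(self) -> float:
        """Block until the interval has elapsed; return the seconds waited."""
        now = self._clock()
        delay = 0.0
        if self._last is not None:
            delay = max(0.0, self._min_interval - (now - self._last))
            if delay > 0:
                self._sleep(delay)
        self._last = now + delay
        return delay
```

Call `pacer.wait()` before each request; the first call returns immediately and later calls pause only when requests arrive too close together.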
Error Messages
Token Limit Errors
Rate Limit Errors
File Limit Errors
Quotas by User Type
All X21 users have the same limits:
- 200,000 tokens per conversation
- 100 PDF pages per request
- Shared Anthropic API rate limits
Related Topics
- Understanding AI Responses - Token details
- File Attachments - File handling
- Performance Optimization - Efficiency tips
- Troubleshooting: Error Types - Error handling

