Performance Issues
If X21 is responding slowly or experiencing performance issues, this guide helps identify and resolve the cause.
Slow AI Responses
Symptoms
- Long wait for response to start
- Slow streaming of text
- Delayed tool execution
- Timeouts
Common Causes & Solutions
1. Token Limit Approaching
Cause: Conversation near 200,000 token limit
Check: Look at token counter in status bar
Solution:
- Start a new conversation
- X21 will auto-compact, but new chat is faster
2. Large Attachments
Cause: Processing big PDFs or many images
Check: File size and page count
Solution:
- Reduce PDF to relevant pages only
- Compress images
- Attach fewer files per request
- Split large documents
3. Complex Operations
Cause: Operation requires extensive thinking
Check: Thinking block duration
Solution:
- Break into smaller steps
- Simplify prompt
- Provide more specific instructions
- Use slash commands for common tasks
4. Network Issues
Cause: Slow internet connection to API
Check: Network speed, other apps
Solution:
- Check internet connection
- Close bandwidth-heavy applications
- Wait for better connectivity
- Try smaller requests
When to Start New Conversation
Signs You Should Start Fresh
- Token usage high (over 150,000 tokens)
- Responses getting slower over time
- New task is unrelated to the previous conversation
- Context no longer relevant
- Performance degraded
Benefits of New Conversation
- Faster responses: Less history to process
- Better focus: Clear context for AI
- Token efficiency: Start with low usage
- Organized history: Easier to find later
How to Start New
- Click New Chat button
- X21 asks for confirmation if requests are still in progress
- Previous conversation auto-saved
- Fresh start with full token budget
Conversation Compacting
What It Does
When conversation reaches 200,000 tokens:
- Summarizes old messages
- Preserves recent context
- Reduces token usage
- Continues seamlessly
When It Happens
- Automatic at token limit
- No user action required
- Brief pause during compacting
- Notice appears in conversation
Impact
Benefits:
- Continue same conversation
- Maintain context
- No data loss
Trade-offs:
- Old details summarized
- Some nuance may be lost
- Still slower than new conversation
Tool Execution Speed
Slow Tool Execution
Causes:
- Large data ranges
- Many concurrent tools
- Complex VBA operations
- Excel busy with other tasks
Solutions:
- Batch operations more efficiently
- Smaller ranges per operation
- Sequential execution rather than parallel
- Close other Excel workbooks
- Disable Excel add-ins temporarily
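The "smaller ranges per operation" advice can be sketched as a helper that cuts one large row span into fixed-size blocks. This is only an illustration: `split_rows` and `to_a1` are hypothetical names, the chunk size of 1,000 rows is an arbitrary assumption, and nothing here calls X21 or Excel. It only computes the range addresses you might request one at a time.

```python
def split_rows(first_row: int, last_row: int, chunk_size: int) -> list[tuple[int, int]]:
    """Split an inclusive row span into consecutive blocks of at most chunk_size rows."""
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    if last_row < first_row:
        raise ValueError("last_row must not be smaller than first_row")
    return [
        (start, min(start + chunk_size - 1, last_row))
        for start in range(first_row, last_row + 1, chunk_size)
    ]


def to_a1(first_col: str, last_col: str, rows: tuple[int, int]) -> str:
    """Format a block of rows as an A1-style address, e.g. A1:F1000."""
    return f"{first_col}{rows[0]}:{last_col}{rows[1]}"


if __name__ == "__main__":
    # Hypothetical sheet: columns A to F, rows 1 to 2500, 1000 rows per request.
    for block in split_rows(1, 2500, 1000):
        print(to_a1("A", "F", block))
```

Working through the blocks one after another also matches the "sequential execution rather than parallel" suggestion.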
Tool Optimization
Efficient patterns:
Workbook Performance
Large Workbooks
Impact: Slower Excel operations
Solutions:
- Work on smaller worksheets
- Clear unused data/formatting
- Disable automatic calculations temporarily
- Close other workbooks
- Archive old data
External References
Impact: Slower when referencing other workbooks
Solutions:
- Break external links if not needed
- Copy data into current workbook
- Close source workbooks
System Performance
Resource Usage
Check:
- Task Manager (Windows)
- Activity Monitor (Mac)
- Look for high CPU/memory usage
Solutions:
- Close unused applications
- Restart Excel if memory high
- Restart Deno server
- Reboot computer if needed
Disk Space
Impact: Low disk space slows everything
Check: Available disk space
Solution:
- Ensure 5+ GB free space
- Clear temp files
- Archive old log files
- Clean up downloads
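The 5 GB guideline can be checked with a few lines of Python. A minimal sketch, assuming the current directory sits on the drive you care about; `free_space_gb` and `has_enough_space` are illustrative names, not part of X21.

```python
import shutil

MIN_FREE_GB = 5  # guideline from this guide


def free_space_gb(path: str = ".") -> float:
    """Return the free space, in GiB, on the drive that contains path."""
    return shutil.disk_usage(path).free / 1024**3


def has_enough_space(path: str = ".", minimum_gb: float = MIN_FREE_GB) -> bool:
    """True if the drive containing path has at least minimum_gb free."""
    return free_space_gb(path) >= minimum_gb


if __name__ == "__main__":
    free = free_space_gb(".")
    verdict = "OK" if has_enough_space(".") else "below the 5 GB guideline"
    print(f"Free space: {free:.1f} GB ({verdict})")
```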
Network Performance
Checking Connection
Test:
- Visit the Anthropic status page
- Check general internet speed
- Test other online services
Solutions:
- Wait for better connectivity
- Work offline where possible
- Schedule large operations for better network times
Port Issues
Check ports:
- Port 8000: Deno server
- Port 8080: VSTO add-in
- Ensure not blocked by firewall
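One way to confirm that both ports accept connections is a short TCP probe. A sketch under an assumption: it presumes both services listen on the local machine (127.0.0.1), which this guide does not state explicitly. `port_is_open` is an illustrative helper, not an X21 command.

```python
import socket


def port_is_open(port: int, host: str = "127.0.0.1", timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout seconds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Connection refused, timed out, or blocked (for example by a firewall).
        return False


if __name__ == "__main__":
    for port, name in ((8000, "Deno server"), (8080, "VSTO add-in")):
        state = "reachable" if port_is_open(port) else "not reachable"
        print(f"Port {port} ({name}): {state}")
```

A "not reachable" result means either the service is not running or something is blocking the port, so check the service first and the firewall second.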
Monitoring Performance
Token Usage
Watch the token counter:
- Fewer than 50,000: Excellent
- 50,000-100,000: Good
- 100,000-150,000: Consider new chat soon
- 150,000-200,000: Start new chat for best performance
- Over 200,000: Auto-compacting occurs
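The bands above can be expressed as a small lookup. This is a sketch, not X21's own logic: `token_status` is a hypothetical helper, and because the listed ranges share their endpoints (100,000 appears in two bands), it assumes each boundary value belongs to the higher band.

```python
def token_status(tokens_used: int) -> str:
    """Map a token count to the guidance bands listed in this guide."""
    if tokens_used < 0:
        raise ValueError("token count cannot be negative")
    if tokens_used < 50_000:
        return "Excellent"
    if tokens_used < 100_000:
        return "Good"
    if tokens_used < 150_000:
        return "Consider new chat soon"
    if tokens_used < 200_000:
        return "Start new chat for best performance"
    # Assumption: the 200,000 limit itself is treated as the compacting band.
    return "Auto-compacting occurs"


if __name__ == "__main__":
    for count in (12_000, 75_000, 140_000, 180_000, 210_000):
        print(f"{count:>7,} tokens: {token_status(count)}")
```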
Response Times
Expected:
- First response: 2-5 seconds
- Streaming: Continuous flow
- Tool execution: 1-3 seconds per tool
- Thinking: 1-10 seconds depending on complexity
Concerning:
- First response: >10 seconds
- Streaming: Stuttering or stops
- Tool execution: >10 seconds
- Thinking: >30 seconds
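If you time a session by hand, the figures above can be turned into a quick check. A rough sketch: the phase names and the `classify_timing` helper are illustrative, and the "borderline" label for values that fall between the two sets of figures is an assumption, since this guide does not classify that gap. Streaming is left out because it is described qualitatively.

```python
# Upper end of the normal range, in seconds, taken from the figures above.
EXPECTED_MAX = {"first_response": 5, "tool_execution": 3, "thinking": 10}
# Durations above these values, in seconds, are treated as warning signs.
CONCERNING_OVER = {"first_response": 10, "tool_execution": 10, "thinking": 30}


def classify_timing(phase: str, seconds: float) -> str:
    """Classify a measured duration as 'expected', 'borderline', or 'concerning'."""
    if phase not in EXPECTED_MAX:
        raise KeyError(f"unknown phase: {phase}")
    if seconds > CONCERNING_OVER[phase]:
        return "concerning"
    if seconds <= EXPECTED_MAX[phase]:
        return "expected"
    return "borderline"  # assumption: the guide leaves this gap unlabeled


if __name__ == "__main__":
    print(classify_timing("first_response", 3.2))
    print(classify_timing("tool_execution", 12))
```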
Best Practices
Proactive Management
- Start new conversations for new tasks
- Monitor token usage regularly
- Batch similar operations
- Optimize attachments before sending
- Close unused workbooks
Efficient Workflows
Good:
When Performance Matters
For time-sensitive work:
- Start fresh conversation
- Use slash commands
- Enable auto-approve
- Prepare data in advance
- Work during off-peak hours
Related Topics
- Rate Limits & Quotas - Understanding limits
- Best Practices: Performance Optimization - Optimization strategies
- Error Types - Timeout and overload errors
- Connection Status - Network troubleshooting

