CloudWatch Dashboard
Create a comprehensive dashboard to visualize all metrics:Recommended Widgets
- Lambda Metrics
- S3 Metrics
- Application Metrics
Invocations
- Metric:
AWS/Lambda→Invocations - Statistic: Sum
- Period: 1 minute
- Chart type: Line
- Metric:
AWS/Lambda→Errors - Statistic: Sum
- Period: 1 minute
- Chart type: Stacked area (with Invocations)
- Metric:
AWS/Lambda→Duration - Statistics: Average, Maximum, p99
- Period: 1 minute
- Chart type: Line
- Metric:
AWS/Lambda→Throttles - Statistic: Sum
- Period: 1 minute
- Chart type: Number
CloudWatch Alarms
Set up proactive alerting for critical issues:Lambda Alarms
High Error Rate
High Error Rate
High Duration
High Duration
Throttling
Throttling
SNS Notification Setup
Configure email/SMS alerts:CloudWatch Logs Insights
Query and analyze Lambda execution logs:Query Examples
- Error Analysis
- Slow Executions
- Popular MCP Servers
- Error Patterns
Saved Queries
Save frequently used queries for quick access:- Daily Execution Summary
- Failed Server Executions
- Memory Usage Patterns
- Cold Start Analysis
Performance Metrics
Lambda Performance
Cold Start Detection
Memory Utilization
Concurrent Executions
Error Rate Trend
Cost Monitoring
Track infrastructure costs:Cost Explorer Filters
Filter by Service
- Service: AWS Lambda - Service: Amazon S3 - Service: CloudWatch - Tag:
project:superbox
Cost Optimization Queries
X-Ray Tracing (Optional)
Enable AWS X-Ray for detailed request tracing:Enable X-Ray
Benefits
- End-to-end request visualization
- Identify bottlenecks in execution flow
- Trace external API calls
- Analyze Lambda initialization time
Grafana Integration (Optional)
For advanced visualization, integrate CloudWatch with Grafana:Add CloudWatch Data Source
- Go to Configuration → Data Sources - Add AWS CloudWatch - Configure IAM credentials
Anomaly Detection
Enable CloudWatch Anomaly Detection:Monitoring Checklist
Daily Checks
- Review error rate (< 1%)
- Check average duration (< 30s)
- Verify no throttling events
- Monitor S3 bucket size growth
Weekly Reviews
- Analyze cost trends
- Review top error patterns
- Check cold start frequency
- Optimize memory allocation