Datadog Integration
Connect Calmo to your Datadog account to enable comprehensive observability, monitoring, and performance analysis through AI assistance. This integration provides access to 22 specialized tools across 5 categories for complete monitoring and observability workflows.Overview & Value Proposition
The Datadog integration transforms how your team handles observability and monitoring by providing:- Intelligent Monitoring - AI-powered analysis of metrics, logs, and traces with automatic anomaly detection
- Comprehensive Observability - Full-stack monitoring from infrastructure to application performance
- Advanced Analytics - Real User Monitoring (RUM) insights and performance bottleneck identification
- Incident Response - Rapid investigation and correlation across monitoring signals
- Infrastructure Management - Host monitoring, downtime scheduling, and capacity planning
- Safe Operations - Read-only tools enabled by default with controlled write access
Key Capabilities
When connected, Calmo gains access to 22 Datadog tools across 5 categories:Category | Tools | Capability |
---|---|---|
Monitoring & Metrics | 4 tools | Query metrics, dashboards, and monitors |
Logs & Traces | 4 tools | Access logs, traces, and service information |
Infrastructure | 4 tools | Host management and infrastructure monitoring |
Incidents | 2 tools | Incident tracking and management |
Real User Monitoring | 5 tools | RUM applications, events, and performance analysis |
Downtimes | 3 tools | Downtime scheduling and management |
Prerequisites
- Datadog account with appropriate access permissions
- Admin access to generate API and Application keys
- Calmo account with team or personal workspace
Setup Process
Step 1: Generate Datadog Credentials
Create API Key:- Log in to your Datadog account
- Navigate to Organization Settings → API Keys
- Click + New Key
- Name: “Calmo Integration API Key”
- Copy the generated API key immediately
- Navigate to Organization Settings → Application Keys
- Click + New Key
- Name: “Calmo Integration App Key”
- Copy the generated Application key immediately
Step 2: Configure Site Selection
Choose your Datadog site based on your account location:Site | URL | Region |
---|---|---|
US1 | datadoghq.com | United States (default) |
EU1 | datadoghq.eu | Europe |
US3 | us3.datadoghq.com | United States (West) |
US5 | us5.datadoghq.com | United States (Gov) |
AP1 | ap1.datadoghq.com | Asia Pacific |
Gov | ddog-gov.com | US Government |
Step 3: Connect to Calmo
- Navigate to Integrations in your Calmo dashboard
- Click Datadog integration
- Enter your API Key and Application Key
- Select your Datadog Site from the dropdown
- Configure tool permissions:
- ✅ Read-only operations enabled by default
- ❌ Write operations disabled for safety
- Complete the connection
Tool Categories & Configuration
📊 Monitoring & Metrics (Safe)
Default: Enabled - Essential for observability analysis- query_metrics - Query and analyze metric data with filters and aggregations
- list_dashboards - List available dashboards and their metadata
- get_dashboard - Retrieve specific dashboard configurations and widgets
- get_monitors - Access monitor configurations, statuses, and alert history
📋 Logs & Traces (Safe)
Default: Enabled - Critical for debugging and troubleshooting- get_logs - Search and retrieve log data with advanced filtering
- get_all_services - List all services reporting to Datadog APM
- list_traces - Query distributed traces for performance analysis
🖥️ Infrastructure (Mixed Safety)
Default: Read operations enabled - Host and infrastructure monitoring Read Operations (✅ Enabled by default):- list_hosts - List all hosts and their metadata
- get_active_hosts_count - Get count of active hosts
- mute_host - Mute alerts for specific hosts
- unmute_host - Unmute previously muted hosts
🚨 Incidents (Safe)
Default: Enabled - Incident tracking and response- list_incidents - List incidents with status and metadata
- get_incident - Get detailed incident information and timeline
📱 Real User Monitoring (Safe)
Default: Enabled - Frontend performance and user experience- get_rum_applications - List RUM applications and their configurations
- get_rum_events - Query RUM events with filtering capabilities
- get_rum_grouped_event_count - Get aggregated event counts and statistics
- get_rum_page_performance - Analyze page load performance metrics
- get_rum_page_waterfall - Get detailed page load waterfall analysis
⏰ Downtimes (Write Operations)
Default: Read-only enabled - Maintenance window management Read Operations (✅ Enabled by default):- list_downtimes - List scheduled and active downtimes
- schedule_downtime - Schedule maintenance windows
- cancel_downtime - Cancel scheduled downtimes
Team vs Personal Configuration
Team/Organization Setup
- Shared Datadog account access across team members
- Organization-level monitoring policies and dashboards
- Centralized incident response and escalation procedures
- Team administrators control write operation permissions
Personal Setup
- Individual Datadog account connections
- Personal dashboard and monitoring preferences
- Private metric queries and analysis
- Full control over enabled tool capabilities
Security & Best Practices
⚠️ Safety Recommendations
- Read-First Approach - Begin with read-only tools, add write access gradually
- Key Rotation - Regularly rotate API and Application keys
- Scope Limitation - Use least-privilege access for integration keys
- Monitoring Integration - Monitor the integration’s own usage through Datadog
- Site Verification - Ensure correct Datadog site selection to avoid data issues
🔒 Permission Levels
Risk Level | Operations | Recommendation |
---|---|---|
Low | Query metrics, view dashboards, read logs | ✅ Safe to enable |
Medium | List infrastructure, read incidents | ✅ Generally safe |
High | Mute hosts, schedule downtimes | ⚠️ Enable with caution |
Configuration Management
Updating Datadog Connection
- Navigate to Integrations → Datadog
- Click Edit Configuration
- Update API keys, Application keys, or site selection
- Modify tool permissions based on team requirements
- Test connection with read-only operations
- Save configuration changes
Managing Multiple Environments
- Connect separate Datadog accounts for different environments
- Use different API keys for production vs staging
- Configure environment-specific tool permissions
- Maintain separate monitoring policies per environment
Advanced Features
Multi-Site Support
The integration supports all Datadog sites globally:- Geographic Distribution - Connect to the site closest to your data
- Compliance Requirements - Meet data residency requirements
- Performance Optimization - Reduce latency with regional sites
Advanced Query Capabilities
- Metric Aggregation - Complex queries with multiple aggregation functions
- Log Analytics - Advanced log searching with Datadog’s query language
- Trace Analysis - Distributed tracing across microservices
- Custom Dashboards - Integration with existing dashboard configurations
Real-Time Monitoring
- Live Metrics - Real-time metric streaming and analysis
- Alert Correlation - Cross-reference alerts with system events
- Performance Insights - Automated performance bottleneck detection
- Anomaly Detection - AI-powered anomaly identification
Troubleshooting
Common Issues
Authentication Failed- Verify API and Application keys are correct and active
- Check that keys haven’t expired or been revoked
- Ensure proper permissions are assigned to the keys
- Verify Datadog site matches your account region
- Check URLs and endpoints match your Datadog instance
- Update site selection in integration settings
- Review API key permissions in Datadog organization settings
- Ensure Application key has required scopes
- Check organization-level access restrictions
- Datadog enforces API rate limits (varies by endpoint)
- Monitor API usage in Datadog organization settings
- Contact support if limits are consistently exceeded
Getting Help
- Test Connection - Use the connection test feature in integration settings
- Update Credentials - Regenerate API and Application keys if needed
- Contact Support - Reach out to support@getcalmo.com for assistance
Data Types & Analysis
Metrics Analysis
- Infrastructure Metrics - CPU, memory, disk, network utilization
- Application Metrics - Request rates, error rates, response times
- Custom Metrics - Business-specific KPIs and measurements
- Synthetic Monitoring - Uptime and performance of critical services
Log Investigation
- Application Logs - Error tracking and debugging information
- Infrastructure Logs - System events and security logs
- Audit Logs - User activity and configuration changes
- Custom Log Sources - Integration with third-party logging systems
Performance Monitoring
- APM Traces - Distributed tracing across services and databases
- Database Performance - Query performance and database health
- RUM Analytics - Real user experience and frontend performance
- Network Monitoring - Network performance and connectivity issues
For additional help with Datadog integration, contact our support team at support@getcalmo.com.