Overview
AI Runbooks allow you to define a set of tasks that Metoro should perform when an alert fires. Instead of manually investigating alerts, you can configure Metoro to automatically gather relevant data, analyze the situation, and document findings.How Runbooks Work
- Alert Fires - A configured alert is triggered in Metoro
- Runbook Executes - Metoro automatically executes the runbook you’ve defined
- Investigation Document - Metoro creates a new document containing all the information gathered
- Notification - A link to the investigation document is included in the alert notification
What Can Runbooks Do?
Metoro can perform a broad set of investigation tasks in your runbooks:Find Logs
Search for relevant logs around the time of the alert, including error logs, warnings, and contextual information
Analyze Traces
Find and analyze traces related to the alert, including slow requests, errors, and dependency failures
Query Metrics
Gather relevant metrics like error rates, latency percentiles, throughput, and resource utilization
Check Dependencies
Analyze upstream and downstream service dependencies to identify cascading failures
Isolate Failing Pods
Identify which specific pods or instances are experiencing issues
Correlate Changes
Link issues to recent deployments or configuration changes
Creating a Runbook
Step 1: Create or Edit an Alert
- Navigate to Alerts in the main navigation
- Create a new alert or edit an existing one
- In the alert configuration, find the AI Runbook section
Step 2: Define the Runbook
Write instructions for what Metoro should investigate when the alert fires. Use natural language to describe what you want Metoro to do.- Simple Runbook
- Detailed Runbook
- Specific Investigation
Step 3: Enable AI Investigation for the Alert
- Toggle the AI investigation destination for the alert
- Save the alert configuration
Viewing Runbook Results
When an alert fires and the runbook executes:- Alert Notification - The alert notification includes a link to the investigation document
- Investigations Page - View all investigation documents in AI SRE → Investigations
- Investigation Details - Click on an investigation to see:
- All data Metoro gathered
- Analysis and findings
- Recommended actions
Best Practices
Be Specific
The more specific your runbook instructions, the better Metoro can investigate:- Less Effective
- More Effective
Include Context
Give Metoro context about what the alert means:Focus on Actionable Information
Ask for information that helps with resolution:Runbook Examples
High Error Rate Alert
Latency Alert
Resource Alert
Related Documentation
AI SRE Overview
Learn how AI SRE fits into Metoro’s broader platform
Alerts
Configure alerts in Metoro
AI Alert Investigations
Investigate firing alerts with AI
