Incidents
The Incidents card provides a quick overview of how many incidents have occurred within your monitored services over a selected time period. Incidents represent disruptions or issues that may affect service availability or performance.
Key Information
- Number of Incidents: This number represents how many incidents have occurred within the selected time period (e.g., the past 24 hours, 7 days, or 30 days). Incidents can range from minor issues to significant outages.
- Time Period Selection: As with the Uptime card, you can switch between the past 24 hours, 7 days, or 30 days by clicking the arrow next to the number of incidents.
- Colors: The number of incidents is color-coded so you can read the situation at a glance:
  - Green indicates no incidents.
  - Orange indicates a moderate number of incidents that may require attention.
  - Red suggests a significant number of incidents or major disruptions.
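The counting and color-coding described above can be sketched in a few lines of Python. This is an illustration only, not the product's implementation: the function names, the window labels, and especially the `orange_max` threshold are assumptions, since the exact boundary between "moderate" and "significant" is not stated here.

```python
from datetime import datetime, timedelta, timezone

# The three timeframes the card offers; the keys are hypothetical labels.
WINDOWS = {
    "24h": timedelta(hours=24),
    "7d": timedelta(days=7),
    "30d": timedelta(days=30),
}


def count_incidents(start_times, window, now):
    """Count incidents that started within the chosen window ending at `now`."""
    cutoff = now - WINDOWS[window]
    return sum(1 for ts in start_times if cutoff <= ts <= now)


def card_color(count, orange_max=5):
    """Map an incident count to a card color.

    `orange_max` is an assumed threshold for illustration only.
    """
    if count == 0:
        return "green"
    if count <= orange_max:
        return "orange"
    return "red"


# Example data: four incidents at made-up times relative to a fixed "now".
now = datetime(2024, 1, 31, 12, 0, tzinfo=timezone.utc)
start_times = [
    now - timedelta(hours=1),
    now - timedelta(days=3),
    now - timedelta(days=20),
    now - timedelta(days=40),
]

for window in WINDOWS:
    n = count_incidents(start_times, window, now)
    print(window, n, card_color(n))
```

Widening the window can only keep the count the same or increase it, which is why a card that is green for 24 hours may still be orange or red for 30 days.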
How to Use
- Monitor Incident Frequency: The Incidents card helps you keep track of how often issues occur in your services. A higher number of incidents may signal recurring problems that need investigation.
- Switch Timeframes: Click the arrow next to the number of incidents to view the count over a different timeframe (24 hours, 7 days, or 30 days).
Actionable Steps
- Investigate Incidents: If the number of incidents rises, drill down into the incident logs to investigate the root causes and assess their impact on your services. Details for each incident are available on the Incidents page.
- Resolve and Prevent: After identifying the cause of incidents, take corrective measures to resolve them and implement strategies to prevent similar issues in the future.
Best Practices
- Track Incident Trends: Use the Incidents card to monitor trends over time so you can respond quickly when incident frequency increases.
- Prioritize Critical Incidents: Focus on resolving critical incidents first to minimize downtime and disruptions to essential services.
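If you export incident data for your own review, the two practices above might look like the following sketch. The `Incident` fields, the severity labels ("critical", "major", "minor"), and the helper names are all hypothetical and chosen for illustration; they are not taken from the product.

```python
from dataclasses import dataclass

# Assumed severity labels, ranked so that lower numbers are handled first.
SEVERITY_RANK = {"critical": 0, "major": 1, "minor": 2}


@dataclass
class Incident:
    service: str
    severity: str      # one of the keys in SEVERITY_RANK
    open_hours: float  # how long the incident has been open


def prioritize(incidents):
    """Order incidents by severity, then by longest-open first."""
    return sorted(
        incidents,
        key=lambda i: (SEVERITY_RANK[i.severity], -i.open_hours),
    )


def trend(current, previous):
    """Compare the incident count of one period with the period before it."""
    if current > previous:
        return "rising"
    if current < previous:
        return "falling"
    return "steady"


# Made-up example data.
incidents = [
    Incident("checkout", "minor", 30.0),
    Incident("auth", "critical", 2.0),
    Incident("search", "major", 12.0),
    Incident("payments", "critical", 5.0),
]

queue = prioritize(incidents)
print([i.service for i in queue])
print(trend(current=4, previous=2))
```

Sorting on severity before age keeps a long-running minor incident from jumping ahead of a newly opened critical one.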
By regularly monitoring the Incidents card, you can stay informed about the health and stability of your services and take proactive steps to mitigate potential issues.