Platform Metrics¶
Platform Metrics provides a high-level view of your DIBOP instance's operational health. It aggregates data from orchestration executions, API calls, and system health checks into actionable dashboards.
Overview¶
The Platform Metrics page is accessible from MONITOR > Observability and from the Platform Home status strip. It shows trends over time, helping you understand how your integrations are performing and where to focus your attention.
Key Metrics¶
API Calls Over Time¶
A time-series chart showing the volume of API calls made through DIBOP, broken down by:
- Successful calls (2xx responses) -- green
- Client errors (4xx responses) -- amber
- Server errors (5xx responses) -- red
You can view this chart at hourly, daily, or weekly granularity.
What to look for:
- Spikes in error rates that correlate with specific times or events
- Gradual increases in call volume that may approach rate limits
- Unexpected drops in call volume that could indicate a broken orchestration
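If you want to check for error-rate spikes outside the dashboard, the same breakdown can be approximated from raw call records. The sketch below is illustrative only: it assumes each record is a pair of an ISO-8601 timestamp and an HTTP status code, which is not a documented DIBOP format, and the function names are hypothetical.

```python
from collections import defaultdict
from datetime import datetime


def status_class(status: int) -> str:
    """Map an HTTP status code to one of the chart's three series."""
    if 200 <= status < 300:
        return "success"
    if 400 <= status < 500:
        return "client_error"
    if 500 <= status < 600:
        return "server_error"
    return "other"


def hourly_error_rates(calls):
    """calls: iterable of (ISO-8601 timestamp, HTTP status) pairs (assumed shape).

    Returns {hour_start: fraction of calls in that hour that were 4xx or 5xx}.
    """
    totals = defaultdict(int)
    errors = defaultdict(int)
    for timestamp, status in calls:
        # Truncate the timestamp to the start of its hour.
        hour = datetime.fromisoformat(timestamp).replace(
            minute=0, second=0, microsecond=0
        )
        totals[hour] += 1
        if status_class(status) in ("client_error", "server_error"):
            errors[hour] += 1
    return {hour: errors[hour] / totals[hour] for hour in sorted(totals)}


# Made-up sample data for illustration.
sample = [
    ("2024-05-01T09:05:00", 200),
    ("2024-05-01T09:20:00", 200),
    ("2024-05-01T09:40:00", 503),
    ("2024-05-01T09:55:00", 200),
    ("2024-05-01T10:10:00", 404),
    ("2024-05-01T10:30:00", 500),
]
rates = hourly_error_rates(sample)
for hour, rate in rates.items():
    print(hour.isoformat(), f"{rate:.0%}")
```

An hour whose rate is far above its neighbours is a candidate for closer inspection in the API Call Log.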
System Health Indicators¶
A summary card for each connected system showing its current health:
| Indicator | Meaning |
|---|---|
| Healthy (green) | The system is responding normally |
| Degraded (amber) | The system is responding but with elevated error rates or latency |
| Offline (red) | The system is not responding |
| Unchecked (grey) | No recent health check data |
Health is determined by the most recent health check ping. DIBOP pings active connections at regular intervals.
Latency Trends¶
A chart showing the average API response time for each connected system over time. This helps you:
- Identify systems that are getting slower
- Correlate latency spikes with external events (maintenance windows, deployments)
- Set baseline expectations for SLA thresholds
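One possible way to turn latency history into a baseline is to look at both the mean and a high percentile, since an average alone can hide slow outliers. This is a minimal sketch under assumptions: the samples are made-up millisecond values, and the use of the 95th percentile is a common convention rather than something DIBOP prescribes.

```python
import statistics


def latency_baseline(samples_ms):
    """Return (mean, 95th percentile) of latency samples in milliseconds."""
    mean = statistics.fmean(samples_ms)
    # quantiles(n=20) returns 19 cut points; the last one is the 95th percentile.
    p95 = statistics.quantiles(samples_ms, n=20, method="inclusive")[-1]
    return mean, p95


# Hypothetical response times for one connected system.
samples = [100, 110, 120, 130, 140, 150, 160, 170, 180, 190, 200]
mean, p95 = latency_baseline(samples)
print(f"mean={mean} ms, p95={p95} ms")
```

A threshold set somewhat above the observed 95th percentile tends to flag genuine slowdowns without firing on normal variation.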
Error Rates¶
A breakdown of errors by type:
| Error Type | Description |
|---|---|
| Auth Errors (401/403) | Credential or permission issues |
| Not Found (404) | Resource not found -- possible misconfiguration |
| Rate Limits (429) | External system rate limit exceeded |
| Server Errors (5xx) | External system failures |
| Timeouts | No response within the configured timeout |
| Network Errors | Connection refused, DNS failure, etc. |
The error rate chart helps you prioritise which issues to investigate first.
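The grouping in the table above can be expressed as a small classification function. This is a sketch rather than DIBOP's implementation: the `failure` argument, used to tell timeouts from network errors when no HTTP response exists, is a hypothetical marker introduced for the example.

```python
from collections import Counter
from typing import Optional


def error_type(status: Optional[int], failure: Optional[str] = None) -> Optional[str]:
    """Map a call outcome to a row of the error table, or None for a non-error.

    status  -- HTTP status code, or None when no response was received
    failure -- hypothetical marker ("timeout" or "network") used when status is None
    """
    if status is None:
        return "Timeouts" if failure == "timeout" else "Network Errors"
    if status in (401, 403):
        return "Auth Errors"
    if status == 404:
        return "Not Found"
    if status == 429:
        return "Rate Limits"
    if 500 <= status < 600:
        return "Server Errors"
    return None  # 2xx, or a 4xx code the table does not list separately


# Made-up outcomes: (status, failure marker).
outcomes = [
    (200, None),
    (401, None),
    (403, None),
    (429, None),
    (502, None),
    (None, "timeout"),
    (None, "network"),
]
counts = Counter(filter(None, (error_type(s, f) for s, f in outcomes)))
for name, count in counts.most_common():
    print(name, count)
```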
Orchestration Metrics¶
Execution Volume¶
Total number of orchestration executions over time, broken down by status (success, failed, partial).
Average Execution Duration¶
The mean time to complete an orchestration, tracked over time. A sudden increase may indicate:
- A connected system becoming slower
- A new step added to an orchestration
- Network issues between DIBOP and external systems
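A "sudden increase" can be made concrete by comparing recent durations against the earlier history. The sketch below assumes a plain list of mean durations in seconds, one per period; the window size and the 1.5x factor are arbitrary illustrative choices, not DIBOP settings.

```python
from statistics import fmean


def duration_increase(durations_s, window=3, factor=1.5):
    """Return True when the mean of the latest `window` values exceeds
    `factor` times the mean of all values before them."""
    if len(durations_s) < 2 * window:
        return False  # not enough history for a meaningful comparison
    recent = fmean(durations_s[-window:])
    baseline = fmean(durations_s[:-window])
    return recent > factor * baseline


# Hypothetical daily mean durations for one orchestration.
steady = [10, 11, 10, 12, 11, 10, 11, 10, 12]
slowing = [10, 11, 10, 12, 11, 10, 18, 19, 20]
print(duration_increase(steady), duration_increase(slowing))
```

When the check fires, the three causes listed above are a reasonable order in which to investigate.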
Most Active Orchestrations¶
A ranked list of orchestrations by execution count. This shows which workflows carry the most load and are therefore worth monitoring most closely.
Most Failing Orchestrations¶
A ranked list of orchestrations by failure count. Focus your investigation here to improve overall reliability.
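Both rankings are simple counts over execution records. As a rough sketch, assuming each record is an `(orchestration name, status)` pair with the status values mentioned under Execution Volume (the record shape and orchestration names are made up):

```python
from collections import Counter


def rank_orchestrations(executions):
    """executions: iterable of (orchestration_name, status) pairs (assumed shape).

    Returns two lists of (name, count), each sorted by count descending:
    one by total executions, one by failed executions.
    """
    executions = list(executions)
    most_active = Counter(name for name, _ in executions).most_common()
    most_failing = Counter(
        name for name, status in executions if status == "failed"
    ).most_common()
    return most_active, most_failing


sample = [
    ("sync-orders", "success"),
    ("sync-orders", "failed"),
    ("sync-orders", "success"),
    ("export-invoices", "failed"),
    ("export-invoices", "failed"),
    ("update-stock", "success"),
]
most_active, most_failing = rank_orchestrations(sample)
print(most_active)
print(most_failing)
```

Because the failure ranking uses counts rather than rates, a high-volume orchestration can appear near the top while still failing only a small fraction of the time. Comparing the two lists helps separate the two cases.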
Capacity Metrics¶
API Call Volume vs Quota¶
If your enterprise has an API call quota, this metric shows:
- Current usage (calls made in the billing period)
- Quota limit
- Projected usage (based on current trends)
- Days remaining in the billing period
Approaching Quota: If projected usage exceeds your quota, either optimise orchestrations to reduce unnecessary API calls or request a quota increase from your platform administrator.
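This page does not specify how DIBOP calculates projected usage. A simple linear projection, which assumes the average daily rate so far continues for the rest of the billing period, gives a rough idea of how such a figure can be derived. All numbers below are made up.

```python
def project_usage(calls_so_far, days_elapsed, days_in_period):
    """Linear projection: assume the average daily rate so far continues."""
    if days_elapsed <= 0:
        raise ValueError("days_elapsed must be positive")
    daily_rate = calls_so_far / days_elapsed
    return daily_rate * days_in_period


# Hypothetical billing period: 12 of 30 days elapsed.
quota = 1_000_000
projected = project_usage(calls_so_far=420_000, days_elapsed=12, days_in_period=30)
over_quota = projected > quota
print(projected, over_quota)  # 1050000.0 True
```

A linear projection ignores weekly cycles and one-off bursts, so treat it as an early warning rather than a forecast.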
Storage Usage¶
Shows how much data your enterprise is storing across:
- Execution logs
- API call logs
- Canonical data records
Custom Dashboards¶
Platform Metrics provides a default dashboard, which you can customise by pinning metrics and setting visual thresholds.
Pinning Metrics¶
Click the pin icon on any metric card to add it to your personalised dashboard. Pinned metrics appear at the top of the page.
Metric Thresholds¶
Set visual thresholds on any metric to highlight when values cross a boundary:
1. Click the settings icon on a metric card
2. Set a warning threshold (amber highlight)
3. Set a critical threshold (red highlight)
4. Click Save
These thresholds are visual only -- they do not trigger alerts. To create automated notifications, use Alert Rules.
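The two-level highlighting can be pictured as the following check. It is a sketch under assumptions: it treats higher values as worse and the boundaries as inclusive, neither of which this page confirms.

```python
def highlight(value, warning, critical):
    """Return the highlight colour for a metric value, or None if neither
    threshold is reached.

    Assumes higher values are worse and warning <= critical.
    """
    if value >= critical:
        return "red"
    if value >= warning:
        return "amber"
    return None


# Hypothetical error-rate thresholds, in percent.
print(highlight(1.0, warning=2.0, critical=5.0))
print(highlight(3.5, warning=2.0, critical=5.0))
print(highlight(7.0, warning=2.0, critical=5.0))
```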
Time Range and Refresh¶
Time Range¶
Use the time range selector to adjust the period displayed on all charts:
- Last 24 hours
- Last 7 days
- Last 30 days
- Last 90 days
- Custom range
Auto-Refresh¶
Enable auto-refresh to update the dashboard at regular intervals:
- Every 30 seconds
- Every minute
- Every 5 minutes
- Off (manual refresh only)
Exporting Metrics¶
Export metric data for external reporting:
1. Click Export in the toolbar
2. Select the metrics and time range to include
3. Choose the format (CSV or JSON)
4. Click Download
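Once downloaded, an export can be loaded into a reporting script with the standard library. The layout of the export is not described on this page, so this sketch assumes a CSV with a header row or a JSON top-level array of objects; the column names in the sample are invented.

```python
import csv
import io
import json


def load_export(text, fmt):
    """Parse exported metric data into a list of dict rows.

    fmt -- "csv" or "json"
    Assumes a CSV header row or a JSON array of objects (unconfirmed layout).
    """
    if fmt == "csv":
        return list(csv.DictReader(io.StringIO(text)))
    if fmt == "json":
        return json.loads(text)
    raise ValueError(f"unsupported format: {fmt!r}")


# Invented sample content standing in for a downloaded file.
csv_text = "metric,value\napi_calls,120\nerror_rate,0.02\n"
json_text = '[{"metric": "api_calls", "value": 120}]'
csv_rows = load_export(csv_text, "csv")
json_rows = load_export(json_text, "json")
print(csv_rows[0]["metric"], json_rows[0]["value"])
```

Note that `csv.DictReader` returns every value as a string, so numeric columns need converting before any arithmetic.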
Next Steps¶
- Observability Dashboard -- SLA-focused monitoring
- API Call Log -- drill into individual API calls
- Setting Up Alerts -- automate notifications for metric thresholds
- Quotas & Rate Limits -- understand your usage limits