OpsAI Prompt Library

OpsAI works through natural conversation, so the quality of the answer you get depends heavily on how you ask. This page is a copy-and-adapt library of prompts, organized by the surface you are working in.

OpsAI exposes three prompting surfaces, each backed by its own agents:

Surface	What it does	Where you use it
Ask OpsAI	Full conversation and investigation - query telemetry, run root-cause analysis across logs, metrics, traces, RUM, and Kubernetes, read code, and open fixes. Backed by the conversation and investigation teams.	The OpsAI chat panel
Dashboard widget builder	Builds and edits widgets on the dashboard you are currently in. State-aware: it already knows which dashboard you are on and discovers the right metrics for you.	Inside a dashboard, in the AI widget builder
Alert builder	Turns a plain-language request into a ready-to-apply alert rule config. State-aware: it can refine the alert draft you are currently editing.	The AI alert creation panel

Pick the surface that matches your task, copy a prompt, and replace the placeholders (service name, app name, dashboard, time range) with your own values.

You can talk to OpsAI directly from the OpsAI panel - no setup is required to start a conversation. To act on errors, code, or clusters, connect the relevant integration first (GitHub/Bitbucket for code, the Kube Agent for Kubernetes). See OpsAI Overview to get set up.

How to write effective prompts

A few habits make every prompt sharper, regardless of the surface:

Name the resource. Mention the service, application, host, cluster, namespace, dashboard, or RUM app you care about. "the checkout service" beats "my app".
Scope the time window. OpsAI defaults to a recent window. Say "in the last 1 hour", "over the past 24 hours", or "between 2pm and 3pm today" to control it.
State the goal, not just the data. "Find why latency spiked and what changed" gets you an investigation; "show p99 latency" gets you a number.
Add filters. Environment (prod, staging), status code, error type, region, or tag values narrow the results.
Follow up. OpsAI keeps context across the conversation. Start broad, then drill in: "now group that by endpoint", "show only 5xx", "what changed right before this?".
Ask it to act. End with "find the root cause and propose a fix", "add these as widgets", or "create the alert" to move from analysis to action.
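
If you generate prompts from a script or a runbook, the habits above can be captured in a small helper. The following is a minimal sketch, not part of OpsAI: `build_prompt` and its parameters are hypothetical names, and the result is only the plain-language text you would paste into the OpsAI panel.

```python
def build_prompt(goal, resource, time_window=None, filters=None, action=None):
    """Assemble a plain-language prompt from the habits listed above.

    Every name here is hypothetical; OpsAI only sees the resulting text.
    """
    # State the goal and name the resource.
    parts = [f"{goal} for the {resource}"]
    # Scope the time window explicitly instead of relying on the default.
    if time_window:
        parts.append(f"in the {time_window}")
    # Add filters such as environment or status code.
    if filters:
        rendered = ", ".join(f"{key} {value}" for key, value in filters.items())
        parts.append(f"filtered to {rendered}")
    sentence = " ".join(parts) + "."
    # Ask it to act: end with the follow-up action, if one was given.
    if action:
        sentence += f" Then {action}."
    return sentence


prompt = build_prompt(
    goal="Find why latency spiked and what changed",
    resource="checkout service",
    time_window="last 1 hour",
    filters={"environment": "prod", "status code": "5xx"},
    action="find the root cause and propose a fix",
)
print(prompt)
```

Leaving out `time_window`, `filters`, or `action` simply drops that clause, which mirrors starting broad and drilling in with follow-ups.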

Ask OpsAI

The chat panel is the general-purpose surface. It routes your request to the right agents - the conversation team for queries, the investigation team for root-cause analysis, and the fix flow for code changes. The sections below are grouped by module.

Logs

OpsAI can search, filter, and summarize log data, and pull error logs into an investigation.

Show me error logs for the payment-service in the last 1 hour.
Search logs across all services for "connection refused" in the last 30 minutes.
Summarize the most common error messages in the orders-service logs today.
Show logs for the checkout service filtered to severity ERROR and group them by message.
Find log spikes in the last 6 hours and tell me which service they came from.
Pull the logs around the time this error occurred and explain what went wrong.

Metrics

OpsAI can discover which metrics exist for a resource, then query and aggregate them over a time range.

What metrics are available for the api-gateway service?
Show CPU usage for the checkout service over the past 24 hours.
Query memory usage by host for the last 3 hours and show the top 5.
What is the p99 latency for the orders-service right now, and how does it compare to yesterday?
Show the request rate and error rate for payment-service side by side for the last hour.
List the resource types I can query in this environment.

Services and traces (APM)

OpsAI understands your APM traces and spans, and can use them to explore performance and errors in distributed applications.

Which services have the highest error rate in the last hour?
Show me the slowest endpoints in the checkout service over the past 24 hours.
Trace a slow request through the orders-service and tell me where the time is spent.
Find traces with errors in the payment-service and group them by exception type.
Why did latency spike for the api-gateway around 3pm? Correlate traces, logs, and any deploys.
Which downstream service is causing failures in the checkout flow?

Errors and incidents

OpsAI surfaces grouped errors and incidents, and can open any one of them for a full root-cause investigation.

List the top errors across my services in the last 24 hours.
Show me the most frequent errors in the payment-service and which release introduced them.
Get the full details for this error: <fingerprint or issue URL>.
Investigate this error, find the root cause, and propose a fix.
Which new errors appeared after the last deployment of the orders-service?
Summarize this incident and tell me the likely cause and impact.

When you ask OpsAI to "propose a fix" for an APM or RUM error and your repository is connected, it gathers the relevant code, pinpoints the file and line, and can open a pull request with the change for review. See GitHub below.

Real User Monitoring (RUM)

OpsAI can analyze frontend sessions, browser errors, and user journeys, and correlate them with backend traces.

List my RUM applications.
Show the RUM errors for my web app in the last 24 hours.
Which pages have the most JavaScript errors this week?
Find sessions where users hit an error during checkout and replay what happened.
Correlate this browser error with the backend trace and logs, then suggest a code fix.
What is the slowest page load for my frontend app, and what is causing it?

Kubernetes

With the Middleware Kube Agent connected, OpsAI can read live cluster state, diagnose workloads, and - when you grant write access - remediate issues.

List my configured Kubernetes clusters.
Show the status of all pods in the monitoring namespace.
Why is the payment-service deployment not ready? Diagnose it.
Describe the failing pod and show me its recent events and logs.
Check resource usage across nodes and tell me which ones are under pressure.
This deployment keeps crash-looping - find the root cause and fix it.
Scale the orders-service deployment to 4 replicas.

Kubernetes remediation happens only when you explicitly grant access to write tools. OpsAI inspects the live cluster state first, applies the change, and then verifies it. With read-only access, it only diagnoses and recommends.
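
The gating described above can be pictured as a small control flow. This is an illustrative sketch only, not OpsAI's implementation: `handle_request`, the stub callables, and the in-memory `cluster` dictionary are all hypothetical stand-ins for real cluster operations.

```python
def handle_request(request, write_access, inspect, apply_change, verify):
    """Model the inspect / apply / verify flow with caller-supplied callables.

    inspect, apply_change, and verify are hypothetical, not OpsAI APIs.
    """
    # Cluster state is read first in both modes.
    state = inspect(request)
    if not write_access:
        # Read-only: diagnose and recommend, never change the cluster.
        return {"mode": "read-only", "diagnosis": state, "applied": False}
    # Write access granted: apply the change, then verify the result.
    result = apply_change(request, state)
    return {"mode": "remediate", "diagnosis": state, "applied": True,
            "verified": verify(result)}


# Stub callables backed by an in-memory "cluster" (assumption for the demo).
cluster = {"orders-service": 2}

def inspect(request):
    return dict(cluster)  # snapshot of current replica counts

def apply_change(request, state):
    cluster[request["deployment"]] = request["replicas"]
    return request

def verify(request):
    return cluster[request["deployment"]] == request["replicas"]


scale = {"deployment": "orders-service", "replicas": 4}
read_only = handle_request(scale, False, inspect, apply_change, verify)
replicas_after_read_only = cluster["orders-service"]
remediated = handle_request(scale, True, inspect, apply_change, verify)
```

In the read-only call the stub cluster is left untouched; only the second call, made with write access, changes the replica count and reports a verification result.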

GitHub (code and pull requests)

When your repository is connected, OpsAI can read code, search the repo, manage issues and pull requests, and open a PR with a proposed fix.

Find the file that handles payment retries in my repo and explain what it does.
Search the repository for where we call the Stripe API.
Show me the changes in pull request #482.
Based on this error, read the relevant code and open a pull request with a fix.
Open a GitHub issue describing this incident and assign it to me.
Who last changed this file, and what did the change do?

OpsAI reads only the files related to the error through the MCP integration and does not store your source code or error context. Connect GitHub or Bitbucket to enable code reads and PR creation.

Account, users, and usage

OpsAI can answer questions about your Middleware account - projects, teams, members, and usage.

Who are the members of my organization?
List the projects in my account.
What teams do I have set up?
Show my current Middleware usage and how close I am to my limits.
Show me my account details.

Middleware product help

OpsAI is grounded in Middleware's documentation, so you can ask it how to use the product itself.

How do I configure RUM in Middleware?
What's the difference between APM and RUM in Middleware?
How do I install the Kube Agent with OpsAI enabled?
How do I connect my GitHub repository to OpsAI?
How do I set up auto-investigation for my services?

End-to-end investigations

These prompts combine modules into a single request. OpsAI plans the steps, pulls data from each source, and returns a root cause with a recommended or applied fix.

The checkout flow is failing in production. Investigate across traces, logs, and RUM, find the root cause, and propose a fix.
Latency on the api-gateway doubled in the last hour. Tell me what changed - deploys, config, or a downstream service - and how to fix it.
Users are reporting errors on the payment page. Correlate the browser errors with the backend service, identify the root cause, and open a pull request with the fix.
This Kubernetes alert just fired. Diagnose the cluster, explain the impact, and remediate it.
Find the noisiest error across all my services this week and walk me through fixing it end to end.

For a hands-off workflow, enable Auto Investigation per source (clusters for Kubernetes, services for APM, apps for RUM) in OpsAI Settings. OpsAI then analyzes new problems automatically and proposes a fix you can review and apply.

Dashboard widget builder

The widget builder works on the dashboard you are currently in. It already knows which dashboard that is, so you do not need to name it. It only builds and edits widgets - create, update, delete, and arrange them - and it discovers the correct resource, metric, filters, and group-by for you before building.

Good to know:

It operates on the current, existing dashboard only. It does not create or delete dashboards, or switch you to another one.
If you do not say how many widgets you want, it creates up to 10 in a single request. Ask for more explicitly if you need them.
Updates are strict - it changes only the field you mention and leaves every other widget setting as-is.
After creating, updating, or deleting widgets, it tidies the layout automatically unless you tell it not to.
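
Two of the rules above, the default cap of 10 widgets and strict updates, can be read as simple data operations. The sketch below is an assumption-laden illustration: the widget field names (`title`, `metric`, `aggregation`, `group_by`, `chart_type`) and both helper functions are hypothetical and do not reflect the real widget schema.

```python
DEFAULT_MAX_WIDGETS = 10  # from the text: up to 10 per request unless you ask for more


def cap_widgets(requested_specs, explicit_count=None):
    """Keep at most the default number of widgets unless a count is stated."""
    limit = explicit_count if explicit_count is not None else DEFAULT_MAX_WIDGETS
    return requested_specs[:limit]


def strict_update(widget, changes):
    """Return a copy with only the mentioned fields changed; the rest is kept."""
    return {**widget, **changes}


# Hypothetical widget definition; the field names are assumptions.
latency = {
    "title": "Latency",
    "metric": "latency",
    "aggregation": "p99",
    "group_by": "service",
    "chart_type": "line",
}

# "Change the latency widget to show p95 instead of p99."
updated = strict_update(latency, {"aggregation": "p95"})
print(updated)
```

Only `aggregation` differs between `latency` and `updated`, which is why an update prompt should name exactly the field you want changed.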

Create widgets

Add a widget showing CPU usage by host.
Create a line chart of request rate for the checkout service over the last hour.
Add three widgets for the orders-service: p99 latency, error rate, and throughput.
Show memory usage grouped by Kubernetes pod as a time series.
Add a widget for 5xx error count broken down by service.
Create a widget tracking average response time for the api-gateway, filtered to the prod environment.

Update widgets

Change the latency widget to show p95 instead of p99.
Update the CPU widget to group by service instead of host.
Rename the "Errors" widget to "5xx Errors" and keep everything else the same.
Switch the request-rate widget from a line chart to a bar chart.
Add a prod-environment filter to the memory usage widget.

Delete and arrange widgets

Delete the unused throughput widget.
Remove the duplicate CPU widget.
Reorganize the widgets into a clean two-column layout.
Put the latency and error-rate widgets side by side at the top.
Make the request-rate widget full width.

The widget builder discovers real metrics, filters, and group-by dimensions from your data before building - it never guesses. If something you ask for is not available in the current dashboard's data, it will tell you what is missing instead of inventing it.
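
The "discover first, never guess" behavior amounts to checking each request against what actually exists and reporting the gaps. A minimal sketch of that idea follows; `plan_widgets` and the metric names are hypothetical and are not taken from OpsAI or any real metric catalog.

```python
def plan_widgets(requested_metrics, available_metrics):
    """Split requested metrics into those that can be built and those missing."""
    available = set(available_metrics)
    buildable = [m for m in requested_metrics if m in available]
    missing = [m for m in requested_metrics if m not in available]
    return buildable, missing


# Hypothetical metric names, used only for illustration.
available = ["cpu.usage", "memory.usage", "request.rate"]
requested = ["cpu.usage", "disk.iops", "request.rate"]

buildable, missing = plan_widgets(requested, available)
print("build:", buildable)
print("not available:", missing)
```

Here the request for `disk.iops` is reported as missing rather than replaced with a guess, which matches the behavior described above.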

Alert builder

The alert builder turns a plain-language description into a ready-to-apply alert rule configuration. It maps your request to the right alert type, discovers the real metrics and attributes, picks sensible thresholds (including from your live data when you ask), and hands the config to the UI for you to apply. It can also refine the alert draft you are currently editing.