Production SRE workflow smoke 1782816060
Incident ID: org_devdocs:prod-smoke-1782816060
pageresolvedbetter_stack
Updated 6/30/2026, 10:41:03 AMProduction SRE workflow smoke 1782816060
Live smoke test for approval/remediation/repo fleet on production.
Org
org_devdocs
Service
sre.devdocs.ai
Repository
devdocsorg/devdocsai-sre
Environment
production
Approval
approved
Duplicates
1
Evidence
Accepted better_stack signal for sre.devdocs.ai.
better_stack
{
"orgId": "org_devdocs",
"repository": "devdocsorg/devdocsai-sre",
"service": "sre.devdocs.ai",
"environment": "production",
"severity": "page",
"title": "Production SRE workflow smoke 1782816060",
"summary": "Live smoke test for approval/remediation/repo fleet on production.",
"fingerprint": "prod-smoke-1782816060",
"dedupKey": "better_stack:prod-smoke-1782816060",
"payload": {
"source": "codex-production-smoke",
"commit": "1895088"
}
}Org org_devdocs / repo devdocsorg/devdocsai-sre / env production.
sre-control-planedevdocsorg/devdocsai-sre
Likely sre.devdocs.ai degradation surfaced through better_stack.
triage-agentsearch_service_tools
Set DEVDOCS_MCP_API_KEY or SRE_MCP_API_KEY in Vercel to enable direct production MCP tool calls.
devdocs-mcpmcp.devdocs.ai/mcp
{
"requiredProviders": [
"github",
"vercel_token_auth",
"better_stack",
"fly_io",
"neon_api_keys",
"cloudflare_api_key",
"slack_v2",
"statebacked",
"sentry"
],
"configuredEnv": "missing"
}Investigate sre.devdocs.ai across Better Stack, Vercel deployments/events, GitHub history, Cloudflare DNS/audit, Fly, Neon, StateBacked, Sentry, and Slack context before remediation.
investigator-agentsearch_service_tools + provider-specific read toolsdevdocsorg/devdocsai-sre
Trace
Incident resolved after production verification.
resolved
Production Smoke marked production verification passed.
postmortem
{
"details": "Health endpoint and action APIs passed on sre.devdocs.ai."
}Remediation lane is ready for production verification gates.
verifying
Production Smoke started the origin/main-only remediation lane.
remediating
{
"releasePolicy": "push-origin-main -> Vercel build -> production verification on sre.devdocs.ai"
}Production Smoke approved remediation.
awaiting_approval
{
"decision": {
"id": "3417cdec-2472-4dde-bf73-bdf9bcfe4771",
"actor": "Production Smoke",
"decision": "approved",
"reason": null,
"at": "2026-06-30T10:41:03.272Z"
}
}Duplicate signal received from better_stack.
investigating
{
"duplicateCount": 1
}Seeded initial evidence, runtime MCP readiness, and hypotheses for the operator console.
investigating
Incident created from better_stack signal.
detected
Approval Ledger
6/30/2026, 10:41:03 AM
Hypotheses
Current walking skeleton uses source/severity heuristics until live MCP evidence is attached.
Remediation Plan
devdocs-mcp · search_service_tools
Risk: low
Preconditions: MCP API key available · provider account connected
Rollback: No-op; read-only.
github/vercel · github + vercel_token_auth
Risk: medium
Preconditions: Root cause confirmed · human approval captured · fix branch tested locally
Rollback: Revert on origin/main if production verification fails.
Execution plan recorded. Code changes must be pushed to origin/main, built by Vercel, and verified on sre.devdocs.ai before resolving.
Verification
Health endpoint and action APIs passed on sre.devdocs.ai.
Health endpoint and action APIs passed on sre.devdocs.ai.
RCA Draft
# RCA for Production SRE workflow smoke 1782816060 - Severity: page - Source: better_stack - Initial summary: Live smoke test for approval/remediation/repo fleet on production. - Current state: live evidence collection pending in standalone SRE console. ## Verification - Verified by: Production Smoke - Verified at: 2026-06-30T10:41:03.984Z - Result: production checks passed on sre.devdocs.ai - Details: Health endpoint and action APIs passed on sre.devdocs.ai.