SigSentrySigSentry

Time windows

Strategies for picking the time window — the single highest-leverage choice for diagnosis quality

The time window tells SigSentry where to look. Pick well and you get a clean diagnosis; pick poorly and you get noise (or nothing).

The principle

Tighter is better. A 30-minute window with 200 relevant log lines gives a sharper diagnosis than a 24-hour window with 200,000.

Strategies by situation

You know...Pick
Roughly when the issue startedA window starting 5 minutes before that, ending 5 minutes after the latest symptom
The issue is ongoingLast 15 or 30 minutes
The issue happened sometime todayUse the dashboard's timeline view to find the spike, then pick a window around it
Nothing about timingLast 1 hour. Re-run with a tighter window once you have a better starting point

Sizing rules of thumb

WindowWhen to use
5–15 minAcute issues with a clear "happening right now" signal
15–30 minMost production incidents — long enough to capture a pattern, short enough to stay focused
30–60 minSlow-burn issues, intermittent failures, post-deploy regressions
1–6 hoursIssues that span multiple deploys or environment changes
24h+Pattern-matching against historical context. Avoid for a single incident — too noisy

When to widen the window

If the diagnosis comes back with logsScanned very low (under ~50) and the result feels generic, the window is probably too narrow.

Logs scannedAction
< 10Window almost certainly too narrow. Widen 2–4×.
10–50Borderline. Widen if the result is generic.
50–500Good range for most analyses.
500–5,000Plenty of evidence. Result should be sharp.
> 5,000Window probably too wide. Tighten and re-run for a sharper diagnosis.

When to narrow the window

If the diagnosis identifies multiple unrelated issues or jumps around between services, the window probably caught two separate incidents. Narrow it to the most recent symptom and re-run.

The result schema's timeline field can help — if the timeline shows gaps of unrelated events, those are clues about the window's size.

Working with intermittent issues

For issues that recur (every 30 minutes, every hour, etc.):

First analysis: capture one occurrence

Pick a window around one specific occurrence. Get a baseline diagnosis.

Second analysis: span multiple occurrences

Widen to capture 2–3 occurrences. Compare results — if root cause matches, the AI is finding signal across iterations. If it diverges, your "intermittent" issue might actually be multiple distinct issues.

Use Watchdog

Once you understand the pattern, set up a Watchdog rule to catch future occurrences automatically.

Time zones

Time windows are stored as ISO-8601 timestamps in UTC. The dashboard displays them in your browser's local timezone. The API expects UTC in timeStart and timeEnd; if you pass local-time strings without a Z suffix, behavior is undefined.

If your team is distributed, picking a window from the dashboard is safer than building one in code.