Question 1

What is vulnerability state machine throughput decomposition?

Accepted Answer

State machine throughput decomposition is the discipline of reading remediation throughput per state rather than as one aggregate rate across the lifecycle. The aggregate throughput question is whether the programme is closing findings as fast as it is opening them. The decomposition question is which state in the lifecycle is producing the throughput, which state is acting as the bottleneck, which state holds the largest in-progress queue, which state has the highest abandonment or re-open rate, and which state would change the headline number most if its arrival or exit rate moved. Programmes that report only the aggregate throughput cannot diagnose which transition to invest in; programmes that report per-state arrival, exit, and queue depth turn the aggregate number into an operational diagnostic that names the next intervention.

Question 2

What is the standard vulnerability state machine for an enterprise programme?

Accepted Answer

Every defensible vulnerability programme runs a finite state machine with a small set of named states and a documented transition policy between them. The defensible breakdown contains seven primary states: new (a finding has appeared and has not been triaged), triaged (severity, owner, and validity have been confirmed), assigned (a remediation owner accepts the finding), in remediation (work is in progress against the finding), in retest (the deployed fix is being validated), closed remediated (the finding is verified closed by retest), and closed exception (the finding has been accepted as risk, deferred, or marked false positive after the recorded decision chain). Several programmes add intermediate states for in investigation between triaged and assigned, in change window between in remediation and in retest, or in deferred between assigned and closed exception. The exact state names matter less than the discipline that every finding occupies exactly one state at any moment and every transition is recorded against the live record with timestamp and named user.

Question 3

How does Little's Law apply to vulnerability remediation?

Accepted Answer

Little's Law (L equals lambda times W) connects the average queue depth L to the average arrival rate lambda and the average waiting time W in steady state. For vulnerability remediation, lambda is the rate at which findings enter a state, W is the average time a finding spends in that state, and L is the average queue depth in that state. The implication for remediation programmes is that the per-state queue depth observed in the live record (L) is the product of the rate at which findings flow into that state (lambda) and the average dwell time inside the state (W); a programme that wants to shrink the queue at a given state has to either slow the arrival rate, speed the exit rate, or both. Reporting only the aggregate backlog and the aggregate closure rate hides which state is the binding constraint. Reporting per-state L, lambda, and W identifies the binding constraint directly and turns the budget conversation into a queueing-theory conversation rather than a slogan conversation.

Question 4

What does per-state arrival rate look like in practice?

Accepted Answer

Per-state arrival rate is the count of findings entering each state per unit time, normally measured per week or per month against the same severity bands the SLA targets are written against. New-state arrivals come from scanners, pentests, bug bounty, disclosure, third-party intake, and recurring detection cycles. Triaged-state arrivals come from new-state findings whose triage cycle completed. Assigned-state arrivals come from triaged findings whose ownership routing completed. In-remediation arrivals come from assigned findings the remediation owner picked up. In-retest arrivals come from in-remediation findings whose fix was deployed. Closed-state arrivals come from in-retest findings whose retest passed or closure-exception findings whose risk acceptance decision was approved. Programmes that report only the new-state arrival rate read scanner output, not throughput; the discipline is to report arrival rates at every transition because the bottleneck signature shifts between states across the programme lifecycle.

Question 5

What does per-state exit rate look like in practice?

Accepted Answer

Per-state exit rate is the count of findings leaving each state per unit time. For non-terminal states the exit rate is the transition rate into the next state. For terminal states (closed remediated, closed exception, closed false positive) the exit rate is the closure rate into the audit-evidence ledger. The defensible reporting pattern is the arrival rate and the exit rate side by side at the same severity band so the per-state delta makes the queue dynamic visible: a positive delta grows the queue, a negative delta shrinks it, a zero delta holds it in steady state. Programmes that report only the exit rate (the closure number) hide the queue dynamic by treating findings that left the state as if they had been resolved when many of them merely moved to the next state.

Question 6

What are the standard per-state failure modes?

Accepted Answer

Seven failure modes recur across enterprise programmes. New-state pileup happens when scanner output enters the queue faster than triage capacity reads it, so the new-state queue grows until severity calibration breaks down. Triaged-state drift happens when severity is set at triage but ownership routing stalls, so triaged findings sit unassigned. Assignment-loop happens when ownership routing rejects findings back to triage because the affected asset cannot be identified or the remediation owner does not own the asset class. In-remediation-stall happens when the fix is blocked on a change window, a dependency upgrade, or a compensating-control negotiation, and the dwell time exceeds the SLA on the in-remediation state alone. In-retest-bottleneck happens when retest capacity sits behind unrelated engagement work, so fixes are deployed but verification is delayed and the SLA on the in-retest state alone fails. Exception-drift happens when findings move into closed-exception without the documented decision chain, inflating closure throughput at the cost of audit-defensible governance. Reopened-loop happens when closed-remediated findings reopen on retest or rediscovery, so closures from prior periods reappear as arrivals into the new-state queue.

Question 7

How should per-state SLA targets be set?

Accepted Answer

Per-state SLA targets are tighter than per-finding SLA targets because each state owns only a slice of the end-to-end window. CISA Binding Operational Directive 22-01 sets a 14-day end-to-end window for known exploited vulnerabilities. Reading that window across the seven-state machine implies upper bounds of roughly two working days for new-state triage, two working days for ownership assignment, six working days for in-remediation work, two working days for in-retest verification, and the remainder for administrative closure. PCI DSS Requirement 6.3.3 sets a 30-day window for high-risk vulnerabilities, which permits more generous per-state slices. ISO 27001 Annex A 8.8 expects programme-defined cadence; per-state SLA targets named in the programme policy are the audit-defensible artefact. The discipline is to publish per-state SLA targets at the same severity bands the end-to-end SLA is published against so the programme can read which state actually violates the end-to-end window when a finding ages out.

Question 8

What does the per-state cycle-time tail look like?

Accepted Answer

Per-state cycle-time distributions are heavy-tailed in every state that involves human decision-making, scheduled change windows, or third-party dependencies. The median cycle time inside a state is usually a fraction of the 90th-percentile cycle time, and the 90th-percentile cycle time is usually a fraction of the 99th-percentile cycle time. Programmes that report median per-state cycle time only describe the experience of the median finding; programmes that report p50, p90, and p99 per state describe the tail risk and the SLA risk simultaneously. The defensible reporting pattern names the tail because the tail drives audit findings and customer escalations more often than the median, and the tail is where exception conversion and abandonment cluster. The two failure patterns to watch are state-level cycle-time tail growth (p99 moving away from p50 inside one state) and state-level cycle-time tail mass transfer (p99 finishing one state but stalling in the next), both of which look fine on aggregate median MTTR while the operational reality has changed.

Question 9

How does the closed-exception path interact with throughput decomposition?

Accepted Answer

Closed-exception is a terminal state with the same audit visibility as closed-remediated but with a fundamentally different risk meaning. Programmes that treat closed-exception arrivals as equivalent to closed-remediated arrivals in throughput reporting inflate the closure rate by the volume of risk acceptance decisions rather than the volume of fixes deployed. The defensible reporting pattern names closed-remediated and closed-exception as separate terminal arrivals at every severity band; the exception arrival count is the input into the risk acceptance register that the audit committee reads alongside the closure narrative. The mechanism is the recorded decision chain (override class, rationale, scope, approver, expiry, evidence pointer) that the exception lifecycle and expiry discipline runs on the override register. Throughput decomposition becomes audit-defensible when the closed-exception arrival rate is reported next to the closed-remediated arrival rate rather than collapsed into a single closure number.

Question 10

How does the reopen path interact with throughput decomposition?

Accepted Answer

Reopened findings are arrivals back into the new-state queue or the assigned-state queue from a terminal state (closed-remediated or closed-exception). They arrive because retest discovered the fix did not close the finding, recurring detection saw the finding regenerate, the underlying asset was restored from backup, the deployed compensating control failed, or the original triage classification was overturned during post-mortem. The defensible reporting pattern treats reopen arrivals as a distinct arrival class so the inflow rate at the new-state queue can be decomposed into fresh detection inflow versus reopen inflow. Programmes that combine reopen arrivals with fresh detection arrivals lose the diagnostic signal that says closures from prior periods are unstable; programmes that report reopen rate per terminal state read whether the closure discipline is durable and which terminal state has the durability problem.

Question 11

What are the audit reads that survive scrutiny?

Accepted Answer

Eight audit reads survive scrutiny when the throughput decomposition is grounded in the live state record rather than reconstructed at audit week. The first is the per-state queue depth on the assessment date with the matching per-state queue depth twelve cycles earlier. The second is the per-state arrival rate trend across the observation window per severity band. The third is the per-state exit rate trend across the observation window per severity band. The fourth is the per-state cycle-time distribution (p50, p90, p99) for the observation window per severity band. The fifth is the per-state SLA breach count for the observation window. The sixth is the closed-exception arrival rate per severity band with the matching exception register currency and renewal-cadence indicator. The seventh is the reopen rate per terminal state. The eighth is the per-transition rejection rate (the count of findings that bounced back from one state to a prior state). The eight reads are the throughput-decomposition equivalent of the management review pack that NIST CSF 2.0 GV.OV, ISO 27001 Clause 9.3, and SOC 2 CC4.1 expect.

Question 12

Where does per-state ownership sit in the data model?

Accepted Answer

Per-state ownership lives on the finding record as an attribute distinct from the asset owner and the engagement owner. New-state ownership sits with the triage queue owner. Triaged-state ownership sits with the routing queue owner. Assigned-state ownership sits with the named remediation owner. In-remediation ownership stays with the remediation owner until the deployment ledger records the fix. In-retest ownership transitions to the retest queue owner. Closed-state ownership transitions back to the engagement owner or the GRC partner for evidence archival. Programmes that route everything to a single team queue collapse the ownership signal across states and lose the diagnostic that says which queue is the binding constraint at this moment. The defensible discipline names a per-state owner role and a default holder per role so the routing rule can fire deterministically on each transition.

Question 13

How does scanner inflow shape the new-state queue?

Accepted Answer

Scanner inflow is the dominant arrival source for the new-state queue in mature programmes that have automated discovery. The inflow shape depends on the scan cadence (daily, weekly, biweekly, monthly), the per-cycle scan coverage rate, the per-scanner finding signature (a noisy scanner doubles new-state arrivals without doubling real risk), and the deduplication policy at intake (per-target collapse, cross-target merge, severity normalisation). Programmes that operate continuous monitoring cadences with deterministic identity and intake-time dedup see a stable new-state arrival curve that tracks real risk; programmes that run scanners ad hoc with weak identity see arrival spikes that swamp triage capacity and produce the new-state pileup failure mode. The new-state arrival rate is the leading indicator for triage capacity planning, and the triage capacity is the lever that determines whether new-state queue depth stays bounded or grows without bound.

Question 14

How do change windows distort per-state cycle time?

Accepted Answer

Change windows distort per-state cycle time inside the in-remediation state and inside the in-retest state. A weekly production change window with a freeze on the rest of the week produces in-remediation cycle-time distributions clustered at one-week increments, regardless of the underlying remediation work. A two-week change window on a strict CAB approval cadence produces two-week clusters. A monthly maintenance window produces monthly clusters. Programmes that report per-state cycle time without naming the change-window cadence read random variation as a remediation team problem when the underlying cause is the change-window cadence the platform team operates. The defensible diagnostic is to overlay the per-state cycle-time histogram against the change-window calendar so the distortion is visible and the budget conversation about change-window relaxation has the right evidence.

Question 15

How does SecPortal help with state machine throughput decomposition?

Accepted Answer

SecPortal pairs every finding to a versioned engagement record with a status group field (active, fixed, accepted_risk, false_positive) and a severity field (info, low, medium, high, critical) so the per-state queue depth and per-state arrival and exit rates are queries against the live record rather than reconstructions from spreadsheets. The activity log captures every state change as a named-actor entry with timestamp covering the lifecycle for finding, engagement, scan, document, comment, invoice, and team entities; retention windows of 30, 90, or 365 days set by workspace plan support per-state cycle-time analysis across an observation cycle. Findings management captures CVSS 3.1 vector, severity band, owner, evidence, and remediation status. The finding-overrides discipline records the eight-field decision chain for closed-exception transitions (override class, manual severity, exception rationale, approver, scope, expiry, evidence pointer, version stamp) so the closed-exception arrival rate is queryable next to the closed-remediated arrival rate. Retesting workflows pair retest records to original findings so the in-retest queue and the reopen rate per terminal state are observable. Bulk finding import accepts .nessus, .xml, and .csv intake so third-party scanner output joins the same state machine on the same record. Continuous monitoring schedules run daily, weekly, biweekly, and monthly cadences so the new-state arrival curve is queryable across cycles. Compliance tracking maps findings to ISO 27001, SOC 2, PCI DSS, and NIST frameworks with CSV export so the throughput decomposition reads carry framework citations. SecPortal does not push state-change events into Jira, ServiceNow, Slack, SIEM, SOAR, or GRC platforms; the discipline is a query against the live engagement record rather than an automation between separate consoles. SecPortal does not provide enterprise SSO, SCIM, or SAML, does not provide automated risk-acceptance approval routing, and does not assign per-state SLA targets for the programme.

State	Default owner	Exit transition
New	Triage queue owner	Severity, asset binding, validity, category confirmed; move to triaged.
Triaged	Routing queue owner	Named remediation owner accepts; move to assigned.
Assigned	Named remediation owner	Work begins; move to in-remediation.
In remediation	Named remediation owner	Fix deployed and ready for verification; move to in-retest.
In retest	Retest queue owner	Retest passes; move to closed-remediated. Retest fails; move back to assigned with reopen reason.
Closed remediated	Engagement owner / GRC partner	Terminal. Reopen on rediscovery returns finding to new state with reopen reason.
Closed exception	Override approver / GRC partner	Terminal. Reopen on override expiry or revocation returns finding to triaged state.

State	CISA BOD 22-01 14-day envelope	PCI DSS 6.3.3 30-day envelope
New triaged	2 working days	3 working days
Triaged assigned	2 working days	3 working days
Assigned in-remediation	2 working days	4 working days
In-remediation in-retest	6 working days	15 working days
In-retest closed-remediated	2 working days	5 working days

Read	What it answers
1. Per-state queue depth at period end versus twelve cycles earlier	Where the backlog actually lives and how it has moved across the year.
2. Per-state arrival rate trend	Whether discovery, triage, assignment, remediation, and retest are growing or shrinking as load sources.
3. Per-state exit rate trend	Whether the capacity at each transition is keeping up with the corresponding arrival rate.
4. Per-state cycle-time distribution (p50, p90, p99)	Where the heavy tail sits and whether the tail is growing or transferring downstream.
5. Per-state SLA breach count	Which state actually violates the end-to-end SLA when a finding ages out.
6. Closed-exception arrival rate with exception register currency	Whether closure is happening through remediation or through risk acceptance.
7. Reopen rate per terminal state	Whether closures from prior periods are durable; which terminal state has the durability problem.
8. Per-transition rejection rate	How often findings bounce back from one state to a prior state; which arc has the rejection-loop signature.

Vulnerability State Machine Throughput Decomposition

The state machine is the unit of analysis

The seven primary states and the transitions between them

Per-state arrival, exit, and queue-depth

Reading the three numbers together

Per-state failure modes

1. New-state pileup

2. Triaged-state drift

3. Assignment-loop

4. In-remediation stall

5. In-retest bottleneck

6. Exception drift

7. Reopened-loop

Per-state SLA targets and the end-to-end window

Per-state cycle-time distributions are heavy-tailed

Two tail patterns to watch

Closed-exception and reopen interact with the headline number

Eight audit reads that survive scrutiny

Four maturity stages on the way to decomposed throughput

Stage 1: Aggregate throughput only

Stage 2: Per-state queue depth

Stage 3: Per-state arrival and exit rates

Stage 4: Per-state cycle-time tails and audit-defensible reporting

For security leadership, AppSec, GRC, and audit committees

How SecPortal supports state machine throughput decomposition

Conclusion

FAQ

Sources

Run the decomposition against your live record