Vulnerability Remediation Throughput: How Internal Security Teams Move Findings to Closed

Throughput is not MTTR with a different name. It is the rate at which findings move from verified open to verified closed across an observation window, paired against inflow at the same severity bands so leadership can read whether the programme is closing risk, moving it into the exception register, or letting it accumulate as backlog. Internal security teams that report a single MTTR figure lose the diagnostic value of the cycle-time breakdown. Vulnerability management leads who skip the inflow-versus-closure ratio answer the wrong question for the audit committee. The defensible discipline is paired metrics on the same severity bands the SLA targets are written against.1,3,4,5

This research lays out how vulnerability remediation throughput actually behaves inside enterprise security programmes. It covers the cycle-time stages that shape it, the bottleneck classes that throttle it, the metrics that survive audit scrutiny, the governance pattern that keeps closure durable, and the relationship between throughput, backlog, exceptions, and inflow. The argument is not that one MTTR figure is right or wrong. The argument is that throughput is a system property of the remediation pipeline, and measuring it as a single number hides the bottleneck the programme actually needs to fix.5,6,7,12,13

Throughput, inflow, and backlog are three separate questions

When a security leader asks how the remediation programme is performing, the question collapses three sub-questions into one sentence. The first is the inflow question: how fast are new findings appearing from scanners, pentests, bug bounty, and disclosure. The second is the throughput question: how fast are findings moving from open to closed. The third is the backlog question: what is the steady-state count of currently open findings. Programmes that answer one at the cost of the others end up with confident metrics that fail the leadership read.

The three quantities interact rather than operate independently. Inflow that exceeds throughput grows the backlog. Throughput that exceeds inflow shrinks it. Backlog at steady state means inflow and throughput are matched at that level. The three values cannot be inferred from each other; reporting one in isolation answers a different question than the audit committee, the regulator, or the engineering director is actually asking.
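As a sketch of how the three quantities relate, the backlog at the end of each period is the previous backlog plus inflow minus closures. The function below is illustrative and the figures are invented, but it shows how a flat closure count against a rising inflow still grows the queue.

```python
# Minimal sketch: backlog trajectory implied by per-period inflow and closures.
# All figures are illustrative.

def backlog_trajectory(starting_backlog, inflow, closures):
    """Open-finding count at the end of each period."""
    backlog = starting_backlog
    trajectory = []
    for opened, closed in zip(inflow, closures):
        backlog = backlog + opened - closed
        trajectory.append(backlog)
    return trajectory

# Closure throughput is flat at 35 per period while inflow rises,
# so the backlog grows even though the closure count looks stable.
print(backlog_trajectory(120, inflow=[40, 45, 50], closures=[35, 35, 35]))
# [125, 135, 150]
```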

CISA BOD 22-01, PCI DSS v4.0 Requirement 6.3.3, ISO 27001 Annex A 8.8, NIST SP 800-53 RA-5, and SOC 2 CC7.1 each frame the question slightly differently, but each implicitly assumes the programme is tracking all three quantities and not just the SLA-bound closure rate. Programmes that pass the SLA on closure but accumulate exceptions or grow the backlog are technically compliant on the closure axis and substantively at risk on the others.1,3,4,6,7

The six stages of remediation cycle time

Cycle time is elapsed working time from finding open to finding verified closed. The single number is useful only when it is broken into the six stages each finding actually traverses. Each stage has a different bottleneck pattern, a different responsible role, and a different intervention if the cycle time is too long.

| Stage | Question it answers | Common bottleneck |
| --- | --- | --- |
| 1. Triage (open to triaged) | Is this finding real, severity-correct, and not a duplicate of an open or recently-closed finding? | Scanner noise, severity calibration disputes, missing duplicate suppression. |
| 2. Assignment (triaged to owned) | Who owns the affected asset and is responsible for the remediation decision? | Findings routed to a queue rather than a named role; ownership ambiguity across team boundaries. |
| 3. Investigation (owned to fix designed) | Can the owner reproduce the finding, identify the affected version, and design a fix? | Insufficient evidence on the finding, ambiguous affected scope, dependency upgrade research. |
| 4. Remediation (fix designed to fix deployed) | Has the fix shipped through the change-management pipeline to the affected environment? | Change windows, dependency conflicts, compensating control negotiation, regression testing. |
| 5. Verification (fix deployed to retest passed) | Has the deployed fix been retested independently and confirmed to close the finding? | Retest queue depth, scanner re-run scheduling, manual retest capacity. |
| 6. Closure (retest passed to closed) | Has the closure been recorded with the verifying evidence on the live engagement record? | Administrative drag, evidence-capture friction, missing closure-record fields. |

Median cycle time per stage is more diagnostic than median cycle time per finding. The same headline MTTR can mean a slow triage queue with fast remediation, or a fast triage queue with slow verification, and the two pictures call for opposite interventions. The discipline that scales is to publish the stage breakdown rather than the headline figure, and to publish the tail (90th and 95th percentile) rather than only the median.12,13
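A minimal sketch of that stage breakdown, assuming each finding record carries a timestamp per state transition; the state names and field layout here are illustrative rather than a specific tool's schema.

```python
from statistics import median, quantiles

# Stage name, start state, end state. State names are illustrative.
STAGES = [
    ("triage", "opened", "triaged"),
    ("assignment", "triaged", "owned"),
    ("investigation", "owned", "fix_designed"),
    ("remediation", "fix_designed", "fix_deployed"),
    ("verification", "fix_deployed", "retest_passed"),
    ("closure", "retest_passed", "closed"),
]

def stage_cycle_times(findings):
    """Median and 90th-percentile elapsed days per stage.

    Each finding is a dict mapping state names to datetime objects for the
    transitions it has reached; findings that have not reached a stage are
    skipped for that stage rather than counted as zero.
    """
    report = {}
    for stage, start, end in STAGES:
        durations = [
            (f[end] - f[start]).total_seconds() / 86400
            for f in findings
            if start in f and end in f
        ]
        if len(durations) >= 2:  # quantiles() needs at least two observations
            report[stage] = {
                "median_days": round(median(durations), 1),
                "p90_days": round(quantiles(durations, n=10)[-1], 1),
            }
    return report
```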

SLA targets per severity band

Throughput targets work when they are anchored to an external SLA reference rather than chosen from internal precedent. Internal-only targets produce SLA windows the programme cannot defend in audit; externally anchored targets let the programme report performance against a window the audit committee already understands.1,2,3

| Severity | External anchor | Defensible window |
| --- | --- | --- |
| Known exploited (KEV) | CISA BOD 22-01 (US federal civilian agencies; widely adopted as private-sector benchmark). | 14 days from KEV catalog inclusion or local detection, whichever is earlier. |
| Critical (CVSS 9.0 to 10.0) | PCI DSS Requirement 6.3.3; many sector-specific frameworks; SSVC act-now classification. | 15 to 30 days; tighter for internet-facing critical assets; longer windows are hard to justify. |
| High (CVSS 7.0 to 8.9) | PCI DSS Requirement 6.3.3 high-risk window; ISO 27001 Annex A 8.8 cadence justification. | 30 days; risk assessment can justify tighter for known-exploit or KEV cross-reference. |
| Medium (CVSS 4.0 to 6.9) | Programme-defined cadence justified by risk assessment; commonly aligned to release cycles. | 60 to 90 days; cadence rather than countdown is the durable form. |
| Low (CVSS 0.1 to 3.9) | Programme-defined; commonly batched into the next major-version refresh. | Quarterly cadence or next major release; rolling backlog is acceptable if movement is documented. |

The reporting form that survives scrutiny is in-SLA closure rate per severity band over the observation period, with the count of out-of-SLA closures and the count of expired exceptions surfaced as separate lines. A programme reporting 95% in-SLA closure on critical findings with 12 expired exceptions is in a different operational state than a programme reporting 95% in-SLA closure with zero expired exceptions, and the leadership read should reflect that distinction.1,3,9
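A minimal sketch of that reporting form, with illustrative SLA windows; the low-severity window in particular is a placeholder, since the table above frames it as a cadence rather than a countdown.

```python
from datetime import timedelta

# Illustrative windows in days; a real programme would anchor these to the
# external references in the table above.
SLA_DAYS = {"kev": 14, "critical": 30, "high": 30, "medium": 90, "low": 180}

def closure_report(closed_findings, expired_exceptions):
    """In-SLA closure rate per band, with out-of-SLA closures and expired
    exceptions surfaced as separate lines rather than folded into the rate."""
    report = {}
    for band, window in SLA_DAYS.items():
        closed = [f for f in closed_findings if f["severity"] == band]
        in_sla = [
            f for f in closed
            if f["closed_at"] - f["opened_at"] <= timedelta(days=window)
        ]
        report[band] = {
            "closed": len(closed),
            "in_sla_rate": round(len(in_sla) / len(closed), 3) if closed else None,
            "out_of_sla_closures": len(closed) - len(in_sla),
            "expired_exceptions": sum(
                1 for e in expired_exceptions if e["severity"] == band
            ),
        }
    return report
```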

Five bottleneck classes that throttle throughput

Programmes that have already automated discovery (scanners running, pentests scheduled, bug bounty open) usually have throughput problems on the closure side rather than the discovery side. Five bottleneck classes account for most of the closure-side loss. Each presents differently in the cycle-time stage breakdown, and each calls for a different operational intervention.

1. Triage latency

Scanner output sits unread because severity calibration is unclear, duplicate suppression is missing, or the queue is too noisy to read. Triage stage cycle time is high; investigation and remediation stage cycle times look healthy because the findings that reach those stages are the ones the team had bandwidth to triage. The fix is at intake: deduplicate at scanner-output stage, calibrate severity using CVSS plus environmental context (asset exposure, data sensitivity, exploit availability via EPSS or KEV), and gate intake to a named triage role rather than a shared queue.9,10
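A minimal sketch of the intake-side deduplication, assuming each scanner result can be keyed on asset, detection rule, and location; the key fields are illustrative, and the right key depends on the scanners in use.

```python
def dedup_key(result):
    """Collapse repeat detections of the same issue on the same asset."""
    return (result["asset_id"], result["rule_id"], result.get("location", ""))

def triage_queue(scanner_results, open_or_recent_keys):
    """Keep one result per key and drop anything already open or recently
    closed, so the named triage role reads a queue of genuinely new findings."""
    seen = set()
    queue = []
    for result in scanner_results:
        key = dedup_key(result)
        if key in open_or_recent_keys or key in seen:
            continue
        seen.add(key)
        queue.append(result)
    return queue
```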

2. Ownership latency

Findings are routed to a queue rather than a named role; the assignment stage cycle time grows because the routing rules cannot resolve which engineering owner is responsible for the affected asset. The fix is to capture asset ownership inside the system of record so routing is a query rather than a judgement call, and to commit to a named role per finding rather than a queue. Programmes that pair the asset to a named owner remove the assignment latency without adding capacity.

3. Investigation latency

Owners cannot reproduce the finding, cannot identify the affected version, or cannot find adequate evidence on the finding record. Investigation stage cycle time grows; remediation stage cycle time is healthy because the fixes that reach remediation already had clear scope. The fix is at finding quality: capture reproduction steps, scanner module, request and response evidence, affected version, and CVE or CWE mapping at intake rather than after triage. Findings that arrive at the engineering owner with the evidence they need do not stall in investigation.

4. Remediation latency

Fixes are blocked on change windows, dependency upgrades, regression testing, or compensating control negotiation. Remediation stage cycle time grows; investigation stage cycle time is healthy because the bottleneck is downstream of fix design. The fix is rarely a security-team intervention; it is a change-management discipline that schedules vulnerability remediation alongside the rest of the engineering pipeline rather than in a separate workflow that competes with it. Programmes that surface remediation latency to leadership as a change-management metric rather than a security metric are usually the ones that close it.

5. Verification latency

Retest is queued behind unrelated work; the verification stage cycle time grows; closure is administratively recorded before retest confirms it. Findings that should have been re-opened on retest are instead closed and re-discovered later. The fix is to treat retest as a separate cycle with its own SLA and its own queue, surface findings stuck in retest as a leading indicator of future re-opens, and tie retest evidence to the same engagement record the original finding lives on so the verification trail is reproducible.5,11

Why MTTR alone misleads

Mean time to remediate, reported as a single number across the whole programme, is the most-published and least-diagnostic metric in vulnerability management. The metric collapses several distinct operational pictures into one number, and the four failure modes below show why the operational read is rarely what the headline implies.

Severity blending

A weighted MTTR across criticals, highs, mediums, and lows produces a number whose value is determined more by the mix of severities closed in the period than by the speed of the programme. Closing a backlog of low-severity findings drops the headline MTTR without improving critical-finding response. Reporting MTTR per severity band keeps the metric honest.
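A toy illustration of the blending effect, with invented figures: a period that closes two slow criticals and a batch of fast lows produces a headline that reads well and a per-band read that does not.

```python
from statistics import mean

# Invented closures for one period: two slow criticals, six fast lows.
closures = (
    [{"severity": "critical", "days": d} for d in (42, 55)]
    + [{"severity": "low", "days": d} for d in (3, 4, 5, 6, 7, 8)]
)

blended_mttr = mean(c["days"] for c in closures)
per_band = {
    band: mean(c["days"] for c in closures if c["severity"] == band)
    for band in {c["severity"] for c in closures}
}

print(blended_mttr)  # 16.25 days: the headline looks healthy
print(per_band)      # criticals average 48.5 days, lows 5.5: the real picture
```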

Tail concealment

Median MTTR ignores the tail by definition. A programme with a 7-day median and a 60-day 95th percentile is in a different operational state than a programme with a 7-day median and a 14-day 95th percentile. The tail is where SLA breaches live; reporting only the median lets the audit committee miss the breaches the programme is recording elsewhere.

Exception inflation

Exception closure is fast because the administrative path is short; remediation closure is slow because the engineering path is long. Programmes that count both as closures see headline MTTR improve as exception count grows. Reporting remediated-closure MTTR separately from exception-closure MTTR removes the inflation; reporting open exception count alongside the remediation metrics keeps the residual-risk picture intact.
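A minimal sketch of that split, assuming each closure record carries a closure type; the field names are illustrative.

```python
from statistics import mean

def closure_mttr_by_type(closures):
    """Report remediated-closure and exception-closure MTTR separately,
    plus the exception share of all closures in the period."""
    remediated = [c["days_open"] for c in closures if c["closure_type"] == "remediated"]
    excepted = [c["days_open"] for c in closures if c["closure_type"] == "exception"]
    return {
        "remediated_mttr_days": mean(remediated) if remediated else None,
        "exception_mttr_days": mean(excepted) if excepted else None,
        "exception_share": len(excepted) / len(closures) if closures else None,
    }
```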

Re-open invisibility

Closures that fail retest and re-open are sometimes recorded as new findings rather than as re-opens. The programme reports a fast original close and a fast new close, and the headline MTTR is healthy. The underlying picture is one finding that has been worked twice. Reporting re-open rate as a separate metric and tying re-opens to the original finding identifier rather than minting a new identifier preserves the durability question.
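A minimal sketch of the re-open rate, assuming re-opens are recorded against the original finding identifier; the lookback window and field names are illustrative.

```python
from datetime import timedelta

def reopen_rate(closures, reopens, lookback_days=90):
    """Share of closures that re-opened within the lookback window of closing.

    If a finding re-opens more than once, only its latest re-open record is
    considered, which is enough for a rate but not for a full history.
    """
    reopen_times = {r["finding_id"]: r["reopened_at"] for r in reopens}
    reopened = 0
    for c in closures:
        reopened_at = reopen_times.get(c["finding_id"])
        if reopened_at and reopened_at - c["closed_at"] <= timedelta(days=lookback_days):
            reopened += 1
    return reopened / len(closures) if closures else None
```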

The five paired metrics that survive scrutiny

Programmes that report throughput in a way that survives audit committee, regulator, and engineering director scrutiny converge on a small set of paired metrics. The list below is the durable shape of the reporting frame.

1. SLA-bound closure rate per severity band

Percentage of findings closed inside the SLA window per severity band. Reads the in-window closure discipline; pairs the throughput question to the SLA contract. This is the metric the audit committee wants when they ask whether the programme is meeting its commitments.

2. Inflow-versus-closure ratio per severity band

Per-period count of findings opened against findings closed at the same severity band. Reads whether the backlog is growing, shrinking, or steady. This is the metric the security director wants when they ask whether the programme is keeping up with the discovery surface. A minimal computation sketch for this ratio follows the fifth metric below.

3. Exception-to-remediation ratio

Per-period count of exception closures against remediated closures at the same severity band. Reads whether the programme is closing risk or moving it into the exception register. This is the metric the GRC owner wants when they ask whether the residual-risk profile is stable.

4. Re-open rate

Percentage of findings closed and then re-opened on retest or rediscovery within a defined lookback window. Reads whether closures are durable. This is the metric the technical leader wants when they ask whether remediation work actually closes the underlying issue. The vulnerability reopen rate research covers the lookback windows, mechanism breakdown, and identifier-discipline pattern that make this metric honest.

5. Stage-cycle-time breakdown

Median and 90th-percentile cycle time per stage (triage, assignment, investigation, remediation, verification, closure). Reads where the bottleneck actually sits. This is the metric the operational lead wants when they ask which intervention will move the programme.
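As a sketch of the second metric above, the inflow-versus-closure ratio is a pair of counts and a ratio per severity band; the field names are illustrative.

```python
from collections import Counter

def inflow_vs_closure(opened, closed):
    """Per-band opened and closed counts for one period, with the ratio as the
    backlog-direction read (above 1.0 means the band's backlog is growing)."""
    opened_counts = Counter(f["severity"] for f in opened)
    closed_counts = Counter(f["severity"] for f in closed)
    report = {}
    for band in sorted(set(opened_counts) | set(closed_counts)):
        o, c = opened_counts[band], closed_counts[band]
        report[band] = {
            "opened": o,
            "closed": c,
            "ratio": round(o / c, 2) if c else None,
        }
    return report
```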

Backlog versus throughput dynamics

Backlog and throughput interact non-linearly. The relationship is rarely captured in single-period reporting and is one of the largest sources of disagreement between security leaders and engineering leaders during budget cycles.

Growing backlog: triage capacity becomes the constraint

When inflow exceeds throughput, the open queue grows. Triage capacity is finite and the marginal cost of triaging the next finding rises with queue depth (more duplicates, more re-discoveries, more severity calibration disputes). Programmes whose backlog is growing typically see triage stage cycle time grow disproportionately, even when remediation stage cycle time is stable. Adding remediation capacity does not fix this picture; reducing inflow noise or adding triage capacity does. The ingest vs remediation capacity research covers the per-channel inflow accounting and the per-severity-band ratio that warns before the growing-backlog regime appears in queue depth.
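A toy simulation of the growing-backlog regime, with invented figures: when weekly inflow exceeds triage capacity, the untriaged queue and the expected wait at the triage stage both grow while downstream stages look unchanged.

```python
def triage_queue_growth(inflow_per_week, triage_capacity_per_week, weeks):
    """Untriaged queue depth and a rough expected wait, week by week."""
    queue = 0
    history = []
    for week in range(1, weeks + 1):
        queue = max(0, queue + inflow_per_week - triage_capacity_per_week)
        history.append({
            "week": week,
            "untriaged": queue,
            # How many weeks of triage capacity are already queued ahead.
            "expected_wait_weeks": round(queue / triage_capacity_per_week, 1),
        })
    return history

print(triage_queue_growth(inflow_per_week=50, triage_capacity_per_week=40, weeks=4)[-1])
# {'week': 4, 'untriaged': 40, 'expected_wait_weeks': 1.0}
```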

Shrinking backlog: remediation freshness improves

When throughput exceeds inflow, the open queue shrinks. Remediation owners receive findings closer to discovery, when affected code is still fresh in the team's memory and the relevant build is still deployable. Investigation stage cycle time falls because reproduction is easier; remediation stage cycle time falls because regression risk is lower. The dynamic compounds, which is why programmes that get into a shrinking-backlog regime tend to outperform their headcount expectation.

Steady-state mid-backlog: budgeting collides with ambition

Most programmes spend most of their time at a steady-state mid-backlog where neither dynamic dominates. This is the state where the budget conversation is hardest because the closure rate is acceptable but the backlog is not closing. Programmes that publish their twelve-month backlog-versus-throughput curve get the budget conversation onto the same evidence as the audit conversation. The security debt economics research lays out the financial-and-operational accounting that frames this conversation as a working-capital line rather than a backlog count; its four-class debt ledger replaces the single-number backlog debate with a structured working-capital read. The patch cycle vs remediation SLA mismatch research lays out the cadence-mismatch lens: steady-state throughput at the closure stage can still miss the framework SLA window because of vendor patch availability lag, change-window lag, or validation-rescan lag, and reading those three lags alongside the throughput stages shows which lever the programme actually has to pull.

How the engagement record carries throughput

Throughput numbers get cleaner when the cycle-time stages live on the same engagement record the operational work lives on, rather than on a metrics layer that is reconstructed from spreadsheets after the fact. The platform does not set the SLA targets for the programme, but it does make the throughput question reproducible from the live record at any moment between reporting cycles.

SecPortal pairs every finding to a versioned engagement record through findings management. CVSS 3.1 vector, severity band, owner, evidence, and remediation status are captured on the finding record rather than in a separate spreadsheet, so each cycle-time stage is observable from the same place the work is done.14 The activity log captures the timestamped chain of state changes by user, so the elapsed time between triaged, owned, fix designed, fix deployed, retest passed, and closed is a query against the live record rather than a reconstruction from email threads.15

The compliance tracking feature maps findings and controls to ISO 27001, SOC 2, Cyber Essentials, PCI DSS, and NIST frameworks with CSV export, so the SLA-bound closure rate per framework is one query against the same record. The AI report generation workflow produces remediation roadmaps and compliance summaries from the same engagement data, so the leadership read of throughput and the operational read are the same record rather than two independently-edited documents that diverge between reporting cycles.16,17

The remediation tracking workflow and the vulnerability SLA management workflow keep the open-finding queue, the SLA windows, and the closure record on the same engagement record. The vulnerability acceptance and exception management workflow keeps exception closures separate from remediated closures so the residual-risk picture is observable without inflating the throughput number.18,19

For internal security and vulnerability management teams

Internal security teams and vulnerability management leads carry the throughput question between audits. The pattern that survives reporting cycle after reporting cycle is to operate cycle-time discipline in real time, capture stage transitions as a side effect of the work rather than as a separate metrics project, and keep the inflow, throughput, backlog, and exception axes visible on the same record.

  • Report cycle time per stage rather than per finding so the bottleneck is observable in the data.
  • Anchor SLA targets to external references (CISA BOD 22-01, PCI DSS 6.3.3) rather than internal precedent so the audit committee read is unambiguous.
  • Pair throughput against inflow at the same severity bands so the backlog direction is visible.
  • Track exception closure separately from remediated closure so the residual-risk profile is not hidden inside the headline number.
  • Capture re-opens against the original finding identifier so closure durability is measurable.
  • Surface retest cycle time as a leading indicator of future re-opens rather than as administrative overhead.

For internal security teams, vulnerability management teams, AppSec teams, and product security teams, the operating commitment is to keep the throughput question reproducible from the live record at any moment in the reporting cycle, not only at quarterly review week. The aging pentest findings research covers the long-tail accounting that throughput pressure produces when it is not closing fast enough.20

For security leadership and audit committees

Security leaders and audit committees read throughput through a different lens than operational teams. The leadership read is whether the programme is durably moving findings to closed across reporting cycles, not only whether the headline MTTR fell this quarter. A programme that hits the SLA on closures but accumulates exceptions or grows the backlog is technically meeting its commitment and substantively increasing residual risk. The leadership question is which of those two pictures the metric is actually showing.

  • Track in-SLA closure rate, inflow-versus-closure ratio, exception count, and re-open rate as four separate trend lines rather than as one composite score.
  • Read backlog direction over twelve months as a programme health signal independent of in-period closure.
  • Surface exception register growth as a residual-risk indicator alongside the remediation throughput, not separate from it.
  • Ask for cycle-time stage breakdown when in-SLA closure is healthy but the backlog is growing; the stage breakdown shows where the marginal hour should land.
  • Tie throughput numbers to the same engagement record the audit evidence comes from so the leadership read and the audit read are the same record rather than two reports.

The leadership question that drives this discipline is straightforward: if the audit committee asked for current remediation status today, would the answer come from one query against the live record, or from a multi-team metrics-collection sprint? Programmes whose answer is the live record are durably audit-ready. Programmes whose answer is the sprint are accidentally audit-ready and the accidental quality is the residual risk. The audit evidence half-life research covers the evidence-currency side of the same operating discipline.21

The leadership-side platform discipline that supports this is covered on SecPortal for CISOs and security leaders, which describes how findings, remediation, exceptions, retests, and reporting hold the durable read of programme health between reporting cycles rather than only at quarterly review week.

The wider scaffolding the throughput metric sits inside is laid out in the vulnerability management maturity model research: cycle-time stage measurement is the load-bearing distinction between Level 3 (Defined) and Level 4 (Managed) on the remediation governance dimension of the five-by-five grid. Programmes that report cycle-time stage breakdown reproducibly from the live record operate at Level 4 on that dimension; programmes that report a single MTTR figure operate at Level 3 regardless of how recent the dashboard is.

The detection-side counterpart to throughput is covered in the MTTD vs MTTR research. Throughput captures the closure-side cycle time, but the total window during which the asset was exposed is MTTD plus MTTR plus any reopen interval. Programmes that report only MTTR understate the discovery-side latency that decides which findings ever reach the remediation queue; pairing MTTD per channel with MTTR per severity band is what turns the lifecycle reporting into one record rather than two disconnected dashboards.

Conclusion

Vulnerability remediation throughput is a system property of the remediation pipeline, not a single number that can be reported in isolation. The cycle-time stages each have their own bottleneck pattern; the SLA targets are defensible only when anchored to external references; the headline MTTR is the most misleading metric in the field when reported alone; and the inflow, throughput, backlog, and exception axes interact rather than operate independently. Programmes that publish the stage breakdown, the in-SLA closure rate per severity band, the inflow-versus-closure ratio, the exception-to-remediation ratio, and the re-open rate produce a structured operational picture that survives audit scrutiny.1,3,4,5,6,7

Treating throughput as a property of the live engagement record rather than as a metrics layer reconstructed from spreadsheets is the highest-leverage discipline in vulnerability management between audits. It keeps the leadership read and the operational read on the same record, it survives reporting-cycle rotation, and it makes the budget conversation about remediation capacity argued from the same evidence as the audit conversation about SLA performance. The platform you use does not have to write the throughput targets for the programme. It does have to make the throughput question reproducible and the cycle-time chain self-documenting.

Sources

  1. CISA, Binding Operational Directive 22-01: Reducing the Significant Risk of Known Exploited Vulnerabilities
  2. CISA, Known Exploited Vulnerabilities Catalog
  3. PCI Security Standards Council, PCI DSS v4.0 Requirement 6.3.3
  4. ISO/IEC, ISO 27001:2022 Annex A 8.8 Management of Technical Vulnerabilities
  5. NIST, SP 800-40 Rev. 4: Guide to Enterprise Patch Management Planning
  6. NIST, SP 800-53 Revision 5: RA-5 Vulnerability Monitoring and Scanning
  7. AICPA, SOC 2 Trust Services Criteria CC7.1 Detection of Vulnerabilities
  8. NIST, Cybersecurity Framework (CSF) 2.0
  9. CISA, Stakeholder-Specific Vulnerability Categorization (SSVC)
  10. FIRST, EPSS Exploit Prediction Scoring System Documentation
  11. NIST, NVD National Vulnerability Database
  12. NCSC, Vulnerability Management Guidance
  13. OWASP, Vulnerability Management Guide
  14. SecPortal, Findings & Vulnerability Management
  15. SecPortal, Activity Log & Workspace Audit Trail
  16. SecPortal, Compliance Tracking
  17. SecPortal, AI-Powered Security Reports
  18. SecPortal, Remediation Tracking Use Case
  19. SecPortal, Vulnerability SLA Management Use Case
  20. SecPortal Research, Aging Pentest Findings
  21. SecPortal Research, Audit Evidence Half-Life
  22. SecPortal Research, Vulnerability Reopen Rate

Run remediation throughput on the live engagement record

SecPortal keeps findings, cycle-time stages, retests, exceptions, and SLA mappings paired to one versioned engagement record so the throughput question is reproducible at any moment between reporting cycles and the chain does not depend on a metrics layer that diverges from operational reality.