Lesson 17 of 25

Auditing Transaction-Monitoring Systems: Coverage and Tuning

4 min read · CAMS-Audit

Audit TM coverage, thresholds, and the crucial above- and below-the-line testing that reveals whether tuning misses real risk, plus the alert-handling patterns that signal investigators are mass-closing.

What the TM system must do

Detect potentially suspicious activity across products
Scenarios, thresholds, and alerts that feed investigations
Coverage must match the institution's real risks
Audit tests coverage, calibration, and handling

The transaction-monitoring system, or TM, is the engine that scans activity for potentially suspicious patterns and generates alerts for investigation. Auditing it is one of the most technical and most tested fieldwork areas. A TM system works through scenarios, the patterns it looks for, like rapid movement of funds or structuring, with thresholds that decide when behavior is alert-worthy.

Your audit tackles three layers: coverage, does the system watch for the right risks across all products and channels; calibration, are the scenarios and thresholds tuned correctly; and handling, are the alerts it produces investigated and dispositioned well. A weakness in any layer means real suspicious activity can slip through unseen.

Coverage and scenario completeness

Map scenarios to the institution's risk assessment
Every product, channel, and typology covered?
Are new products onboarded into monitoring?
Coverage gaps = blind spots by design

Begin with coverage, because the worst failure is a blind spot the system was never built to see. Map the active scenarios against the institution's risk assessment and ask whether every material risk has monitoring behind it. Is each product, channel, and known typology covered, wires, cash, ACH, trade finance, crypto on-ramps?

A frequent finding is a new product that launched but was never connected to monitoring, so months of activity flowed by unwatched. Coverage gaps are blind spots by design: no amount of tuning helps if the system simply isn't looking at a category of activity. Reconciling the scenario inventory to the product and risk inventory is how you find these holes.

Tuning: above and below the line

Thresholds split activity into alerts and non-alerts
Above-the-line — sample alerts: are they productive?
Below-the-line — sample non-alerts: did we miss real risk?
Below-the-line testing is the crucial, often-skipped half

Now tuning, and this is where the exam gets specific. A threshold draws a line: activity above it generates an alert, activity below it doesn't. Audit tests both sides.

Above-the-line testing samples the alerts the system did generate and asks whether they were productive and well-handled, or whether the system is drowning analysts in noise. Below-the-line testing is the crucial, often-skipped half: you deliberately sample activity that fell just below the threshold, the transactions that did not alert, and investigate whether any were actually suspicious. If below-the-line testing turns up genuine suspicious activity the system ignored, the threshold is set too high and is missing real risk.

Tuning without below-the-line testing is only half an audit.

Alert handling and disposition quality

Are alerts investigated thoroughly and on time?
Quality of disposition, not just the backlog number
Watch for mass-closing to clear queues
Escalation to SAR decisioning works?

Finally, audit how alerts are handled once generated. Look beyond the headline backlog number to the quality of the investigation: was each alert reviewed thoroughly, with the customer context considered, or rubber-stamped closed? A dangerous pattern is mass-closing, analysts clearing alerts in seconds to hit productivity targets, which you can spot by re-performing a sample and checking time-stamps and narrative depth.

Confirm that alerts which should escalate actually flow into the suspicious-activity-report decisioning process rather than dying in the queue. Productivity metrics that look great alongside thin, identical disposition narratives are a red flag the exam loves: efficiency bought by skipping the actual analysis.

Change management and data feeds into TM

Were scenario or threshold changes tested and approved?
Bad data starves even a well-built system
Reconcile transaction feeds into the monitoring engine
A change with no testing can silently disable detection

Two adjacent areas can quietly break even a well-designed monitoring system, so audit them. The first is change management. Every change to a scenario or a threshold should be tested and formally approved before it goes live, because a careless tuning change can silently disable detection, raising a threshold so high that an entire class of suspicious activity stops alerting.

Ask to see the change records: was each change risk-assessed, tested, and approved, or did someone adjust a parameter on a hunch? The second is the data feeding the system, which we'll cover in depth later but flag here because it's foundational. A monitoring engine can only analyze what it receives, so if a transaction feed drops records or arrives malformed, the system shows green while sitting blind.

Reconcile the source transaction data against what actually reached the monitoring engine and look for gaps. A perfectly tuned scenario starved of complete data is just as useless as no scenario at all, and that failure mode is invisible unless you test for it.

Recap and next

Test coverage, tuning, and alert handling
Reconcile scenarios to the risk and product inventory
Above- and below-the-line testing — don't skip below
Next — model risk management and validation (SR 11-7)

Recapping: auditing transaction monitoring means testing coverage, tuning, and alert handling. Reconcile scenarios to the risk and product inventory to find blind spots, run both above-the-line and the crucial below-the-line testing to check whether thresholds miss real risk, and judge alert handling on disposition quality, not just backlog size. And don't overlook change management and the data feeds, because an untested tuning change or a dropped feed can silently disable detection while the system still shows green.

The TM system is also a model, which brings us to our next, closely related topic: model risk management and independent validation under SR eleven dash seven. Test yourself on transaction-monitoring auditing first.

Sources

FFIEC BSA/AML Examination Manual — Transaction Monitoring and Suspicious Activity Monitoring/Reporting
FinCEN guidance on suspicious activity monitoring
interagency statements on model risk management (cross-ref SR 11-7)

Test your knowledge

A few CAMS-Audit questions on this material — pick an answer to see the explanation.

Q1. An auditor reconciles the core banking system's transaction count for the prior quarter to the records that entered the AML monitoring platform. Core banking shows 1,200,000 transactions; the monitoring platform received 950,000. What is the finding?
Q2. An auditor reconciles the institution's AML risk assessment against the product catalog and finds that the crypto-custody product launched eight months ago is not included in the risk assessment. What is the finding?
Q3. An AML monitoring scenario was last calibrated when the institution's correspondent-banking portfolio was primarily US-dollar flows. Since then the portfolio has shifted heavily toward high-risk-jurisdiction currencies. No re-calibration has occurred. The below-the-line test finds 15 genuinely suspicious items that did not alert. What is the primary finding?
Q4. The IIA expects internal audit functions to undergo an external quality assessment on what minimum cycle?

Ready to practice?

Put this lesson to work on real CAMS-Audit questions.

Drill the full CAMS-Audit bank →