Lesson 07 of 15
Transaction Monitoring at Scale and Model Risk
5 min read · AML·FT
See how the alert-to-SAR funnel works on high-volume, real-time rails — and where fintechs fail: loose thresholds, alert backlogs, and coverage gaps. Learn to govern your monitoring system as a model under SR 11-7 and tune it with above/below-the-line testing.
Monitoring is how suspicion gets found
- Transaction monitoring surfaces activity that may be suspicious
- It feeds the SAR obligation (31 CFR 1022.320 / 1020.320)
- FinTech challenge: huge volume, real-time, automated rails
Transaction monitoring is the machinery that turns a stream of payments into a short list of things a human should look at. It's how suspicion actually gets found, and it feeds the legal duty to file a suspicious activity report, the SAR rules at thirty-one C-F-R ten-twenty-two point three-twenty for MSBs and ten-twenty point three-twenty for banks. At a traditional bank, monitoring runs over modest, mostly batch volumes.
A fintech monitors enormous, real-time flows across automated rails, often with thin margins that can't absorb a big manual review team. That combination, high volume, low friction, and the pressure to keep transactions instant, is exactly what makes monitoring at scale hard, and what this lecture is about.
How monitoring works: scenarios and the funnel
- Rules/scenarios flag patterns: structuring, velocity, anomalies
- Flags become alerts; alerts get investigated
- Investigated alerts may escalate to a SAR decision
- The funnel: transactions → alerts → cases → SARs
Most monitoring starts with scenarios: rules that flag patterns associated with laundering, like structuring just under reporting thresholds, rapid movement of funds in and out, velocity spikes, payments inconsistent with the customer's profile, or links to high-risk geographies. Increasingly, fintechs add behavioral and machine-learning models on top. A flag becomes an alert; an alert gets investigated by an analyst who pulls the customer's history and context; and a meaningful investigation may escalate into a case and, ultimately, a decision about whether to file a SAR.
Picture it as a funnel: millions of transactions narrow to thousands of alerts, narrow to hundreds of cases, narrow to the filings that go to FinCEN. Every stage of that funnel is a place a fintech can quietly fail, by missing the pattern, by drowning in false positives, or by investigating badly.
Where fintechs get it wrong
- Thresholds tuned for conversion, not detection
- Alert backlogs nobody can clear — alerts age out
- Coverage gaps: a new product with no scenarios
- "Set and forget" rules that never get re-tuned
The failure modes are predictable. First, thresholds set too loose, sometimes tuned to avoid annoying customers rather than to catch crime, so genuinely suspicious activity sails through below the line. Second, the opposite problem at scale: too many false positives, generating an alert backlog that a thin team can never clear, so alerts age out uninvestigated, which is its own violation.
Third, coverage gaps, the fintech ships a new product, a crypto on-ramp, a new payout corridor, a lending feature, and no monitoring scenario ever gets written for it, so an entire money flow is invisible. And fourth, 'set and forget,' rules deployed at launch and never re-tuned as typologies and volumes change. Examiners look hard at all four: are your thresholds defensible, can you clear your alerts, does every product have coverage, and do you re-tune.
Monitoring systems are models — SR 11-7 applies
- A monitoring/scoring system is a 'model' under SR 11-7
- Three pillars: sound development, independent validation, ongoing use
- Validate before relying; re-validate periodically
- Document data inputs, assumptions, and limitations
Here's a point that catches data-driven fintechs by surprise: your monitoring system is a model, and regulators expect it to be governed like one. The Federal Reserve and OCC supervisory guidance on model risk management, known as S-R eleven dash seven, sets the standard. It rests on a few core ideas.
First, sound model development, with documented data inputs, assumptions, and known limitations. Second, independent validation, an objective review, by people who didn't build the model, that the model works as intended before you rely on it, and re-validation on a regular cycle. Third, ongoing monitoring of the model's performance in production.
The more a fintech leans on machine learning to score risk, the more this matters, because a black-box model that flags or clears customers with no validation and no explainability is a model-risk finding waiting to happen.
Tuning, testing, and explainability
- Above/below-the-line testing to set defensible thresholds
- Track productivity: alert-to-case-to-SAR conversion
- Re-tune as products, volumes, and typologies change
- Keep the logic explainable to an examiner
So how do you run monitoring well at scale? You tune thresholds with evidence, not vibes, using above-the-line and below-the-line testing: sample alerts just above your threshold to confirm they're worth investigating, and sample activity just below it to confirm you're not missing real suspicion. You track productivity metrics, what share of alerts become cases, what share of cases become SARs, to catch a system that's all noise or, worse, all silence.
You re-tune on a schedule and whenever the business changes. And throughout, you keep the logic explainable, because when an examiner asks 'why did this alert fire and that one not,' or 'why did the model clear this customer,' you need an answer better than 'the algorithm decided.' Defensible thresholds plus explainable models plus a clear-able queue is what good fintech monitoring looks like.
Recap and self-check
- Monitoring funnels transactions → alerts → cases → SARs
- Common gaps: loose thresholds, backlogs, missing coverage
- Your system is a model — govern it under SR 11-7
- Tune with above/below-the-line testing; keep it explainable
Let's recap. Transaction monitoring is the funnel that turns raw payment volume into investigated alerts and, ultimately, suspicious activity reports under the SAR rules. The fintech failure modes are loose thresholds, alert backlogs that age out, coverage gaps on new products, and rules that never get re-tuned.
And the principle that ties it together: your monitoring system is a model, so govern it under S-R eleven dash seven with sound development, independent validation, and ongoing monitoring, tuned with above-the-line and below-the-line testing. Self-check: can you clear your alert queue, does every money-moving product have a scenario, and has anyone independently validated your scoring model? Three honest 'yeses' is a strong program.
Next, we shift from laundering detection to sanctions: OFAC screening for fintechs.
Sources
- Bank Secrecy Act / 31 CFR Chapter X
- SAR rules (31 CFR 1022.320
- 1020.320)
- Federal Reserve / OCC SR 11-7, Supervisory Guidance on Model Risk Management
- FFIEC BSA/AML Examination Manual (suspicious activity monitoring)
Test your knowledge
A few AML·FT questions on this material — pick an answer to see the explanation.
Q1. A neobank uses a third-party identity verification vendor for all digital onboarding. Which statement best describes how AML responsibility is allocated between the neobank and the vendor?
Q2. What is the primary purpose of transaction monitoring in a fintech AML program?
Q3. A fintech's rule-based transaction monitoring system generates 500 alerts per day but the investigations team can only review 50. What is the most appropriate long-term remediation?
Q4. An MSB fintech's customer withdraws $7,000 in cash on Monday and $5,000 on Wednesday of the same week, using the same account. Which BSA requirement is most relevant?