TL;DR
Most mortgage LOS RFPs are too long, too generic, and don't separate marketing AI from production AI. The 12 questions below put each vendor on the spot to demo real work on your real data. Each comes with the right-answer pattern (what a production-ready vendor sounds like) and the red-flag tell (the response you should read as marketing). Use them in 60-minute working sessions with each finalist rather than a 200-page written response.
How should an operations leader run a tight mortgage LOS RFP?
Three working sessions per finalist. First session: the AI agent demos (questions 1, 3, 7, 10). Second session: integration, compliance, and audit trail (questions 2, 4, 8, 9). Third session: implementation, pricing, and operational fit (questions 5, 6, 11, 12). Cap each at 60 minutes with all the questions on a shared doc. Vendors that can't show real work in this format aren't ready for your shop.
The 12 questions
Question
Show me your AI document agent classifying 30 of our real loans on this call.
What a production-ready answer sounds like
Vendor opens a sandbox, you upload three loans' worth of documents, the agent classifies each in <5 seconds with confidence scores. Vendor commits to the same demo on a larger sample for the proof of concept.
Red flag
We have AI features but a live demo would require a separate engagement. Translation: it's marketing, not production.
Question
What happens to a TRID 3-day timer if your servers restart mid-clock?
What a production-ready answer sounds like
Timers run in a durable workflow engine (Temporal, Cadence, or equivalent). State is persisted at every transition. Server restarts, deploys, and worker crashes do not lose the clock.
Red flag
We have monitoring and alerts. Translation: the timer is a cron job and your CFPB exposure is real.
Question
Walk me through how your platform calculates Schedule C income with depreciation, business use of home, and 50% meals; show the math.
What a production-ready answer sounds like
Vendor opens a 1084-compliant calculator, demonstrates each add-back with the calculation, shows 2-year trending logic, and confirms the same inputs produce identical outputs every time. Math is deterministic, not LLM-generated.
Red flag
Our LLM handles that. Translation: income/employment defects are the #1 critical defect category per ACES; you do not want an LLM in the math path.
Question
How many Encompass fields do you sync, in which directions, and which E-Folder containers do you map?
What a production-ready answer sounds like
Specific number (e.g., 180+), confirmation of bidirectional sync, named E-Folder containers (U1–U15) mapped explicitly. Demo of a document classified by AI and routed to the correct container.
Red flag
We have an Encompass integration. Translation: it might be a one-way export.
Question
What is your average implementation timeline for a lender at our volume; and what's the longest one you've done?
What a production-ready answer sounds like
Specific range (e.g., '30–60 days for hybrid, 8–12 weeks for standalone at 5K loans/year'). Reference customer or scoped statement of work. Honest answer about edge cases that extend timelines.
Red flag
It depends. Translation: the vendor doesn't have a repeatable implementation pattern.
Question
How do you price for a lender with seasonal volume; say, 1,000 loans in Q1 and 3,000 loans in Q3?
What a production-ready answer sounds like
Usage-based pricing scales with funded volume. No per-seat licensing penalty for hiring temp staff in peak season. Annual commitment with overage at a known rate.
Red flag
Per-seat licensing with annual commitment. Translation: you'll pay for inactive users in Q1 and re-negotiate to add temp staff in Q3.
Question
Walk me through QM/ATR verification on a loan where the borrower has 7-year employment history but two recent 30-day employment gaps.
What a production-ready answer sounds like
All 8 ATR factors verified per loan with stored evidence. Specific handling of employment gaps (continuity of income vs. employment, residual income tests). Block-or-disclose workflow gating until factors are documented.
Red flag
Our compliance team would review. Translation: ATR is post-close QC, not workflow architecture.
Question
Where does your immutable audit log live, and can you show me an example entry?
What a production-ready answer sounds like
Append-only ledger in a database with row-level security. Every entry shows operator, timestamp, before/after state, and reasoning where applicable. Demo of an audit query against a live loan record.
Red flag
We log to standard application logs. Translation: regulator-grade audit trails will be reconstructed from logs and inferences. That's a finding waiting to happen.
Question
Can a third-party LLM or BI tool act on loan data through your platform; and how?
What a production-ready answer sounds like
Yes, via Model Context Protocol (MCP) or open API. Specific tools exposed (e.g., 32+ MCP tools across underwriting, document AI, compliance). Authentication and tenant isolation explicitly described.
Red flag
We have an SDK. Translation: each integration is a per-customer engineering project.
Question
Demo your voice AI agent. Call it from my phone right now.
What a production-ready answer sounds like
Vendor provides a number, you call it, the voice AI answers with mortgage-vocabulary fluency, can pull live loan-status data when given a sample loan number, and transfers cleanly to a human with full call context.
Red flag
Voice AI is on the roadmap. Translation: borrower self-service is not in scope this quarter.
Question
What does your post-close QC look like; sampled or 100%?
What a production-ready answer sounds like
100% pre-fund continuous QC: income reproducibility, AUS findings vs. final terms, TRID timing audit, HMDA completeness, ULDD validation against investor specs. Defects route to a review queue before close, not a discovery report after fund.
Red flag
We sample 10% post-fund. Translation: critical defect rate stays at the ACES 1.79% industry average and repurchase exposure compounds.
Question
Show me the contract change request you'd send for a new HMDA edit-check rule the FFIEC published last quarter.
What a production-ready answer sounds like
Configuration change in admin UI, deployed same-day or next-day. No code change required. Demo on a sandbox tenant.
Red flag
That would be a custom development engagement at $X/hour. Translation: every regulatory update is an IT capacity tax.