How We Deployed a Secure Conversational AI Assistant in 8 Weeks

Financial services firms require strict standards of accuracy, reliability, and regulatory compliance in their digital experiences. A leading North American financial services firm came to WillowTree to create a next-gen chatbot experience in just eight weeks, and our successful delivery paved the way for our GenAI Jumpstart accelerator — a safe, secure, modular architecture we can adapt to any industry.

Key Takeaways

Conversational AI presents vast opportunities but also serious risks in regulated sectors.
A rigorous and innovative architectural framework of AI-enabled and human-in-the-loop processes, guidelines, and oversight is required for safe deployment.
WillowTree's expertise in ethical AI implementation unlocks cutting-edge functionality while minimizing and mitigating risks.

“This is amazing, in terms of how fast you have put this together. Kudos to you guys.”

Client Product Owner

results

A Superior, Safe Conversational Experience

Jump to next section

In just eight weeks, WillowTree's prototype blew older intent-mapped chatbot capabilities out of the water with its human-like conversational abilities and more expansive range of responses. Importantly, our financial services client gained confidence that generative AI could be deployed safely and responsibly within regulatory requirements.

Our eight-week effort formed the basis for our GenAI Jumpstart accelerator program, a crucial prototyping step in our client’s journey toward deploying a safe and compliant public-facing virtual AI assistant.

“Following a successful delivery with a major financial services institution in North America, we codified our GenAI Jumpstart offering to provide eight weeks of AI development around a particular use case that a client brings to us, viable across any industry.”

Charley Adams

VP, Business Development

GenAI Jumpstart

From whiteboard to working prototype — in just 8 weeks

Learn more

Our client, a leading innovator in financial services, wanted to leverage generative and conversational AI to create a sophisticated banking chatbot for their users. Balancing open-ended conversational abilities with strict security was paramount. They needed a solution that could:

Provide natural language responses to common customer questions
Direct users to relevant bank-specific information across their product lines
Maintain their brand voice and style
Stay within regulatory guardrails (e.g., not providing specific investment advice, etc.)

the vision

A Next-Gen Financial Chatbot

Jump to next section

Traditional bank AI chatbots (that tend to annoy users) rely on intent mapping: a process that offers a limited set of around 250 predefined options and responds with hardcoded messages mapped to those user “intents.”However, intent mapping is rigid and restricted. This limitation makes typical intent-based chatbots frustrating for users wanting to ask natural, open-ended finance questions that don't fit neatly into pre-scripted buckets. For instance, when customers inquire about comparing credit card products or ask for personalized account recommendations, legacy chatbots falter.

The Challenge

The Problem with Legacy Finance Chatbots

Jump to next section

“Our product owner was hoping this effort would prove that large language models were superior to their existing intent mapping chatbots in every way, and he told us verbatim that our team blew intent mapping out of the water.”

Conner Brew

Director, Data & AI Delivery

AI Governance Evangelist

The Risks of Conversational AI in Banking

On the other hand, deploying open-ended conversational AI in banking poses serious challenges. Financial institutions deal with sensitive customer data and face complex compliance requirements from regulatory bodies like the SEC and FINRA. In general, two primary risks emerge when integrating sophisticated conversational AI assistants: Hallucination and Jailbreaking.

So, while conversational AI assistants enable personalized banking experiences, their open-ended nature differs enormously from rigid predefined chatbots limited to narrow topics. Safely adapting generative AI for regulated industries is an immense challenge.

The stakes were high: balancing sophistication and security necessitated tradeoffs and a multilayered technical approach spanning data readiness, large language models (LLMs), systems architecture, and UX/UI design.

WillowTree happily stepped up to the challenge.

WillowTree implemented a modular bank chatbot architecture with two key components: Retrieval Augmented Generation, and “Supervisor” LLM.

This dual-model approach contained the conversational range and specificity of our client’s products and services while allowing sophisticated discussions beyond the limits of predefined chatbot intent mapping.

our solution

WillowTree's Dual-LLM Safety System

Staying Within Guardrails

"I can't sing your praises enough. You got me to drink the kool-aid on WillowTree."

Client Product Owner

Deploying a Secure AI Assistant in 8 Weeks

Key Takeaways

A Superior, Safe Conversational Experience

A Next-Gen Financial Chatbot

The Problem with Legacy Finance Chatbots

The Risks of Conversational AI in Banking

WillowTree's Dual-LLM Safety System

Staying Within Guardrails

More case studies