How would you design a customer support chatbot for an online banking platform (like Capital One) to ensure it provides secure, helpful, and relevant responses?
Architecture: Chatbot frontend + secure backend APIs (no direct DB access).
Security: Enforce authentication (OAuth2, MFA), encrypt data, mask sensitive info, use least privilege.
NLU: Detect intents (e.g., check balance, lost card) and retrieve FAQs safely.
Responses: Use templates for account data, LLM or retrieval for general FAQs, avoid hallucinations.
Masking: Prevent access to future or unrelated customer data.
Monitoring: Log safely, detect misuse, retrain on anonymized data.
Compliance: Follow GDPR, PCI DSS, and banking privacy laws.
As a consultant advising a fintech on using generative AI for fraud detection with a labeled dataset of 50,000 transactions, how would you decide between RAG, prompt engineering, or fine-tuning? What factors guide your choice, and how would your recommendation change if the dataset doubled?
Choice depends on dataset size, task specificity, and performance needs:
Factors guiding choice:
If dataset doubles (~100k):
Let’s say BCG is helping a large e-commerce client develop a generative AI tool that can automatically generate marketing copy and brand visuals by processing both product photos and text reviews (multi-modal), rather than relying on text data alone (unimodal). How would you think about the implications of using a multi-modal model versus a unimodal one in this business setting, and what steps would you take to identify and reduce potential biases in its outputs?
Implications of multi-modal vs unimodal:
Steps to identify and reduce biases:
Summary: Multi-modal models produce richer, aligned outputs but need careful bias auditing; unimodal models are simpler but less context-aware.
You work as a machine learning engineer at Amazon, focusing on product discovery. Your team is tasked with building a system that enables customers to enter a text description, such as “red hiking backpack with water bottle holder”, and retrieve the most relevant product images from Amazon’s vast catalog.
How would you design this system end-to-end?
1. Data Preparation:
2. Feature Representation:
3. Indexing & Retrieval:
4. Ranking & Post-processing:
5. System Design Considerations:
6. Evaluation:
Summary:
Transform both text and images into embeddings (ideally in a shared space), store image embeddings in a vector database, retrieve nearest neighbors for user queries, then rank and filter results for relevance.