What is a data science case interview?

A data science case interview combines standard business case framing with technical data science concepts — including ML model design, A/B testing design and interpretation, metrics diagnosis, and SQL/Python reasoning. Unlike a pure consulting case, you are expected to propose a data-driven solution, select appropriate model types (e.g., logistic regression vs. gradient boosting for churn prediction), and discuss trade-offs like precision vs. recall.

How are BCG X data science interviews different from regular BCG interviews?

BCG X interviews include 3–5 rounds with dedicated technical components: an online Python/SQL coding assessment, a technical case where you frame a business problem as a data science solution, a behavioral PEI round, and optionally a technical presentation. Standard BCG interviews have no coding assessment and use purely business-framing cases. BCG X also evaluates your ability to build — not just advise.

What technical topics come up in McKinsey QuantumBlack interviews?

QuantumBlack interviews cover: Python or R coding, SQL querying, classical machine learning (regression, classification, clustering, random forests, gradient boosting), A/B testing and statistical significance, generative AI fundamentals, and ML system design. The two-hour QuantHub adaptive test covers statistics, data modeling, and coding. Technical Experience Interviews (TEI) go deep on your own ML projects.

How many rounds are in a BCG X data science interview?

BCG X data science interviews typically include 3–5 rounds over 4–6 weeks: (1) recruiter screen, (2) online technical coding assessment in Python, (3) technical case interview combining ML framing and business problem-solving, (4) behavioral/PEI interview, and optionally (5) a final technical presentation round. Exact structure varies by level (Associate, Consultant, Expert).

Do I need to code in a data science consulting interview?

It depends on the firm. BCG X requires live or take-home Python/SQL coding in most roles — the online assessment specifically tests data manipulation and model implementation. McKinsey QuantumBlack is more conceptual in case rounds but includes a rigorous two-hour QuantHub test covering Python, R, and statistics. Bain Vector interviews assess coding through take-home or technical screens but case rounds focus on approach and framing, not syntax.

How do I structure an ML design question in a data science case interview?

Use a 5-step structure: (1) Define the business objective and translate it to an ML task (classification, regression, clustering, ranking). (2) Identify available data sources and key features. (3) Select a model family and justify the choice given data volume, interpretability needs, and latency constraints. (4) Define success metrics (precision, recall, AUC-ROC, RMSE) and explain trade-offs. (5) Outline deployment, monitoring, and potential failure modes.

Data Science Case Interview: BCG X, QuantumBlack & Bain Vector

A data science case interview combines business case framing with technical data science concepts — including ML model design, A/B testing, metrics diagnosis, and SQL or Python reasoning — to assess whether you can translate analytical thinking into strategic recommendations. These interviews are used by consulting firms' specialized data and AI divisions, particularly BCG X (formerly BCG Gamma), McKinsey QuantumBlack, Bain Vector, Deloitte Analytics & Cognitive, and Accenture Data & AI.

If your target is specifically BCG X, start with the dedicated BCG X case interview guide. This page covers the broader data science case format across consulting firms.

Why Data Science Case Interviews Are Different

Standard consulting cases test your ability to structure an ambiguous business problem, perform mental math, and synthesize a recommendation. Data science cases test all of that — plus your ability to:

Frame a business problem as an ML or analytics task
Identify what data you need and where it comes from
Select appropriate statistical or ML methods with justified trade-offs
Design a valid experiment (A/B test or quasi-experiment)
Interpret results and quantify uncertainty

The business framing is still table stakes. Interviewers at BCG X expect you to open a case the same way any consulting candidate would — clarify the objective, ask about the timeline, confirm metrics. The difference is what comes next: you need to propose a technical path forward, not just a strategic one.

Firms That Use Data Science Cases

Firm	Division	Focus
BCG	BCG X (formerly BCG Gamma)	Build & design; embedded analytics products
McKinsey	QuantumBlack	AI-driven strategy; ML engineering
Bain	Bain Vector	Advanced analytics, data-driven transformation
Deloitte	Analytics & Cognitive	Enterprise AI, cloud data platforms
Accenture	Data & AI	Applied AI, automation, data architecture

BCG X differs from the traditional BCG practice because it builds data products, not just advisory decks. McKinsey QuantumBlack applies AI and analytics to enterprise transformations. Bain & Company's data scientist process includes case studies where candidates outline an end-to-end approach to a business problem using data.

The Three Main Case Types

1. Product Analytics Cases

These cases present a metric that has moved — usually dropped — and ask you to diagnose why and propose a fix. The structure mirrors a standard profitability case but adds a data layer.

Example prompt: "Spotify's premium conversion rate dropped 15% over the past 30 days. Walk me through how you'd diagnose this."

A strong answer:

Segment the drop — Is it across all user cohorts or specific ones (geography, device, acquisition channel, tenure)?
Rule out external factors — Competitive pricing changes, a major event, a product bug
Identify the funnel stage — Free trial sign-up? Payment completion? Upgrade prompt view?
Propose data queries — What SQL query would you run first? What would you look for in event logs?
Recommend next steps — If it's a payment completion drop among mobile Android users, what's your hypothesis and experiment design?

These cases appear frequently at QuantumBlack and Bain Vector, where candidates are expected to speak in event-level data terms, not just strategic buckets.

2. A/B Testing and Experimentation Cases

Interviewers present you with an experiment that was run — or ask you to design one — and assess whether you understand:

Randomization and bias: Were treatment and control groups correctly assigned? Is there selection bias?
Statistical power: How large does the sample need to be? What's the minimum detectable effect?
Metric selection: Are you measuring the right primary and guardrail metrics?
Novelty and network effects: Are results inflated by novelty? Does the treatment affect non-participants (network effects in social products)?

Typical question: "We ran a 2-week A/B test on a new checkout flow. Conversion improved 3% with a p-value of 0.04. Should we ship it?"

The "correct" answer is: not necessarily. Is 2 weeks long enough to capture weekly usage cycles? What are the guardrail metrics (AOV, return rate, support tickets)? Was the test correctly randomized at the user level, not session level? This nuanced questioning is exactly what data science interviewers evaluate.

3. ML Model Design Cases

These cases ask you to architect a data science solution for a business problem. You will not code live in most consulting firm rounds, but you must demonstrate that you can:

Frame the ML task correctly (binary classification, multi-class, regression, ranking)
Identify features from real-world data sources
Choose a model family and justify interpretability vs. performance trade-offs
Define evaluation metrics appropriate to the business context

Example prompt: "A major bank wants to predict which SMB customers will churn in the next 90 days. Design the ML solution."

Strong structure: Define churned (account closed, 6+ months dormant?). Features: transaction volume trend, product breadth, login frequency, support call frequency, competitor rate environment. Model: gradient boosting (XGBoost or LightGBM) for tabular data; logistic regression as interpretable baseline for regulatory compliance. Evaluation: AUC-ROC for ranking; precision/recall trade-off depends on cost of false positive (sending retention offer) vs. false negative (losing customer).

Worked Example: Uber Ride Acceptance Rate

Prompt: "Uber's driver ride acceptance rate dropped 8% over the past two weeks in three major cities. Build a diagnostic framework and propose an ML solution to address the underlying driver supply problem."

Step 1: Clarify the problem

What is the acceptance rate denominator — all dispatched requests, or dispatched-to-online-driver requests?
Is the 8% drop uniform across cities or concentrated (surge pricing zones, airport areas, time-of-day)?
Has this drop translated into a rider experience metric change (wait time, cancellation rate)?

Step 2: Diagnose root causes (MECE buckets)

Category	Hypothesis	Data Source
Driver earnings	Surge pricing reduced; earnings per hour declined	Driver earnings reports, surge frequency logs
Competition	Lyft or local competitor surge in driver acquisition	Third-party driver panel data
Demand surge	Rider demand spike without supply response	Dispatch request logs, heat maps
App/tech issue	New app version causing acceptance friction	App version distribution, acceptance latency logs
Driver cohort shift	New drivers have lower acceptance norms	Cohort segmentation by driver tenure

Step 3: Propose an ML solution

Problem framing: Binary classification — predict the probability that a given driver will reject the next ride request given contextual features. Use this score to dynamically adjust dispatch logic (prefer high-acceptance drivers when supply is tight) and trigger proactive incentives before rejection occurs.

Key features:

Driver's rolling 7-day acceptance rate by zone
Current hourly earnings vs. 30-day average
Time to next surge window (model-predicted)
Distance from rider to driver
Ride estimated duration and distance

Model choice: Gradient boosting on tabular data. Low latency required (dispatch must happen in ~2 seconds), so avoid deep learning. Output a probability score at dispatch time.

Success metric: Reduction in unfulfilled ride requests within 5 minutes; driver acceptance rate increase measured via A/B test against current dispatch algorithm.

Step 4: Recommendation

The most likely driver is an earnings decline relative to expectations, compounded by a competitor incentive campaign. The short-term fix is targeted surge bonuses for high-acceptance drivers in the affected zones. The long-term fix is the predictive acceptance model integrated into dispatch, expected to reduce rider wait time by 15–20% in constrained supply conditions.

What BCG X, QuantumBlack, and Bain Vector Actually Assess

BCG X

BCG X candidates should expect role-specific screening. Data science and analytics roles may include Python, SQL, or statistics; product and design roles may emphasize product thinking and implementation trade-offs; client-facing roles can include a technical case. What differentiates top BCG X candidates is the ability to think like a product builder, not just a strategy advisor: if you were building this model in production, what would you actually do?

McKinsey QuantumBlack

QuantumBlack's process includes a rigorous two-hour QuantHub adaptive test covering Python, R, statistics, and data modeling — followed by Technical Experience Interviews (TEI) that go deep on your own ML work, and a case interview applying AI to a business problem. The Personal Experience Interview (PEI) mirrors standard McKinsey behavioral rounds.

Bain Vector

Bain Vector interviews follow a similar arc to BCG X but tend to weight the business case component more heavily. Candidates report a technical screen (Python, ML theory), a business case study where you outline your analytical approach end-to-end, and a behavioral round focused on past analytical impact.

Technical Concepts to Review

Before your interview, you should be comfortable explaining these concepts clearly to a non-technical interviewer:

Concept	Why It Comes Up
A/B testing design	Core experimentation skill for product analytics cases
Precision vs. recall trade-off	Required for churn, fraud, and recommendation ML cases
Statistical significance and p-values	A/B test interpretation questions
Gradient boosting vs. logistic regression	Model selection justification
Feature engineering	ML design cases; showing you understand real-world data
SQL window functions	BCG X technical assessment; product analytics cases
Confusion matrix	Any classification ML design case
Bias-variance trade-off	Asked in TEI-style deep dives at QuantumBlack

Review resources: Interview Query's data science guides, DataLemur for SQL practice, and Towards Data Science case study walkthroughs.

30-Day Prep Plan

Week 1 — Consulting case fundamentals

Practice 2 standard business cases per day using the profitability framework and market entry framework
Review hypothesis-driven case approach
Read your target firm's public case examples (McKinsey's GlobaPharm is excellent for data-heavy cases)

Week 2 — Technical foundations

Review statistical significance, power calculations, and A/B test design
Complete 5 SQL problems on DataLemur (window functions, aggregations, subqueries)
Study precision/recall, AUC-ROC, and confusion matrices
Practice explaining gradient boosting in plain English

Week 3 — Data science case practice

Solve 2 product analytics cases (e.g., "DAU dropped 20% — diagnose")
Practice 2 A/B testing interpretation questions with edge cases (novelty effects, network effects)
Complete 2 ML model design cases end-to-end
Time yourself: target 5 minutes for problem framing, 10 minutes for analysis, 3 minutes for synthesis

Week 4 — Firm-specific prep and mock interviews

BCG X: complete Python data manipulation exercises; practice technical case framing
QuantumBlack: take 1-2 QuantHub practice tests; prepare TEI stories from your own ML work
Bain Vector: practice business impact storytelling with quantified results
Run 2 full mock interviews with a partner using the practice partner guide

Technical Answer Checks

Case prompt	Strong answer checks
A/B test shows +4% conversion with p=0.03	Check guardrails such as retention, support tickets, refund rate, and sample quality before shipping
Churn prediction model where false negatives are costly	Optimize for recall, then explain the false-positive cost of retention offers
Fraud detection in a regulated bank	Compare interpretability and performance; logistic regression may be the baseline, gradient boosting may win with explainability controls
Product metric dropped 20%	Segment by cohort, acquisition source, device, geography, and product release before proposing a model

Checklist

Execution checklist

Reviewed hypothesis-driven case structure
Data science cases still require strong business framing before technical proposals
Practiced 3+ A/B test design questions including edge cases
Experimentation design is the single most commonly tested concept across BCG X, QuantumBlack, and Bain Vector
Can explain precision/recall trade-off with a business example
ML design cases require fluency in evaluation metrics in business terms, not just statistical definitions
Completed SQL window function exercises
BCG X technical assessment specifically tests SQL on real-world data manipulation tasks
Prepared 2 TEI stories from own ML work with quantified business impact
QuantumBlack's Technical Experience Interview goes deep on your personal project history
Practiced synthesizing a 2-minute ML design recommendation
Consulting firms assess communication quality — your solution must be understood by a non-technical partner

For foundational case interview skills that underpin data science cases, review:

Data Science Case Interview: Complete Prep Guide for BCG X, McKinsey QuantumBlack, and Bain Vector (2026)

Why Data Science Case Interviews Are Different

Firms That Use Data Science Cases

The Three Main Case Types

1. Product Analytics Cases

2. A/B Testing and Experimentation Cases

3. ML Model Design Cases

Worked Example: Uber Ride Acceptance Rate

Step 1: Clarify the problem

Step 2: Diagnose root causes (MECE buckets)

Step 3: Propose an ML solution

Step 4: Recommendation

What BCG X, QuantumBlack, and Bain Vector Actually Assess

BCG X

McKinsey QuantumBlack

Bain Vector

Technical Concepts to Review

30-Day Prep Plan

Technical Answer Checks

Execution checklist

Sources (checked June 29, 2026)

Frequently asked questions

Keep reading

Data Science Case Interview: Complete Prep Guide for BCG X, McKinsey QuantumBlack, and Bain Vector (2026)

Why Data Science Case Interviews Are Different

Firms That Use Data Science Cases

The Three Main Case Types

1. Product Analytics Cases

2. A/B Testing and Experimentation Cases

3. ML Model Design Cases

Worked Example: Uber Ride Acceptance Rate

Step 1: Clarify the problem

Step 2: Diagnose root causes (MECE buckets)

Step 3: Propose an ML solution

Step 4: Recommendation

What BCG X, QuantumBlack, and Bain Vector Actually Assess

BCG X

McKinsey QuantumBlack

Bain Vector

Technical Concepts to Review

30-Day Prep Plan

Technical Answer Checks

Related Guides

Sources (checked June 29, 2026)

What is a data science case interview?

How are BCG X data science interviews different from regular BCG interviews?

What technical topics come up in McKinsey QuantumBlack interviews?

How many rounds are in a BCG X data science interview?

Do I need to code in a data science consulting interview?

How do I structure an ML design question in a data science case interview?

Keep reading