Probability & Statistics for Data Science
1️⃣ Probability Foundations
Note
The Top Tech Company Angle (Probability Foundations): This is the language of uncertainty — it’s how algorithms reason about random events, distributions, and unseen data. Interviewers often test your ability to connect probabilistic reasoning to real-world ML behavior (e.g., “What does model confidence actually mean?” or “How does dropout relate to probability?”).
1.1: Understand Random Variables & Sample Spaces
- Learn the difference between discrete and continuous random variables.
- Understand sample spaces, events, and event operations (union, intersection, complement).
- Grasp the basics of probability axioms and Kolmogorov’s rules.
Deeper Insight: Expect questions like, “If two events are independent, what does that mean intuitively and mathematically?” or “Can mutually exclusive events be independent?”
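The independence question above can be checked by brute force on a tiny sample space. A minimal sketch (the die and the events are my own illustrative choices), using exact arithmetic via `fractions`:

```python
from fractions import Fraction

# Sample space of one fair six-sided die, uniform measure.
omega = {1, 2, 3, 4, 5, 6}

def prob(event):
    """P(event) = |event| / |omega| under the uniform measure."""
    return Fraction(len(event & omega), len(omega))

A = {2, 4, 6}   # "even"
B = {1, 2}      # "at most 2"
C = {1, 3, 5}   # "odd" -- mutually exclusive with A

# Independence means P(A ∩ B) == P(A) * P(B):
print(prob(A & B) == prob(A) * prob(B))   # True
# Mutually exclusive events with positive probability are never independent:
print(prob(A & C), prob(A) * prob(C))     # 0 1/4
```

This also answers the second question: if A and C cannot co-occur, P(A ∩ C) = 0, which can only equal P(A)·P(C) when one of the events has probability zero.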
1.2: Conditional Probability & Independence
- Understand conditional probability: $P(A \mid B) = \frac{P(A \cap B)}{P(B)}$, defined when $P(B) > 0$.
- Grasp independence and conditional independence — crucial for ML concepts like Naïve Bayes or Bayesian Networks.
- Learn the chain rule of probability and the law of total probability.
Probing Question: “Why is conditional independence the cornerstone of graphical models?” or “How do you compute joint probabilities for dependent events?”
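The definition of conditional probability can be verified by enumerating a small joint sample space. A sketch with two fair dice (the events are illustrative choices of mine):

```python
from fractions import Fraction
from itertools import product

# Sample space: all 36 ordered rolls of two fair dice.
omega = list(product(range(1, 7), repeat=2))

def prob(pred):
    """P(event) by counting outcomes that satisfy the predicate."""
    return Fraction(sum(1 for w in omega if pred(w)), len(omega))

def cond_prob(pred_a, pred_b):
    """P(A | B) = P(A and B) / P(B)."""
    return prob(lambda w: pred_a(w) and pred_b(w)) / prob(pred_b)

sum_is_8 = lambda w: w[0] + w[1] == 8
first_is_6 = lambda w: w[0] == 6

print(prob(sum_is_8))                   # 5/36 unconditionally
print(cond_prob(sum_is_8, first_is_6))  # 1/6 -- conditioning changes the odds
```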
1.3: Bayes’ Theorem & Bayesian Reasoning
- Derive Bayes’ theorem: $P(A \mid B) = \frac{P(B \mid A)\,P(A)}{P(B)}$
- Understand priors, likelihoods, and posteriors.
- Study Bayesian vs Frequentist interpretations — be ready to defend both approaches.
Deeper Insight: In interviews, they often link Bayes’ theorem to spam detection or A/B test updates — “How would you update your belief after new data?”
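A numeric sketch of exactly that kind of belief update, in the style of a rare-condition diagnostic test (all numbers below are invented for illustration):

```python
# Illustrative numbers (not from the text): a rare condition, a decent test.
prior = 0.01          # P(condition)
sensitivity = 0.95    # P(positive | condition)
false_pos = 0.05      # P(positive | no condition)

# Law of total probability gives the evidence term P(positive):
evidence = sensitivity * prior + false_pos * (1 - prior)
posterior = sensitivity * prior / evidence   # Bayes' theorem

print(round(posterior, 4))  # 0.161 -- a positive test still leaves ~84% doubt
```

The counterintuitive smallness of the posterior, despite a 95%-sensitive test, is the base-rate effect interviewers usually want you to articulate.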
1.4: Combinatorics & Counting
- Learn permutations, combinations, and binomial coefficients.
- Understand sampling with/without replacement — a favorite in probability puzzles.
Probing Question: “If you pick 3 cards from a deck, what’s the probability that they’re all face cards?” This tests your combinatorial reasoning speed and clarity.
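The face-card question falls straight out of binomial coefficients: choose 3 of the 12 face cards over choose 3 of 52. A quick check:

```python
from math import comb
from fractions import Fraction

# 12 face cards (J, Q, K in 4 suits) in a 52-card deck; draw 3 without replacement.
p_all_face = Fraction(comb(12, 3), comb(52, 3))

print(p_all_face)         # 11/1105
print(float(p_all_face))  # ≈ 0.00995
```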
2️⃣ Probability Distributions
Note
The Top Tech Company Angle (Distributions): Distributions form the backbone of simulation, modeling uncertainty, and even defining loss functions (e.g., a Gaussian noise assumption corresponds to the L2 loss). You’ll need to identify which distribution fits a process and derive expectations or variances under time pressure.
2.1: Core Discrete Distributions
- Learn Bernoulli, Binomial, Poisson, and Geometric distributions.
- For each, know their PMF, expected value, and variance.
- Understand their use cases (e.g., Binomial → success/failure trials).
Probing Question: “If a rare event happens twice in 1000 trials, what distribution models it?”
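The intended answer is the Poisson, as the limit of the Binomial for small p and large n. A numeric sanity check, with parameters chosen to match the question's setup:

```python
from math import comb, exp, factorial

n, p, k = 1000, 0.002, 2   # "rare event happens twice in 1000 trials"
lam = n * p                 # Poisson rate: lambda = n * p = 2

binom_pmf = comb(n, k) * p**k * (1 - p)**(n - k)
poisson_pmf = exp(-lam) * lam**k / factorial(k)

print(round(binom_pmf, 4), round(poisson_pmf, 4))  # both ≈ 0.27
```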
2.2: Core Continuous Distributions
- Learn Uniform, Normal (Gaussian), Exponential, and Gamma distributions.
- Understand PDF and CDF relationships.
- Study standardization (z-scores) and properties of the Normal distribution.
Deeper Insight: Interviewers love “CLT intuition” — why sums of random variables tend toward Gaussian.
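That CLT intuition is easy to demonstrate by simulation: sums of Uniform(0, 1) draws have mean n/2 and variance n/12, and their distribution looks Gaussian quickly. A sketch (sample sizes are arbitrary choices):

```python
import random, statistics

random.seed(0)

# Each observation is a sum of 30 Uniform(0, 1) draws; the CLT says these
# sums are approximately Normal(mean = 15, variance = 30/12).
sums = [sum(random.random() for _ in range(30)) for _ in range(20_000)]

print(round(statistics.mean(sums), 2))   # close to 15.0
print(round(statistics.stdev(sums), 2))  # close to sqrt(30/12) ≈ 1.58
```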
2.3: Joint, Marginal, and Conditional Distributions
- Define and manipulate joint distributions $P(X, Y)$.
- Compute marginals and conditionals.
- Learn covariance and correlation, and why “zero correlation” ≠ “independence.”
Probing Question: “Given Cov(X, Y) = 0, are X and Y independent?”
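The standard counterexample for this question: take X symmetric around 0 and Y = X². Y is a deterministic function of X, yet the covariance is exactly zero. A minimal check (covariance computed from the definition, not a library call):

```python
# X symmetric around 0, Y = X^2: perfectly dependent, yet Cov(X, Y) = 0.
xs = [-2, -1, 0, 1, 2]
ys = [x ** 2 for x in xs]

mx = sum(xs) / len(xs)
my = sum(ys) / len(ys)
cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)

print(cov)  # 0.0 -- correlation misses the (nonlinear) dependence entirely
```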
3️⃣ Statistical Inference & Estimation
Note
The Top Tech Company Angle (Inference): Statistical inference underpins model evaluation — understanding bias, variance, and confidence in predictions. Expect interview questions connecting hypothesis testing or confidence intervals to ML validation.
3.1: Sampling & Estimation
- Learn sample mean, variance, standard error, and sampling distributions.
- Study Law of Large Numbers (LLN) and Central Limit Theorem (CLT).
- Understand point vs interval estimation.
Deeper Insight: The CLT often appears as: “Why is normality assumed in so many models?”
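The LLN/CLT story predicts a concrete rate: the standard error of the sample mean shrinks like 1/√n. A simulation sketch using Uniform(0, 1) draws, for which σ = √(1/12) ≈ 0.289 (sample sizes and repetition counts are arbitrary):

```python
import random, statistics

random.seed(1)

def sample_mean(n):
    return statistics.mean(random.random() for _ in range(n))

# Estimate the spread of the sample mean at several sample sizes.
se = {}
for n in (10, 100, 1000):
    means = [sample_mean(n) for _ in range(2000)]
    se[n] = statistics.stdev(means)
    print(n, round(se[n], 3))  # shrinks roughly like sqrt(1/12) / sqrt(n)
```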
3.2: Maximum Likelihood Estimation (MLE)
- Derive MLE for simple distributions (Bernoulli, Gaussian).
- Understand the intuition — choosing parameters that maximize the likelihood of observed data.
- Learn numerical optimization for MLE (e.g., gradient-based methods).
Probing Question: “What if the likelihood is non-convex? How would you ensure convergence?”
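For Bernoulli data the MLE has a closed form (the sample mean); a sketch comparing it against a brute-force grid search over the log-likelihood (the data are invented for illustration):

```python
from math import log

data = [1, 0, 1, 1, 0, 1, 1, 0, 1, 1]   # invented Bernoulli observations

def log_likelihood(p):
    return sum(log(p) if x else log(1 - p) for x in data)

mle_closed = sum(data) / len(data)          # closed form: sample mean
grid = [i / 1000 for i in range(1, 1000)]   # open interval avoids log(0)
mle_grid = max(grid, key=log_likelihood)    # brute-force numerical check

print(mle_closed, mle_grid)  # 0.7 0.7
```

For non-convex likelihoods (the probing question), the grid/multi-start idea generalizes: run the optimizer from several initializations and keep the best, since gradient methods only guarantee a local optimum.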
3.3: Hypothesis Testing
- Learn the null and alternative hypothesis framework.
- Understand p-values, significance levels, Type I/II errors.
- Practice with z-tests, t-tests, chi-square tests, and ANOVA.
Deeper Insight: Expect A/B testing questions — “What’s your decision if the p-value is 0.06?”
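For the A/B framing, a permutation test computes a p-value with no distributional assumptions at all: shuffle the group labels and ask how often the shuffled difference in means beats the observed one. A sketch with made-up control/variant samples:

```python
import random, statistics

random.seed(0)

def perm_test(a, b, n_perm=10_000):
    """Two-sided permutation test for a difference in means."""
    observed = abs(statistics.mean(a) - statistics.mean(b))
    pooled = a + b
    count = 0
    for _ in range(n_perm):
        random.shuffle(pooled)
        diff = abs(statistics.mean(pooled[:len(a)]) -
                   statistics.mean(pooled[len(a):]))
        if diff >= observed:
            count += 1
    return count / n_perm

control = [12, 11, 13, 10, 12, 11, 12, 13]   # invented A/B data
variant = [14, 15, 13, 16, 14, 15, 14, 13]

print(perm_test(control, variant))  # tiny p-value: difference unlikely under H0
```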
3.4: Confidence Intervals
- Learn how to build confidence intervals for means and proportions.
- Connect confidence levels (95%) with sampling variability.
- Understand bootstrapping as a non-parametric alternative.
Probing Question: “If your confidence interval includes zero, what does it mean for your hypothesis?”
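A percentile-bootstrap interval for the mean, as a sketch (the data are invented; 10,000 resamples):

```python
import random, statistics

random.seed(3)
data = [2.1, 2.5, 1.9, 3.0, 2.7, 2.2, 2.8, 2.4, 2.6, 2.3]   # invented sample

# Percentile bootstrap: resample with replacement, keep the middle 95%.
boot_means = sorted(
    statistics.mean(random.choices(data, k=len(data))) for _ in range(10_000)
)
lo, hi = boot_means[249], boot_means[9749]   # 2.5th / 97.5th percentiles

print(round(lo, 2), round(hi, 2))  # roughly (2.25, 2.65)
```

No normality assumption is needed, which is why bootstrapping is the go-to non-parametric alternative mentioned above.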
4️⃣ Correlation, Regression & Association
Note
The Top Tech Company Angle (Regression & Association): These concepts link statistics to machine learning. Understanding correlation vs causation, bias, and error decomposition shows deep data literacy.
4.1: Covariance, Correlation, and Their Pitfalls
- Learn formulas for covariance and correlation.
- Understand spurious correlation and Simpson’s paradox.
Probing Question: “Why doesn’t correlation imply causation? Give an example from data science.”
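Simpson’s paradox in code, using the classic kidney-stone-style numbers often quoted in textbooks: treatment A wins within every subgroup, yet loses in the aggregate because it was assigned the harder cases more often.

```python
from fractions import Fraction

# Classic textbook-style numbers: (successes, trials) per treatment.
small_stones = {"A": (81, 87),  "B": (234, 270)}
large_stones = {"A": (192, 263), "B": (55, 80)}

def rate(s, n):
    return Fraction(s, n)

# Treatment A wins within EACH subgroup...
for group in (small_stones, large_stones):
    assert rate(*group["A"]) > rate(*group["B"])

# ...yet loses in the aggregate, because group sizes act as a confounder.
total_a = rate(81 + 192, 87 + 263)
total_b = rate(234 + 55, 270 + 80)
print(total_a < total_b)  # True: the comparison flips
```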
4.2: Simple Linear Regression (Statistical View)
- Derive regression coefficients via least squares.
- Understand residuals, R², and assumptions (linearity, normality, independence).
- Contrast statistical regression vs machine learning regression.
Deeper Insight: Be ready to discuss how “violating assumptions” (e.g., heteroscedasticity) affects reliability.
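The least-squares derivation above reduces to two textbook formulas: β = Cov(x, y)/Var(x) and α = ȳ − βx̄. A sketch on a small invented dataset, with R² computed from the residuals:

```python
xs = [1, 2, 3, 4, 5]
ys = [2.1, 4.2, 5.9, 8.1, 9.9]   # invented data, roughly y = 2x

def mean(v):
    return sum(v) / len(v)

mx, my = mean(xs), mean(ys)
# Textbook least squares: beta = Cov(x, y) / Var(x), alpha = y-bar - beta * x-bar
beta = (sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        / sum((x - mx) ** 2 for x in xs))
alpha = my - beta * mx

residuals = [y - (alpha + beta * x) for x, y in zip(xs, ys)]
ss_res = sum(r * r for r in residuals)
ss_tot = sum((y - my) ** 2 for y in ys)
r_squared = 1 - ss_res / ss_tot

print(round(beta, 3), round(alpha, 3), round(r_squared, 4))  # ≈ 1.95 0.19 0.9988
```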
5️⃣ Advanced Topics for Data Science Interviews
Note
The Top Tech Company Angle (Advanced Stats): At senior levels, interviews test whether you can reason about uncertainty quantitatively — from Bayesian thinking to variance decomposition and experimental design.
5.1: Bayesian Inference & Priors
- Learn conjugate priors (e.g., Beta-Binomial, Normal-Normal).
- Understand posterior predictive distributions.
- Practice simple Bayesian updates by hand.
Probing Question: “If you have little data, how does your choice of prior affect the posterior?”
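A Beta-Binomial update by hand, as suggested above (prior and data are invented): conjugacy means the posterior stays in the Beta family, so the update is pure arithmetic.

```python
a, b = 2, 2    # Beta(2, 2) prior: weak, centred on 0.5
k, n = 7, 10   # observed: 7 successes in 10 trials

# Conjugacy: Beta(a, b) prior + Binomial data -> Beta(a + k, b + n - k) posterior
post_a, post_b = a + k, b + (n - k)
post_mean = post_a / (post_a + post_b)

print(round(post_mean, 3), k / n)  # 0.643 0.7 -- the prior shrinks the MLE toward 0.5
```

This also answers the probing question: with little data, the prior pseudo-counts (a, b) dominate the posterior; with lots of data, k and n swamp them.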
5.2: Resampling & Validation
- Understand bootstrap and jackknife methods.
- Connect these to cross-validation in ML.
- Learn bias-variance tradeoff mathematically.
Deeper Insight: “Why might bootstrap overestimate model variance on small datasets?”
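A jackknife sketch as a sanity check: for the sample mean, the jackknife standard error reproduces the classical s/√n exactly (the data are invented):

```python
import statistics
from math import sqrt

data = [4.2, 5.1, 3.8, 6.0, 5.5, 4.9, 5.2, 4.4]   # invented sample
n = len(data)

# Leave-one-out replicates of the statistic (here: the mean).
loo_means = [statistics.mean(data[:i] + data[i + 1:]) for i in range(n)]
jack_mean = statistics.mean(loo_means)
jack_se = sqrt((n - 1) / n * sum((m - jack_mean) ** 2 for m in loo_means))

# For the mean, the jackknife SE equals the classical stdev / sqrt(n).
print(round(jack_se, 4), round(statistics.stdev(data) / sqrt(n), 4))
```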
5.3: Experimental Design
- Learn randomization, control, and blocking.
- Study A/B/n testing, sequential testing, and false discovery correction.
- Connect to causal inference (randomization makes treatment assignment independent of confounders).
Probing Question: “What if your A/B test shows a lift, but you suspect Simpson’s paradox?”