Olympiad-Style Problem Set with Complete Solutions
Think Unlimited · Descriptive Statistics · Distributions · Estimation
| Student | Raw Score | Class Mean (μ) | Class Std Dev (σ) |
|---|---|---|---|
| Arya | 72 | 60 | 8 |
| Bhanu | 85 | 78 | 14 |
| Charu | 91 | 88 | 4 |
A uniform shift of origin does not change relative standing — everyone moves equally.
Principle: Multiplying all data by a nonzero constant scales both the mean and standard deviation equally, leaving z-scores invariant. The z-score is dimensionless and scale-free.
Interpretation: Dev scored 23 points below the class average, placing him 2.3 standard deviations below the mean. In a roughly normal distribution, fewer than ~2% of students would score this low, indicating Dev performed in the bottom tail of his class.
One extreme outlier caused almost a 47% shift in the mean but barely affected the median — a vivid demonstration of the median's robustness.
This is why we cannot use the average of deviations as a measure of spread — we must square them first (variance) or take absolute values (mean absolute deviation).
| # | Variable | Sample Values |
|---|---|---|
| 1 | Department | CSE, EEE, MECH, CIVIL |
| 2 | Year of study | 1st, 2nd, 3rd, 4th |
| 3 | Temperature of lab (°C) | 22, 23, 25, 19 |
| 4 | Monthly pocket money (₹) | 2000, 3500, 0, 5000 |
| 5 | Whether student owns a laptop | Yes, No, Yes, Yes |
| Variable | Level | Key Reason |
|---|---|---|
| Department | Nominal | Named categories with no natural order; CSE ≠ > EEE |
| Year of Study | Ordinal | Ordered (1st < 2nd < 3rd), but gaps between years are not equal in meaning |
| Temp (°C) | Interval | Equal units (1°C = 1°C everywhere), but 0°C does not mean "no temperature" — zero is arbitrary |
| Pocket Money (₹) | Ratio | True zero exists (₹0 = no money), ratios are meaningful (₹5000 is 2.5× ₹2000) |
| Laptop Ownership | Nominal (Binary) | Dichotomous category; no ordering, no scale |
| Variable | Mode | Median | Mean |
|---|---|---|---|
| Department (Nominal) | ✅ Only valid option | ❌ No ordering | ❌ Not meaningful |
| Year of Study (Ordinal) | ✅ | ✅ Ordering exists | ⚠️ Technically invalid (unequal gaps) |
| Temp (°C) (Interval) | ✅ | ✅ | ✅ |
| Pocket Money (Ratio) | ✅ | ✅ | ✅ |
| Laptop Ownership (Binary) | ✅ (Majority category) | ⚠️ Tells only majority | ✅ Gives proportion |
The mean of a binary 0/1 variable equals the proportion of "1"s in the sample. This is the one case where the mean is meaningful for nominal data.
To make meaningful temperature ratios, you must convert to Kelvin (a ratio scale with absolute zero): 25°C = 298 K, 20°C = 293 K → ratio = 298/293 ≈ 1.017, i.e., only ~1.7% warmer in absolute thermodynamic terms.
Only Ratio scales support meaningful ratio comparisons.This makes physical sense — rainfall has a hard lower bound of 0 mm but no upper limit (cloudburst events extend the right tail).
The range is extremely sensitive to the new outlier. The IQR, covering only the middle 50%, is largely resistant to it.
| Run | Machine A (mm) | Machine B (mm) |
|---|---|---|
| 1 | 10.1 | 9.5 |
| 2 | 10.0 | 11.0 |
| 3 | 9.9 | 10.2 |
| 4 | 10.2 | 8.8 |
| 5 | 10.0 | 10.5 |
| 6 | 9.8 | 10.0 |
Dividing by n instead of n−1 produces a biased estimator that systematically underestimates the population variance σ². Dividing by (n−1) corrects this bias, making s² an unbiased estimator of σ².
n−1 ensures s² is an unbiased estimator of σ²| Asset | Mean Monthly Return (%) | Std Dev (%) |
|---|---|---|
| Government Bond | 0.40 | 0.10 |
| Blue-Chip Stock | 1.80 | 0.60 |
| Crypto Token | 5.00 | 18.00 |
Left-skewed: Test scores on a very easy exam. The upper bound is 100% — scores cannot exceed this. Most students score near the top (90–100%), while a few who struggle extend the left tail toward lower marks. The ceiling compresses the right side and forces the tail leftward.
Example: Salaries: ₹20K, ₹25K, ₹30K, ₹35K, ₹500K (n=5). Median = ₹30K (50th percentile). Mean = ₹122K. These are drastically different. A student at the 50th percentile earned exactly ₹30K, not ₹122K.
In right-skewed data: mean > median. Scores at P50 ≠ mean.Importance: Standardisation converts any variable into a dimensionless z-score, enabling comparison across variables measured in different units and scales. It is the foundation of z-tests, normal tables, and principal component analysis. All z-scores share the same scale, making cross-variable and cross-study comparisons rigorous.
| Statistic | Server A | Server B |
|---|---|---|
| n | 100 | 100 |
| Mean | 3.2 s | 3.2 s |
| Median | 3.1 s | 2.1 s |
| Mode | 3.0 s | 1.5 s |
| Std Dev | 0.4 s | 2.8 s |
| Min | 2.0 s | 0.5 s |
| Max | 4.5 s | 18.0 s |
| Q1 | 2.9 s | 1.2 s |
| Q3 | 3.5 s | 4.0 s |
IC252 · Introduction to Statistics · Think Unlimited · Problem Set prepared in Olympiad Style(made by Reman Dey(hehe) with Claude sonnet 4.6 llm(locally hoisted))