Scenario D: Revenue Prediction (Regression)A business predic…

Scenario D: Revenue Prediction (Regression)A business predicts weekly revenue using features like ad_spend, number_of_customers, and average_discount.Two models are evaluated on a held-out test set: Model A: R² = 0.62, RMSE = 18,000 Model B: R² = 0.58, RMSE = 16,000 Lower RMSE is better. Higher R² is better.If the business cares about minimizing typical dollar error in forecasts, they should prefer:

Scenario C: Decision Trees and EnsemblesYou train a decision…

Scenario C: Decision Trees and EnsemblesYou train a decision tree classifier for churn with different maximum depths.You observe the following test performance: Depth 2: Accuracy 0.78, Recall(churn) 0.30 Depth 6: Accuracy 0.82, Recall(churn) 0.40 Depth 20: Accuracy 0.80, Recall(churn) 0.28 Which depth most strongly suggests overfitting?

Scenario A: Messy Retail Sales ExtractYou are analyzing a re…

Scenario A: Messy Retail Sales ExtractYou are analyzing a retail dataset with columns: date (string like “2025-03-01”) region (text with inconsistent capitalization and extra spaces) channel (“Online” or “Store”) price (numeric, may contain missing values) quantity (integer) Assume each row is an order line. You will clean the data and compute KPIs.You want total revenue by region. Which expression is best?