Consider two decision trees trained on the exact same data….

Consider two decision trees trained on the exact same data. DT was trained using correlation for splitting, RT was trained using splits determined randomly. Both trees were trained with leaf_size = 1. Which option below correctly describes (in order): The slowest to train, the slowest to query, the highest accuracy on in-sample data?

Consider two decision trees trained on the exact same data….

Consider two decision trees trained on the exact same data. DT was trained using correlation for splitting, RT was trained using splits determined randomly. Both trees were trained with leaf_size = 1. Which option below correctly describes (in order): The fastest to train, the fastest to query, the highest accuracy on in-sample data?

Consider a data set composed of 1500 samples where X is draw…

Consider a data set composed of 1500 samples where X is drawn randomly uniformly from -2*PI to +2*PI, and Y = 2*X^3 + 4*X^2 + 5. Consider linear regression and how it relates to kNN (k=1), decision trees (leaf_size=1), and random trees (leaf_size=1). Which statement is false regarding in-sample RMSE?

Suppose you are using one of the minimizing optimizers from…

Suppose you are using one of the minimizing optimizers from Scipy. You are using it to optimize your portfolio for MINIMUM volatility, and port_vals are the daily total values of the portfolio for a particular allocation. Which of the following would be the best way to compute the objective function for the optimizer?