Consider two decision trees trained on the exact same data….

Consider two decision trees trained on the exact same data. DT was trained using correlation for splitting, RT was trained using splits determined randomly. Both trees were trained with leaf_size = 1. Which option below correctly describes (in order): The slowest to train, the slowest to query, the highest accuracy on in-sample data?

Consider two decision trees trained on the exact same data….

Consider two decision trees trained on the exact same data. DT was trained using correlation for splitting, RT was trained using splits determined randomly. Both trees were trained with leaf_size = 1. Which option below correctly describes (in order): The fastest to train, the fastest to query, the highest accuracy on in-sample data?

Consider a data set composed of 1500 samples where X is draw…

Consider a data set composed of 1500 samples where X is drawn randomly uniformly from -2*PI to +2*PI, and Y = 2*X^3 + 4*X^2 + 5. Consider linear regression and how it relates to kNN (k=1), decision trees (leaf_size=1), and random trees (leaf_size=1). Which statement is false regarding in-sample RMSE?