After fitting a logistic regression classifier to an email data set, we obtained, for each data sample, the predicted probability that the email is junk (event) rather than not junk (non-event). The figure below shows the distribution of the emails belonging to these two classes in the data set. If the threshold T is set at 0.5, as shown in the figure, what will the Receiver Operating Characteristic (ROC) curve look like? Select the correct answer from a, b, or c below:
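
As background for this question, here is a minimal sketch of how an ROC curve is traced from predicted probabilities. The scores and labels are made-up illustration values, not the data behind the figure, and the function names are hypothetical. The point it illustrates is that a single threshold such as T = 0.5 yields one (FPR, TPR) point, while the curve comes from sweeping the threshold over all scores.

```python
def rates_at_threshold(scores, labels, threshold):
    """Return (FPR, TPR) when emails with score >= threshold are flagged as junk."""
    positives = sum(labels)               # junk emails (event)
    negatives = len(labels) - positives   # non-junk emails (non-event)
    tp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 1)
    fp = sum(1 for s, y in zip(scores, labels) if s >= threshold and y == 0)
    return fp / negatives, tp / positives


def roc_points(scores, labels):
    """Sweep the threshold from above the highest score down to the lowest."""
    thresholds = [float("inf")] + sorted(set(scores), reverse=True)
    return [rates_at_threshold(scores, labels, t) for t in thresholds]


# Hypothetical predicted probabilities and true labels (1 = junk, 0 = not junk).
scores = [0.95, 0.85, 0.70, 0.60, 0.45, 0.40, 0.30, 0.20, 0.10, 0.05]
labels = [1, 1, 1, 0, 1, 0, 0, 1, 0, 0]

curve = roc_points(scores, labels)                        # full ROC curve
point_at_half = rates_at_threshold(scores, labels, 0.5)   # one point on it
print(curve)
print(point_at_half)
```

The better the two class distributions are separated, the closer the swept curve gets to the top-left corner; heavily overlapping distributions pull it toward the diagonal.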

You are given a training data set consisting of 10 data points with a single predictor variable, x, and a numerical response, y, i.e., 10 (x, y) pairs. Upon fitting a simple linear regression model to this data set, you obtain an R² value of 0.96. You also fit a cubic regression model to the same data set. If the true relationship between the predictor variable and the numerical response variable is of the form y = β₀ + β₁x + β₂x² + ε (i.e., quadratic), where β₂ takes a very small value, which of the following methods would overfit the training data set? Choose the correct answer from the following:
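
To make the setup concrete, the following is a rough sketch that fits a linear and a cubic polynomial to 10 synthetic points. The data-generating coefficients (1, 2, 0.05), the noise level, and the helper names `fit_poly` and `r_squared` are all assumptions chosen for illustration, not values from the question. It relies on one general fact: because the linear model is nested inside the cubic one, the cubic fit's training R² can never be lower, so a high training R² alone says nothing about how well a model generalizes.

```python
import random


def fit_poly(xs, ys, degree):
    """Least-squares polynomial fit via the normal equations (fine for tiny problems)."""
    n = degree + 1
    # Augmented matrix [X^T X | X^T y] for the monomial basis 1, x, x^2, ...
    a = [[sum(x ** (i + j) for x in xs) for j in range(n)]
         + [sum(y * x ** i for x, y in zip(xs, ys))] for i in range(n)]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(n):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [v - f * p for v, p in zip(a[r], a[col])]
    return [a[i][n] / a[i][i] for i in range(n)]  # lowest power first


def r_squared(xs, ys, coeffs):
    """Training R^2 of a fitted polynomial."""
    mean_y = sum(ys) / len(ys)
    preds = [sum(c * x ** k for k, c in enumerate(coeffs)) for x in xs]
    ss_res = sum((y - p) ** 2 for y, p in zip(ys, preds))
    ss_tot = sum((y - mean_y) ** 2 for y in ys)
    return 1 - ss_res / ss_tot


# Synthetic data: quadratic truth with a small x^2 coefficient (assumed values).
rng = random.Random(0)
xs = [i / 10 for i in range(10)]
ys = [1 + 2 * x + 0.05 * x ** 2 + rng.gauss(0, 0.1) for x in xs]

r2_linear = r_squared(xs, ys, fit_poly(xs, ys, 1))
r2_cubic = r_squared(xs, ys, fit_poly(xs, ys, 3))
print(r2_linear, r2_cubic)
```

Comparing the two models on held-out points, rather than on the 10 training points, is what would reveal whether the extra cubic flexibility is fitting noise.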