Regress the classification labels in the training set on the words, pairs of words, and triplets of consecutive words in the target sentence. In the space below, report the following:
a. The shape of the data, showing that the number of variables exceeds the number of variables in part 1a. (8 points)
b. The intercept and the coefficients for words and combinations of words. (6 points)
c. Which coefficient(s) are statistically significant. (6 points)
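As a rough illustration of how the variable count grows once pairs and triplets are added, the sketch below builds a count matrix from single words only and then from words, consecutive pairs, and consecutive triplets, and prints both shapes. It covers only the feature construction, not the regression itself. The sentences, the helper names `ngrams` and `build_design_matrix`, and the whitespace tokenization are all assumptions made for this example; treating "pairs of words" as consecutive pairs is likewise an assumption, and none of this is the exam data or a required method.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return the consecutive n-grams of a token list as space-joined strings."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def build_design_matrix(sentences, max_n):
    """Build a count matrix with one column per distinct n-gram, n = 1..max_n."""
    rows = []
    for sentence in sentences:
        tokens = sentence.lower().split()  # naive whitespace tokenization (assumption)
        counts = Counter()
        for n in range(1, max_n + 1):
            counts.update(ngrams(tokens, n))
        rows.append(counts)
    # One column per distinct term seen anywhere in the sentences.
    vocab = sorted(set().union(*rows))
    # A Counter returns 0 for terms that a sentence does not contain.
    matrix = [[row[term] for term in vocab] for row in rows]
    return matrix, vocab


# Hypothetical toy sentences, NOT the exam's training set.
toy_sentences = [
    "the movie was good",
    "the movie was bad",
    "good plot bad acting",
]

X1, vocab1 = build_design_matrix(toy_sentences, max_n=1)
X3, vocab3 = build_design_matrix(toy_sentences, max_n=3)

print("words only (rows, columns):", (len(X1), len(vocab1)))
print("words + pairs + triplets (rows, columns):", (len(X3), len(vocab3)))
```

The second shape has the same number of rows but more columns, which is the comparison part a asks for. On the real data, the same idea applies with whatever tokenizer and regression routine the notebook already uses.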

Along with your written responses, you must submit your Jupyter Notebook or R script in both of the following formats:
- Application script file (.ipynb for Jupyter or .R for R)
- PDF version of the same script
Export or knit your code so that all code and outputs are visible. Your script must be well-organized, clearly labeled, and structured so that each section corresponds to the appropriate exam question. Code that is disorganized or unlabeled may result in a deduction.