You are working with a dataset of bike-share trips. Each row…

You are working with a dataset of bike-share trips. Each row represents a single trip and includes: the start and end timestamps, the duration of the bike trip (in minutes), and information such as the start station, end station, and bike ID. You convert the timestamps into datetime format and calculate trip duration. Then, you extract the month and year from the start time and group the data by month to explore usage patterns across the year. After aggregating the data by month, the average trip duration and number of trips in each month is summarized below. You notice that the average trip duration appears to be noticeably different in January 2015 and July 2015.  Which statistical test, among the ones we studied, would you use to determine whether the average trip duration is significantly different between January and August? Describe briefly any preprocessing or assumptions you should check before running this test.