Data science assignment | Computer Science homework help
- Exploratory Data Analysis: College data set: College.csv. It contains a number of variables for 777 different universities and colleges in the US. Do all the exercises in Python:
8a. Read the csv file with pandas 8b.Fix the first row as row headers 8c.
- produce a numerical summary of the variables in the data set.
- produce a scatterplot matrix of the first ten columns or variables of the data.
- produce side-by-side boxplots of Outstate versus Private
- Create a new qualitative variable, called Elite, by binning the Top10perc variable and divide universities into two groups based on whether or not the proportion of students coming from the top 10 % of their high school classes exceeds 50 %
- Produce some histograms with differing numbers of bins for a few of the quantitative variables: Room.Board','Books', 'Personal', 'Expend'
- Examine the elite schools more closely.
Q3 Textbook Applied Questions – Attempt with Python