# Load house sales data: kc_house_data.csv.zip # What is the content, could you read it? do you understand collumns? # Explore the data for housing --> make scatter plot of selected features --> create simple regression model of sqft_living to price --> evaluate a simple model --> is linear function good enough? try quadratic polynomial # Split your data into training sample and test sample --> what is trainign error and testing error of your model? --> predict the house price for a given sqft_living --> predict the sqft_living for a given price of the house --> add more feaures --> is the model better now? --> maybe using range of data would work better? --> predict house price for a house id = 5309101299 what is this house like? --> predict house price for a house id = 1925069082 # Explore more those data!!!