- Notifications
You must be signed in to change notification settings - Fork1
Jayesh88/King-county-House-purchase-price-prediction
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
King County is a county located in the U.S. state of Washington. The population was 2,149,970 ina 2016 census estimate. King is the most populous county in Washington, and the 13th-mostpopulous in the United States. The county seat is Seattle, which is the state’s largest city. KingCounty is one of three Washington counties that are included in the Seattle-Tacoma-Bellevuemetropolitan statistical area. About two-thirds of King County’s population lives in the city’ssuburbs. As of 2011, King County was the 86th highest-income county in the United States. Thisdocument addresses the factors concerning the “house sale prices” in King County sold between May2014 and May 2015.
For this project, I am using a dataset from Kaggle, ‘kc_house_data.csv’(https://www.kaggle.com/harlfoxem/housesalesprediction ). This dataset has a good mix ofcategorical independent variables, and a continuous dependent variable (price). This datasetcontains house sale prices for King County. It is a useful dataset for evaluating simple regressionmodels. In this dataset, I will predict the sales price of houses in King County. It includes homessold between May 2014 and May 2015.
I performed Data cleaning, data modelling, variable selection method (step: forward and backward), Checking for skewness, Correlation to check which variables have positive and negative impact on Price prediction, k-fold cross validation on Gradient boosting.
Different models are used overtime to check Accuracy:
1.Linear regression
2.Decision Tree
3.Random Forest
4.Gradient Boosting
About
Topics
Resources
Uh oh!
There was an error while loading.Please reload this page.