Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
NotificationsYou must be signed in to change notification settings

Jayesh88/King-county-House-purchase-price-prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 

Repository files navigation

King County is a county located in the U.S. state of Washington. The population was 2,149,970 ina 2016 census estimate. King is the most populous county in Washington, and the 13th-mostpopulous in the United States. The county seat is Seattle, which is the state’s largest city. KingCounty is one of three Washington counties that are included in the Seattle-Tacoma-Bellevuemetropolitan statistical area. About two-thirds of King County’s population lives in the city’ssuburbs. As of 2011, King County was the 86th highest-income county in the United States. Thisdocument addresses the factors concerning the “house sale prices” in King County sold between May2014 and May 2015.

For this project, I am using a dataset from Kaggle, ‘kc_house_data.csv’(https://www.kaggle.com/harlfoxem/housesalesprediction ). This dataset has a good mix ofcategorical independent variables, and a continuous dependent variable (price). This datasetcontains house sale prices for King County. It is a useful dataset for evaluating simple regressionmodels. In this dataset, I will predict the sales price of houses in King County. It includes homessold between May 2014 and May 2015.

I performed Data cleaning, data modelling, variable selection method (step: forward and backward), Checking for skewness, Correlation to check which variables have positive and negative impact on Price prediction, k-fold cross validation on Gradient boosting.

Different models are used overtime to check Accuracy:

1.Linear regression

2.Decision Tree

3.Random Forest

4.Gradient Boosting

Releases

No releases published

Packages

No packages published

Languages


[8]ページ先頭

©2009-2025 Movatter.jp