Housing Value Predicted Modelling using Random Forest Regression: Case study California Housing Dataset

Authors

  • Firman Matiinu Sigit Tulungagung University
  • Haniel Rangga Pramuditya Putra Tulungagung University

DOI:

https://doi.org/10.58812/wsist.v2i01.1021

Keywords:

Decision Tree Regression, Housing, Linier Regression, Random Forrest Regression

Abstract

Housing price comes from many factors which are location, population, style of house, age of house, and people income. Many real estate developer companies use this data to predict price of house and give amount of investment for potential housing prices. In this study, we try to help the developer companies to predict price of house based on dataset. We try to build machine learning that can predict for housing price. There are three machine learning models that are used for this study, namely Linier Regression Modelling, Decison Three Regression Modelling, and Random Forest Regression Modelling. Each of those machine learning is trained using California Housing Dataset (1990) which is split into training set and testing set that training set contains 16512 instances and testing set contains 4128 instances. Training dataset is trained into each of machine learning model (Linier Regression, Decison Tree Regression, and Random Forrest Regression) after finished the training followed by evaluting the error prediction using K-Folds Cross Validation and showed by using Root Mean Square Error (RMSE). In this study, Random Forest Regression gives a better performance than two others (Linier Regression and Decision Tree Regression models) with error RMSE =49642.12.

References

Lima Faktor yang mempengaruhi harga jual properti. (2019). Penilaian. Retrieved July 28, 2023, from https://penilaian.id/2019/03/25/5-faktor-yang-mempengaruhi-harga-jual-properti/

California Housing Dataset. (1990). California Housing Price. Retrieved July 25, 2023, from https://www.kaggle.com/datasets/harrywang/housing

Aurelien Geron. (2019). Hands-on Machine Learning with Scikit-Learn and TensorFlow. O’Reilly.

Decision Tree (2023). Decision Tree Regressor — A Visual Guide with Scikit Learn. Retrieved July 25, 2023, from https://towardsdatascience.com/decision-tree-regressor-a-visual-guide-with-scikit-learn-2aa9e01f5d7f

Cross Validation (2019). Cross Validation — Why & How. Retrieved July 25, 2023, from https://towardsdatascience.com/cross-validation-430d9a5fee22

Downloads

Published

2024-04-30

How to Cite

Sigit, F. M., & Putra, H. R. P. (2024). Housing Value Predicted Modelling using Random Forest Regression: Case study California Housing Dataset. West Science Information System and Technology, 2(01), 182–187. https://doi.org/10.58812/wsist.v2i01.1021