Intelligent Feature Selection Ensemble Model for Price Prediction in Real Estate Markets
Abstract
:1. Introduction
2. Literature Review
3. Methodology
3.1. Data Acquisition
3.2. Preprocessing
3.3. Selection Methods
- is the goodness of partitioning at node t using the partitioning criterion s.
- is the impurity of the parent node.
- and are the impurities of the child nodes after partitioning.
- and represent the proportions of data assigned to the right and left nodes, respectively.
3.4. Learning Algorithm Selection
3.4.1. AdaBoost
3.4.2. Gradient Boosting
3.4.3. Random Forest Regressor
3.4.4. Extra Trees Regressor
3.4.5. Bagging Regressor
3.4.6. Stacking Regressor
3.4.7. Voting Regressor
3.5. Model Training
3.6. Model Evaluation
3.6.1. Coefficient of Determination (R2)
3.6.2. Mean Squared Error (MSE)
3.6.3. Root Mean Squared Error (RMSE)
3.6.4. Mean Absolute Error (MAE)
4. Results and Discussion
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Shi, X.; Cheng, Q.; Xia, M. The Industrial Linkages of the Real Estate Industry and Its Impact on the Economy Caused by the COVID-19 Pandemic. In Proceedings of the 25th International Symposium on Advancement of Construction Management and Real Estate; Lu, X., Zhang, Z., Lu, W., Peng, Y., Eds.; Springer: Singapore, 2021; pp. 1015–1028. [Google Scholar] [CrossRef]
- Karanasos, M.; Yfanti, S. On the Economic Fundamentals behind the Dynamic Equicorrelations among Asset Classes: Global Evidence from Equities, Real Estate, and Commodities. J. Int. Financ. Mark. Inst. Money 2021, 74, 101292. [Google Scholar] [CrossRef]
- Abdul Rahman, M.S.; Awang, M.; Jagun, Z.T. Polycrisis: Factors, Impacts, and Responses in the Housing Market. Renew. Sustain. Energy Rev. 2024, 202, 114713. [Google Scholar] [CrossRef]
- Newell, G.; McGreal, S. The Significance of Development Sites in Global Real Estate Transactions. Habitat Int. 2017, 66, 117–124. [Google Scholar] [CrossRef]
- El Bied, S.; Ros Mcdonnell, L.; de-la-Fuente-Aragón, M.V.; Ros Mcdonnell, D. A Comprehensive Bibliometric Analysis of Real Estate Research Trends. Int. J. Financ. Stud. 2024, 12, 95. [Google Scholar] [CrossRef]
- Schoen, E.J. The 2007–2009 Financial Crisis: An Erosion of Ethics: A Case Study. J. Bus. Ethics 2017, 146, 805–830. [Google Scholar] [CrossRef]
- Trovato, M.R.; Clienti, C.; Giuffrida, S. People and the City: Urban Fragility and the Real Estate-Scape in a Neighborhood of Catania, Italy. Sustainability 2020, 12, 5409. [Google Scholar] [CrossRef]
- Zhang, P.; Hu, S.; Li, W.; Zhang, C.; Yang, S.; Qu, S. Modeling Fine-Scale Residential Land Price Distribution: An Experimental Study Using Open Data and Machine Learning. Appl. Geogr. 2021, 129, 102442. [Google Scholar] [CrossRef]
- De la Luz Juárez, G.; Daza, A.S.; González, J.Z. La crisis financiera internacional de 2008 y algunos de sus efectos económicos sobre México. Contaduría Adm. 2015, 60, 128–146. [Google Scholar] [CrossRef]
- Tekouabou, S.C.K.; Gherghina, Ş.C.; Kameni, E.D.; Filali, Y.; Idrissi Gartoumi, K. AI-Based on Machine Learning Methods for Urban Real Estate Prediction: A Systematic Survey. Arch. Comput. Methods Eng. 2024, 31, 1079–1095. [Google Scholar] [CrossRef]
- Porter, L.; Fields, D.; Landau-Ward, A.; Rogers, D.; Sadowski, J.; Maalsen, S.; Kitchin, R.; Dawkins, O.; Young, G.; Bates, L.K. Planning, Land and Housing in the Digital Data Revolution/The Politics of Digital Transformations of Housing/Digital Innovations, PropTech and Housing–the View from Melbourne/Digital Housing and Renters: Disrupting the Australian Rental Bond System and Tenant Advocacy/Prospects for an Intelligent Planning System/What Are the Prospects for a Politically Intelligent Planning System? Plan. Theory Pract. 2019, 20, 575–603. [Google Scholar] [CrossRef]
- Fabozzi, F.J.; Kynigakis, I.; Panopoulou, E.; Tunaru, R.S. Detecting Bubbles in the US and UK Real Estate Markets. J. Real Estate Financ. Econ. 2020, 60, 469–513. [Google Scholar] [CrossRef]
- Pollestad, A.J.; Næss, A.B.; Oust, A. Towards a Better Uncertainty Quantification in Automated Valuation Models. J. Real Estate Financ. Econ. 2024. [Google Scholar] [CrossRef]
- Munanga, Y.; Musakwa, W.; Chirisa, I. The Urban Planning-Real Estate Development Nexus. In The Palgrave Encyclopedia of Urban and Regional Futures; Brears, R.C., Ed.; Springer International Publishing: Cham, Switzerland, 2022; pp. 2046–2054. ISBN 978-3-030-87745-3. [Google Scholar] [CrossRef]
- Kwangwama, N.A.; Mafuku, S.H.; Munanga, Y. A Review of the Contribution of the Real Estate Sector Towards the Attainment of the New Urban Agenda in Zimbabwe. In New Urban Agenda in Zimbabwe: Built Environment Sciences and Practices; Chavunduka, C., Chirisa, I., Eds.; Springer Nature: Singapore, 2024; pp. 151–168. ISBN 978-981-97-3199-2. [Google Scholar] [CrossRef]
- Behera, I.; Nanda, P.; Mitra, S.; Kumari, S. Machine Learning Approaches for Forecasting Financial Market Volatility. In Machine Learning Approaches in Financial Analytics; Maglaras, L.A., Das, S., Tripathy, N., Patnaik, S., Eds.; Springer Nature: Cham, Switzerland, 2024; pp. 431–451. ISBN 978-3-031-61037-0. [Google Scholar] [CrossRef]
- Goldfarb, A.; Greenstein, S.M.; Tucker, C.E. Economic Analysis of the Digital Economy 2015. Available online: http://www.nber.org/books/gree13-1 (accessed on 12 February 2025).
- Bond, S.A.; Shilling, J.D.; Wurtzebach, C.H. Commercial Real Estate Market Property Level Capital Expenditures: An Options Analysis. J. Real Estate Finance Econ. 2019, 59, 372–390. [Google Scholar] [CrossRef]
- Arcuri, N.; De Ruggiero, M.; Salvo, F.; Zinno, R. Automated Valuation Methods through the Cost Approach in a BIM and GIS Integration Framework for Smart City Appraisals. Sustainability 2020, 12, 7546. [Google Scholar] [CrossRef]
- Moorhead, M.; Armitage, L.; Skitmore, M. Real Estate Development Feasibility and Hurdle Rate Selection. Buildings 2024, 14, 1045. [Google Scholar] [CrossRef]
- Htun, H.H.; Biehl, M.; Petkov, N. Survey of feature selection and extraction techniques for stock market prediction. Financ. Innov. 2023, 9, 26. [Google Scholar] [CrossRef]
- Glaeser, E.L.; Nathanson, C.G. An Extrapolative Model of House Price Dynamics. J. Financ. Econ. 2017, 126, 147–170. [Google Scholar] [CrossRef]
- Zheng, M.; Wang, H.; Wang, C.; Wang, S. Speculative Behavior in a Housing Market: Boom and Bust. Econ. Model. 2017, 61, 50–64. [Google Scholar] [CrossRef]
- Fotheringham, A.S.; Crespo, R.; Yao, J. Exploring, Modelling and Predicting Spatiotemporal Variations in House Prices. Ann. Reg. Sci. 2015, 54, 417–436. [Google Scholar] [CrossRef]
- Kumar, S.; Talasila, V.; Pasumarthy, R. A Novel Architecture to Identify Locations for Real Estate Investment. Int. J. Inf. Manag. 2021, 56, 102012. [Google Scholar] [CrossRef]
- Barkham, R.; Bokhari, S.; Saiz, A. Urban Big Data: City Management and Real Estate Markets. In Artificial Intelligence, Machine Learning, and Optimization Tools for Smart Cities: Designing for Sustainability; Pardalos, P.M., Rassia, S.T., Tsokas, A., Eds.; Springer International Publishing: Cham, Switzerland, 2022; pp. 177–209. ISBN 978-3-030-84459-2. [Google Scholar] [CrossRef]
- Moghaddam, S.N.M.; Cao, H. Housing, Affordability, and Real Estate Market Analysis. In Artificial Intelligence-Driven Geographies: Revolutionizing Urban Studies; Moghaddam, S.N.M., Cao, H., Eds.; Springer Nature: Singapore, 2024; pp. 361–393. ISBN 978-981-97-5116-7. [Google Scholar] [CrossRef]
- Parmezan, A.R.S.; Souza, V.M.A.; Batista, G.E.A.P.A. Evaluation of Statistical and Machine Learning Models for Time Series Prediction: Identifying the State-of-the-Art and the Best Conditions for the Use of Each Model. Inf. Sci. 2019, 484, 302–337. [Google Scholar] [CrossRef]
- Rane, N.; Choudhary, S.P.; Rane, J. Ensemble Deep Learning and Machine Learning: Applications, Opportunities, Challenges, and Future Directions. Stud. Med. Health Sci. 2024, 1, 18–41. [Google Scholar] [CrossRef]
- Naz, R.; Jamil, B.; Ijaz, H. Machine Learning, Deep Learning, and Hybrid Approaches in Real Estate Price Prediction: A Comprehensive Systematic Literature Review. Proc. Pak. Acad. Sci. A. Phys. Comput. Sci. 2024, 61, 129–144. [Google Scholar] [CrossRef]
- Breuer, W.; Steininger, B.I. Recent trends in real estate research: A comparison of recent working papers and publications using machine learning algorithms. J. Bus. Econ. 2020, 90, 963–974. [Google Scholar] [CrossRef] [PubMed]
- Adetunji, A.B.; Akande, O.N.; Ajala, F.A.; Oyewo, O.; Akande, Y.F.; Oluwadara, G. House Price Prediction Using Random Forest Machine Learning Technique. Procedia Comput. Sci. 2022, 199, 806–813. [Google Scholar] [CrossRef]
- Jui, J.J.; Imran Molla, M.M.; Bari, B.S.; Rashid, M.; Hasan, M.J. Flat Price Prediction Using Linear and Random Forest Regression Based on Machine Learning Techniques. In Embracing Industry 4.0; Mohd Razman, M.A., Mat Jizat, J.A., Mat Yahya, N., Myung, H., Zainal Abidin, A.F., Abdul Karim, M.S., Eds.; Springer: Singapore, 2020; pp. 205–217. [Google Scholar] [CrossRef]
- Wang, X.; Wen, J.; Zhang, Y.; Wang, Y. Real Estate Price Forecasting Based on SVM Optimized by PSO. Optik 2014, 125, 1439–1443. [Google Scholar] [CrossRef]
- Liu, G. Research on Prediction and Analysis of Real Estate Market Based on the Multiple Linear Regression Model. Sci. Program. 2022, 2022, 5750354. [Google Scholar] [CrossRef]
- Sharma, M.; Chauhan, R.; Devliyal, S.; Chythanya, K.R. House Price Prediction Using Linear and Lasso Regression. In Proceedings of the 2024 3rd International Conference for Innovation in Technology (INOCON), Bangalore, India, 1–3 March 2024; pp. 1–5. [Google Scholar] [CrossRef]
- Harahap, T.H. K-Nearest Neighbor Algorithm for Predicting Land Sales Price. Al’adzkiya Int. Comput. Sci. Inf. Technol. J. 2022, 3, 58–67. [Google Scholar]
- Mohd, T.; Jamil, N.S.; Johari, N.; Abdullah, L.; Masrom, S. An Overview of Real Estate Modelling Techniques for House Price Prediction. In Charting a Sustainable Future of ASEAN in Business and Social Sciences; Kaur, N., Ahmad, M., Eds.; Springer: Singapore, 2020; pp. 321–338. [Google Scholar] [CrossRef]
- Zhu, D.; Cai, C.; Yang, T.; Zhou, X. A Machine Learning Approach for Air Quality Prediction: Model Regularization and Optimization. Big Data Cogn. Comput. 2018, 2, 5. [Google Scholar] [CrossRef]
- Huang, J.-C.; Ko, K.-M.; Shu, M.-H.; Hsu, B.-M. Application and Comparison of Several Machine Learning Algorithms and Their Integration Models in Regression Problems. Neural Comput. Appl. 2020, 32, 5461–5469. [Google Scholar] [CrossRef]
- Ahsan, M.M.; Mahmud, M.A.P.; Saha, P.K.; Gupta, K.D.; Siddique, Z. Effect of Data Scaling Methods on Machine Learning Algorithms and Model Performance. Technologies 2021, 9, 52. [Google Scholar] [CrossRef]
- Rane, N.L.; Rane, J.; Mallick, S.K.; Kaya, Ö. Scalable and Adaptive Deep Learning Algorithms for Large-Scale Machine Learning Systems; Deep Science Publishing: San Francisco, CA, USA, 2024. [Google Scholar]
- Jiaji, S. Machine Learning-Driven Dynamic Scaling Strategies for High Availability Systems. Int. IT J. Res. 2024, 2, 35–42. [Google Scholar] [CrossRef]
- Ho, W.K.O.; Tang, B.-S.; Wong, S.W. Predicting Property Prices with Machine Learning Algorithms. J. Prop. Res. 2021, 38, 48–70. [Google Scholar] [CrossRef]
- Phan, T.D. Housing Price Prediction Using Machine Learning Algorithms: The Case of Melbourne City, Australia. In Proceedings of the 2018 International Conference on Machine Learning and Data Engineering (iCMLDE), Sydney, NSW, Australia, 3–7 December 2018; pp. 35–42. [Google Scholar] [CrossRef]
- Pai, P.-F.; Wang, W.-C. Using Machine Learning Models and Actual Transaction Data for Predicting Real Estate Prices. Appl. Sci. 2020, 10, 5832. [Google Scholar] [CrossRef]
- Dhal, P.; Azad, C. A Comprehensive Survey on Feature Selection in the Various Fields of Machine Learning. Appl. Intell. 2022, 52, 4543–4581. [Google Scholar] [CrossRef]
- Charles, Z.; Papailiopoulos, D. Stability and Generalization of Learning Algorithms That Converge to Global Optima. In Proceedings of the 35th International Conference on Machine Learning, Stockholm, Sweden, 10–15 July 2018; pp. 745–754. Available online: https://tinyurl.com/2p8mj8u9 (accessed on 25 February 2025).
- Gilpin, L.H.; Bau, D.; Yuan, B.Z.; Bajwa, A.; Specter, M.; Kagal, L. Explaining Explanations: An Overview of Interpretability of Machine Learning. In Proceedings of the 2018 IEEE 5th International Conference on Data Science and Advanced Analytics (DSAA), Turin, Italy, 1–3 October 2018; pp. 80–89. [Google Scholar] [CrossRef]
- Aremu, O.O.; Hyland-Wood, D.; McAree, P.R. A Machine Learning Approach to Circumventing the Curse of Dimensionality in Discontinuous Time Series Machine Data. Reliab. Eng. Syst. Saf. 2020, 195, 106706. [Google Scholar] [CrossRef]
- Khan, W.A.; Chung, S.H.; Awan, M.U.; Wen, X. Machine Learning Facilitated Business Intelligence (Part II): Neural Networks Optimization Techniques and Applications. Ind. Manag. Data Syst. 2020, 120, 128–163. [Google Scholar] [CrossRef]
- Qin, H. Athletic Skill Assessment and Personalized Training Programming for Athletes Based on Machine Learning. J. Electr. Syst. 2024, 20, 1379–1387. [Google Scholar] [CrossRef]
- Park, B.; Bae, J.K. Using Machine Learning Algorithms for Housing Price Prediction: The Case of Fairfax County, Virginia Housing Data. Expert Syst. Appl. 2015, 42, 2928–2934. [Google Scholar] [CrossRef]
- Varma, A.; Sarma, A.; Doshi, S.; Nair, R. House Price Prediction Using Machine Learning and Neural Networks. In Proceedings of the 2018 Second International Conference on Inventive Communication and Computational Technologies (ICICCT), Coimbatore, India, 20–21 April 2018; pp. 1936–1939. [Google Scholar] [CrossRef]
- Rafiei, M.H.; Adeli, H. A Novel Machine Learning Model for Estimation of Sale Prices of Real Estate Units. J. Constr. Eng. Manag. 2016, 142, 04015066. [Google Scholar] [CrossRef]
- Ahmad, M.W.; Mourshed, M.; Rezgui, Y. Tree-Based Ensemble Methods for Predicting PV Power Generation and Their Comparison with Support Vector Regression. Energy 2018, 164, 465–474. [Google Scholar] [CrossRef]
- Arabameri, A.; Pal, S.C.; Rezaie, F.; Chakrabortty, R.; Saha, A.; Blaschke, T.; Di Napoli, M.; Ghorbanzadeh, O.; Ngo, P.T.T. Decision Tree Based Ensemble Machine Learning Approaches for Landslide Susceptibility Mapping. Geocarto Int. 2022, 37, 4594–4627. [Google Scholar] [CrossRef]
- Ribeiro, M.H.D.M.; dos Santos Coelho, L. Ensemble Approach Based on Bagging, Boosting and Stacking for Short-Term Prediction in Agribusiness Time Series. Appl. Soft Comput. 2020, 86, 105837. [Google Scholar] [CrossRef]
- Sibindi, R.; Mwangi, R.W.; Waititu, A.G. A Boosting Ensemble Learning Based Hybrid Light Gradient Boosting Machine and Extreme Gradient Boosting Model for Predicting House Prices. Eng. Rep. 2023, 5, e12599. [Google Scholar] [CrossRef]
- Kim, J.; Won, J.; Kim, H.; Heo, J. Machine-Learning-Based Prediction of Land Prices in Seoul, South Korea. Sustainability 2021, 13, 13088. [Google Scholar] [CrossRef]
- Sankar, M.; Chithambaramani, R.; Sivaprakash, P.; Ithayan, V.; Dilip Charaan, R.M.; Marichamy, D. Analysis of Landlord’s Land Price Prediction Using Machine Learning. In Proceedings of the 2024 5th International Conference on Smart Electronics and Communication (ICOSEC), Trichy, India, 18–20 September 2024; pp. 1514–1518. [Google Scholar] [CrossRef]
- Zelaya, C.V.G. Towards Explaining the Effects of Data Preprocessing on Machine Learning. In Proceedings of the 2019 IEEE 35th International Conference on Data Engineering (ICDE), Macao, China, 8–11 April 2019; pp. 2086–2090. [Google Scholar] [CrossRef]
- Nair, P.; Kashyap, I. Hybrid Pre-Processing Technique for Handling Imbalanced Data and Detecting Outliers for KNN Classifier. In Proceedings of the 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon), Faridabad, India, 14–16 February 2019; pp. 460–464. [Google Scholar] [CrossRef]
- Mallikharjuna Rao, K.; Saikrishna, G.; Supriya, K. Data Preprocessing Techniques: Emergence and Selection towards Machine Learning Models-a Practical Review Using HPA Dataset. Multimed. Tools Appl. 2023, 82, 37177–37196. [Google Scholar] [CrossRef]
- Habibi, A.; Delavar, M.R.; Sadeghian, M.S.; Nazari, B.; Pirasteh, S. A Hybrid of Ensemble Machine Learning Models with RFE and Boruta Wrapper-Based Algorithms for Flash Flood Susceptibility Assessment. Int. J. Appl. Earth Obs. Geoinf. 2023, 122, 103401. [Google Scholar] [CrossRef]
- Kursa, M.B. Robustness of Random Forest-Based Gene Selection Methods. BMC Bioinform. 2014, 15, 8. [Google Scholar] [CrossRef]
- Solomatine, D.P.; Shrestha, D.L. AdaBoost.RT: A Boosting Algorithm for Regression Problems. In Proceedings of the 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541), Budapest, Hungary, 25–29 July 2004; Volume 2, pp. 1163–1168. [Google Scholar] [CrossRef]
- Zhang, P.; Yang, Z. A Robust AdaBoost.RT Based Ensemble Extreme Learning Machine. Math. Probl. Eng. 2015, 2015, 260970. [Google Scholar] [CrossRef]
- Zharmagambetov, A.; Gabidolla, M.; Carreira-Perpiñán, M.Ê. Improved Boosted Regression Forests Through Non-Greedy Tree Optimization. In Proceedings of the 2021 International Joint Conference on Neural Networks (IJCNN), Shenzhen, China, 18–22 July 2021; pp. 1–8. [Google Scholar] [CrossRef]
- Hoang, N.-D.; Tran, V.-D.; Tran, X.-L. Predicting Compressive Strength of High-Performance Concrete Using Hybridization of Nature-Inspired Metaheuristic and Gradient Boosting Machine. Mathematics 2024, 12, 1267. [Google Scholar] [CrossRef]
- Hoang, N.-D. Predicting Tensile Strength of Steel Fiber-Reinforced Concrete Based on a Novel Differential Evolution-Optimized Extreme Gradient Boosting Machine. Neural Comput. Appl. 2024, 36, 22653–22676. [Google Scholar] [CrossRef]
- Borup, D.; Christensen, B.J.; Mühlbach, N.S.; Nielsen, M.S. Targeting Predictors in Random Forest Regression. Int. J. Forecast. 2023, 39, 841–868. [Google Scholar] [CrossRef]
- González, S.; García, S.; Del Ser, J.; Rokach, L.; Herrera, F. A Practical Tutorial on Bagging and Boosting Based Ensembles for Machine Learning: Algorithms, Software Tools, Performance Study, Practical Perspectives and Opportunities. Inf. Fusion 2020, 64, 205–237. [Google Scholar] [CrossRef]
- Hajihosseinlou, M.; Maghsoudi, A.; Ghezelbash, R. Stacking: A Novel Data-Driven Ensemble Machine Learning Strategy for Prediction and Mapping of Pb-Zn Prospectivity in Varcheh District, West Iran. Expert Syst. Appl. 2024, 237, 121668. [Google Scholar] [CrossRef]
- Tummala, M.; Ajith, K.; Mamidibathula, S.K.; Kenchetty, P. Driving Sustainable Water Solutions: Optimizing Water Quality Prediction Using a Stacked Hybrid Model with Gradient Boosting and Ridge Regression. In Proceedings of the 2024 3rd International Conference on Automation, Computing and Renewable Systems (ICACRS), Pudukkottai, India, 4–6 December 2024; pp. 1584–1589. [Google Scholar] [CrossRef]
- Revathy, P.; Manju Bhargavi, N.; Gunasekar, S.; Lohit, A. Voting Regressor Model for Timely Prediction of Sleep Disturbances Using NHANES Data. In ICT Systems and Sustainability; Tuba, M., Akashe, S., Joshi, A., Eds.; Springer Nature: Singapore, 2024; pp. 53–65. [Google Scholar] [CrossRef]
- Chen, S.; Luc, N.M. RRMSE Voting Regressor: A Weighting Function Based Improvement to Ensemble Regression. arXiv 2022, arXiv:2207.04837. [Google Scholar] [CrossRef]
- Battineni, G.; Sagaro, G.G.; Nalini, C.; Amenta, F.; Tayebati, S.K. Comparative Machine-Learning Approach: A Follow-Up Study on Type 2 Diabetes Predictions by Cross-Validation Methods. Machines 2019, 7, 74. [Google Scholar] [CrossRef]
- Kumkar, P.; Madan, I.; Kale, A.; Khanvilkar, O.; Khan, A. Comparison of Ensemble Methods for Real Estate Appraisal. In Proceedings of the 2018 3rd International Conference on Inventive Computation Technologies (ICICT), Coimbatore, India, 15–16 November 2018; pp. 297–300. [Google Scholar] [CrossRef]
Model | Features | MAE | MSE | RMSE | R2 | CCC | Time (seg) |
---|---|---|---|---|---|---|---|
AdaBoost | 227 | 23,630 | 1.044 × 109 | 32,320 | 0.851 | 0.918 | 1.709 |
RFE + AdaBoost | 15 | 25,390 | 1.174 × 109 | 34,260 | 0.833 | 0.904 | 0.757 |
Gradient Boosting | 227 | 14,540 | 5.643 × 108 | 23,750 | 0.920 | 0.958 | 1.756 |
RFE + Gradient Boosting | 15 | 17,450 | 6.754 × 108 | 25,990 | 0.904 | 0.947 | 1.231 |
Random Forest | 227 | 15,190 | 6.066 × 108 | 24,630 | 0.914 | 0.953 | 6.445 |
RFE + Random Forest | 15 | 17,540 | 7.455 × 108 | 27,300 | 0.894 | 0.943 | 2.992 |
Extra Trees | 227 | 15,480 | 6.574 × 108 | 25,640 | 0.906 | 0.945 | 5.409 |
RFE + Extra Trees | 15 | 16,760 | 6.783 × 108 | 26,040 | 0.904 | 0.946 | 1.100 |
Bagging | 227 | 15,130 | 6.391 × 108 | 25,280 | 0.909 | 0.950 | 1.964 |
RFE + Bagging | 15 | 17,430 | 7.575 × 108 | 27,520 | 0.892 | 0.939 | 0.761 |
Stacking | 227 | 14,090 | 5.338 × 108 | 23,100 | 0.924 | 0.960 | 67.231 |
RFE + Stacking | 15 | 16,510 | 6.473 × 108 | 25,440 | 0.908 | 0.951 | 20.946 |
Voting | 227 | 15,720 | 5.971 × 108 | 24,440 | 0.915 | 0.953 | 13.433 |
RFE + Voting | 15 | 17,510 | 6.940 × 108 | 26,340 | 0.901 | 0.944 | 4.752 |
Model | Features | MAE | MSE | RMSE | R2 | CCC | Time (seg) |
---|---|---|---|---|---|---|---|
AdaBoost | 227 | 23,630 | 1.044 × 109 | 32,320 | 0.851 | 0.918 | 1.709 |
RF + AdaBoost | 16 | 24,370 | 1.124 × 109 | 33,530 | 0.840 | 0.901 | 0.705 |
Gradient Boosting | 227 | 14,540 | 5.643 × 108 | 23,750 | 0.920 | 0.958 | 1.756 |
RF + Gradient Boosting | 16 | 17,310 | 6.725 × 108 | 25,930 | 0.904 | 0.947 | 1.261 |
Random Forest | 227 | 15,190 | 6.066 × 108 | 24,630 | 0.914 | 0.953 | 6.445 |
RF + Random Forest | 16 | 17,170 | 7.265 × 108 | 26,950 | 0.897 | 0.944 | 2.057 |
Extra Trees | 227 | 15,480 | 6.574 × 108 | 25,640 | 0.906 | 0.945 | 5.409 |
RF + Extra Trees | 16 | 16,780 | 6.684 × 108 | 25,850 | 0.905 | 0.945 | 1.109 |
Bagging | 227 | 15,130 | 6.391 × 108 | 25,280 | 0.909 | 0.950 | 1.964 |
RF + Bagging | 16 | 17,650 | 7.408 × 108 | 27,220 | 0.895 | 0.939 | 0.742 |
Stacking | 227 | 14,090 | 5.338 × 108 | 23,100 | 0.924 | 0.960 | 67.231 |
RF + Stacking | 16 | 16,590 | 6.388 × 108 | 25,280 | 0.909 | 0.951 | 22.465 |
Voting | 227 | 15,720 | 5.971 × 108 | 24,440 | 0.915 | 0.953 | 13.433 |
RF + Voting | 16 | 17,390 | 6.870 × 108 | 26,210 | 0.902 | 0.944 | 4.005 |
Model | Features | MAE | MSE | RMSE | R2 | CCC | Time (seg) |
---|---|---|---|---|---|---|---|
AdaBoost | 227 | 23,630 | 1.04 × 109 | 32,317 | 0.851 | 0.918 | 1.710 |
Boruta + AdaBoost | 16 | 23,529 | 1.04 × 109 | 32,277 | 0.852 | 0.920 | 0.560 |
Gradient Boosting | 227 | 14,537 | 5.64 × 108 | 23,755 | 0.920 | 0.958 | 1.760 |
Boruta + Gradient Boosting | 16 | 16,073 | 6.53 × 108 | 25,555 | 0.907 | 0.950 | 0.830 |
Random Forest | 227 | 15,186 | 6.07 × 108 | 24,629 | 0.914 | 0.953 | 6.440 |
Boruta + Random Forest | 16 | 15,841 | 6.86 × 108 | 26,195 | 0.902 | 0.949 | 1.750 |
Extra Trees | 227 | 15,476 | 6.57 × 108 | 25,639 | 0.907 | 0.945 | 5.410 |
Boruta + Extra Trees | 16 | 15,899 | 7.23 × 108 | 26,893 | 0.897 | 0.946 | 1.030 |
Bagging | 227 | 15,134 | 6.39 × 108 | 25,281 | 0.909 | 0.950 | 1.960 |
Boruta + Bagging | 16 | 15,576 | 6.87 × 108 | 26,204 | 0.902 | 0.946 | 1.120 |
Stacking | 227 | 14,092 | 5.34 × 108 | 23,104 | 0.924 | 0.960 | 67.230 |
Boruta + Stacking | 16 | 15,472 | 6.45 × 108 | 25,401 | 0.908 | 0.951 | 21.970 |
Voting | 227 | 15,724 | 5.97 × 108 | 24,436 | 0.915 | 0.953 | 13.430 |
Boruta + Voting | 16 | 16,228 | 6.58 × 108 | 25,658 | 0.906 | 0.949 | 3.760 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Andrade-Girón, D.C.; Marin-Rodriguez, W.J.; Zuñiga-Rojas, M.G. Intelligent Feature Selection Ensemble Model for Price Prediction in Real Estate Markets. Informatics 2025, 12, 52. https://doi.org/10.3390/informatics12020052
Andrade-Girón DC, Marin-Rodriguez WJ, Zuñiga-Rojas MG. Intelligent Feature Selection Ensemble Model for Price Prediction in Real Estate Markets. Informatics. 2025; 12(2):52. https://doi.org/10.3390/informatics12020052
Chicago/Turabian StyleAndrade-Girón, Daniel Cristóbal, William Joel Marin-Rodriguez, and Marcelo Gumercindo Zuñiga-Rojas. 2025. "Intelligent Feature Selection Ensemble Model for Price Prediction in Real Estate Markets" Informatics 12, no. 2: 52. https://doi.org/10.3390/informatics12020052
APA StyleAndrade-Girón, D. C., Marin-Rodriguez, W. J., & Zuñiga-Rojas, M. G. (2025). Intelligent Feature Selection Ensemble Model for Price Prediction in Real Estate Markets. Informatics, 12(2), 52. https://doi.org/10.3390/informatics12020052