Next Article in Journal
Evolutionary Optimization for Job Shop Scheduling with Blocking: A Genetic Algorithm Approach
Previous Article in Journal
SvelteNeck by EHConv: A Cross-Generational Lightweight Neck for Real-Time Object Detection
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

Classifying and Predicting Household Energy Consumption Using Data Analytics and Machine Learning

1
Computer Science Department, Universitat Oberta de Catalunya, 08018 Barcelona, Spain
2
CIGIP—ValgrAI, Universitat Politècnica de València, Ferrandiz-Carbonell Plaza, 03801 Alcoy, Spain
3
ICSO Analytics, Euncet Business School, Cami del Mas Rubial 1, 08225 Terrassa, Spain
*
Author to whom correspondence should be addressed.
Algorithms 2026, 19(2), 114; https://doi.org/10.3390/a19020114 (registering DOI)
Submission received: 18 December 2025 / Revised: 30 January 2026 / Accepted: 30 January 2026 / Published: 1 February 2026
(This article belongs to the Section Algorithms for Multidisciplinary Applications)

Abstract

Growing pressure on electricity grids and the increasing availability of smart meter data have intensified the need for accurate, interpretable, and scalable methods to analyze and forecast household electricity consumption. In this context, this study presents a general, data-agnostic methodology for predicting and classifying household energy consumption. The proposed workflow unifies data preparation, feature engineering, and machine learning techniques (including clustering, classification, regression, and time series forecasting) within a single interpretable pipeline that supports actionable insights. Rather than proposing new prediction algorithms, this work contributes a fully reproducible, end-to-end methodological pipeline that enables the controlled evaluation of the impact of contextual variables, customer segmentation, and cold-start conditions on household energy forecasting. A distinctive aspect of the pipeline is the explicit use of household- and dwelling-level contextual variables to derive customer typologies via clustering and to enrich forecasting models. The models are evaluated for predictive accuracy, reliability under varying conditions, and suitability for operational use. The results show that incorporating contextual variables and clustering significantly improves forecasting accuracy, particularly in cold-start scenarios where no historical consumption data are available. Although numerous public datasets of residential electricity consumption exist, they rarely provide, in an openly accessible form, both detailed load histories and rich contextual attributes, while many are subject to privacy or licensing restrictions. To ensure full reproducibility and to enable controlled experiments where contextual variables can be switched on and off, the experiments are conducted on a synthetically generated dataset that reproduces realistic behavior and seasonal usage patterns. However, the proposed methodology is independent of the specific data source and can be directly applied to any real or synthetic dataset with similar structure. The approach enables applications such as short- and long-term demand forecasting, estimation of household energy costs, and forecasting demand for new customers. These findings demonstrate that the proposed pipeline provides a transparent and effective framework for end-to-end analysis of household electricity consumption.
Keywords: machine learning; household energy consumption; clustering; forecasting machine learning; household energy consumption; clustering; forecasting

Share and Cite

MDPI and ACS Style

Cordon, D.; Pita, A.; Juan, A.A. Classifying and Predicting Household Energy Consumption Using Data Analytics and Machine Learning. Algorithms 2026, 19, 114. https://doi.org/10.3390/a19020114

AMA Style

Cordon D, Pita A, Juan AA. Classifying and Predicting Household Energy Consumption Using Data Analytics and Machine Learning. Algorithms. 2026; 19(2):114. https://doi.org/10.3390/a19020114

Chicago/Turabian Style

Cordon, David, Antonio Pita, and Angel A. Juan. 2026. "Classifying and Predicting Household Energy Consumption Using Data Analytics and Machine Learning" Algorithms 19, no. 2: 114. https://doi.org/10.3390/a19020114

APA Style

Cordon, D., Pita, A., & Juan, A. A. (2026). Classifying and Predicting Household Energy Consumption Using Data Analytics and Machine Learning. Algorithms, 19(2), 114. https://doi.org/10.3390/a19020114

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop