Next Article in Journal
Magnetic Integrated Multi-Trap Filters Using Mutual Inductance to Mitigate Current Harmonics in Grid-Connected Power Electronics Converters
Previous Article in Journal
Restoration Strategy for Urban Power Distribution Systems Considering Coupling with Transportation Networks Under Heavy Rainstorm Disasters
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Geospatial Forecasting of Electric Energy in Distribution Systems Using Segmentation and Machine Learning with Convolutional Methods

1
Faculty of Electrical and Electronic Engineering, National University of Engineering, Lima 15333, Peru
2
Department of Electrical Engineering, Federal University of Paraíba, João Pessoa 58051-900, PB, Brazil
*
Author to whom correspondence should be addressed.
Energies 2025, 18(2), 424; https://doi.org/10.3390/en18020424
Submission received: 7 December 2024 / Revised: 13 January 2025 / Accepted: 15 January 2025 / Published: 19 January 2025
(This article belongs to the Section F2: Distributed Energy System)

Abstract

:
This paper proposes an innovative methodology for geospatial forecasting of electrical demand across various consumption segments and scales, integrating machine learning and discrete convolution within the framework of global system projections. The study was conducted in two phases: first, machine learning techniques were utilized to classify and determine the relative growth of segments with similar consumption patterns. In the second phase, convolution methods were employed to produce accurate spatial forecasts by incorporating the influence of neighboring areas through a “core matrix” and accounting for geographical constraints in regions with and without consumption. The proposed approach enhances the precision of spatial forecasts, making it suitable for large-scale distribution systems and implementable within short timeframes. The proposed method was validated using data from a Peruvian distribution system serving over one million users, employing 204 historical records and analyzing three georeferenced consumption segments at scales of 1:10,000, 1:1000, and 1:100. The results demonstrate its effectiveness in forecasting across different time horizons, thereby contributing to improved planning of electrical infrastructure.

1. Introduction

The forecast of electrical demand is a crucial input in the planning of the electric industry, playing a significant role in the decision-making process of market agents, and is closely linked to economic, social, and environmental development [1,2]. It is essential for sustainability to optimize the use of energy resources aligned with renewable energy sources and the expansion of infrastructure over short, medium, and long-term horizons, while ensuring a stable supply of electricity and reducing carbon emissions [3,4]. This requires accurate predictions of demand, both in terms of magnitude and geographic locations, over different periods of the planning horizon.
Electric distribution companies are responsible for acknowledging their customers’ needs and supplying energy efficiently while maintaining the electrical grid at an acceptable level of quality and reliability. These networks exhibit a high degree of variation in topology and are continuously expanding due to urban growth and the expansion of electrical service coverage areas [5,6].
Methods for projecting electrical demand are studied in terms of energy (kWh) and power (kW), considering short-, medium-, and long-term time horizons to meet the operational, planning, and other needs within electrical systems. These methods employ statistical techniques, artificial intelligence, and spatial micro-area analysis. Statistical methods explore models such as regression, multiple regression, exponential smoothing, iterative reweighted least squares, adaptive load forecasting, stochastic time series (autoregressive), ARMA, ARIMA, SARIMA, and Prophet models. On the other hand, artificial intelligence methods investigate neural networks, support vector machines, genetic algorithms, machine learning, wavelet neural networks, fuzzy logic methods, and expert systems [7,8,9,10,11,12]. Most methods focus on projections for the serviced area, assuming uniform growth across the region; however, in practice, growth is heterogeneous at different levels of granularity, and over the years new areas become serviced.
Various methodologies have been proposed for spatial prediction in electrical load forecasting. One notable method, introduced by [13], integrates current trends, a clustering algorithm, and a Geographic Information System (GIS) to perform spatial load forecasting. This approach utilizes fuzzy reasoning and fuzzy clustering algorithms to generate spatial load forecasts.
In the work of [14], a fuzzy inference system based on adaptive networks is applied to predict electrical loads across four major regions of Taiwan. Another significant contribution, presented in [15], employs cellular automata—a discrete-time model composed of cells that evolve according to predefined rules. This model emphasizes the significance of developed and redeveloped areas in spatial load forecasts. By leveraging data mining techniques, classification rules are extracted from a dataset with a spatial resolution of 0.5 km2. The cellular automata approach assumes continuous load growth, where growth progresses incrementally, and undeveloped areas become developed when surrounded by at least three developed neighbors. This methodology builds on earlier research by the same group [16,17].
The dynamics of urban areas were analyzed using a multi-agent system in [18], focusing on the distribution of electrical loads across city subzones. This system incorporates static, mobile, and proactive agents, where mobile agents navigate the city, redistributing loads along their paths, and proactive agents represent new, non-natural loads introduced into the system. A spatial load forecasting method based on cluster analysis was proposed in [19], grouping cells by evaluating their results and relative prediction errors from multiple forecasting models. To predict energy consumption in multi-family residential buildings, support vector regression was employed in [20], leveraging on-site measurements for model development. Load monitoring for operational purposes was addressed in [21]. Furthermore, knowledge discovery in databases was utilized in [22] to extract preferential scores for land use changes in spatial load forecasting, while rough set theory was applied in [23]. Aerial imagery was also integrated into spatial load forecasting in [24] to enhance accuracy.
Spatiotemporal forecasting of energy demand plays a pivotal role in conducting detailed analyses in micro-areas and facilitating the efficient development of the electrical grid over various time horizons. This procedure enables a tailored adaptation to customer requirements while supporting strategic planning for future electrification zones. The availability of accurate geolocated demand data, both current and projected, is indispensable, particularly when addressing specific constraints such as capacity saturation and land use limitations. Forecasting at the micro-area level, while ensuring consistent geographical boundaries, provides a granular and precise representation of spatial consumption patterns. By leveraging spatial information, it becomes possible to achieve more accurate data aggregation, thereby minimizing distortions in system-wide demand forecasts.
The integration of machine learning and convolution methods applied to large-scale geospatial forecasting of electrical energy yields efficient results for use in the planning of medium- and long-term electrical distribution systems. Its computational implementation is highly effective for processing large volumes of data. The proposed algorithm facilitates the incorporation of the additive effects of major consumption centers in their surrounding areas through a “2N − 1” order kernel matrix. It also incorporates relative growths, geographic constraints, and saturation of consumption (S-curve), while aligning with the global forecast.
The proposed method for addressing the challenge of geospatial forecasting of electric energy in distribution systems offers several significant contributions:
  • Integration of machine learning and convolution techniques: this study demonstrates the effectiveness of integrating machine learning algorithms with convolution methods for processing large datasets and conducting large-scale geospatial energy demand forecasts. The approach not only aligns with global forecasting results but also facilitates the identification and planning of new electrification areas, optimizing the management of electric distribution networks.
  • Segmentation of the distribution service area: the proposed segmentation method divides the service area into grids, enabling precise identification of critical network components for analysis. This scalable approach adapts to the requirements of both sub-transmission and distribution systems. The phased segmentation model enhances network analysis, supporting more efficient and detailed system management.
  • Adaptability of geospatial analysis: the geospatial analysis framework is adaptable to various global growth scenarios, integrating economic, social, and demographic factors for high accuracy. It allows spatial representation of optimistic and pessimistic growth projections over medium- and long-term horizons, adding strategic value to infrastructure planning and improving responsiveness to diverse demand scenarios.
  • Geospatial forecasting of energy demand: accurate forecasting of medium- and long-term energy demand provides critical insights for planning the expansion of distribution and sub-transmission networks. This includes addressing areas without electrical coverage and those requiring electrification due to urban growth. The results support the development of new forecasting methods tailored to the characteristics of specific network components, improving overall energy planning strategies.
  • Enhancements to the regulatory and methodological framework: the proposed methodology advances the regulatory framework for planning electrical transmission and distribution systems over short-, medium-, and long-term horizons. It introduces a systematic approach that electrical distribution companies can implement to enhance the precision and effectiveness of their expansion plans, contributing to both current practices and future procedural developments.
The structure of this work is organized as follows. Section 2 presents the theoretical foundation and introduces the key concepts necessary for understanding the paper. Section 3 describes the proposed method, detailing the step-by-step procedure and including a simplified example to enhance comprehension. Section 4 validates the methodology through its application to a real-world case study involving the “Luz del Sur” distribution system in Lima, Peru. Finally, Section 5 concludes the work by summarizing the main findings and discussing their implications.

2. Background Work

Developing prediction models for small areas effectively requires a focused analysis on specific elements of the electrical network, such as power substations, feeders, or distribution substations. These elements are analyzed based on the regions they serve, as illustrated in Figure 1a for power substations [25]. However, this approach faces significant challenges due to the intricate geometries of the resulting areas and their temporal variability, which arise from the dynamic evolution of the electrical grid.
An alternative approach involves using square cells overlaid on a grid map, as depicted in Figure 1b. This method is readily implemented using databases or matrix arrays, with cell sizes adaptable to the scale of the network element under study. For instance, larger cells are suited for transmission facilities, while smaller cells are preferable for distribution facilities.
For geospatial forecasting, it is crucial to account for various factors, including the physical description of the electrical network, historical records of georeferenced customer consumption, and geographic data such as streets, mountains, lakes, rivers, roads, and archaeological sites, among others [26]. Additionally, global forecast results at the total system level must be considered. These factors can be effectively addressed by incorporating machine learning methods and convolution techniques, enabling a more comprehensive and accurate analysis.

2.1. Parameters for Geospatial Analysis

The proposed geospatial analysis at the grid level incorporates the following parameters: demand coverage factors, geographic restriction factors, and the relative influence of neighboring areas with and without electrical load. These parameters provide a comprehensive framework for analyzing spatial interactions and constraints within the grid, as outlined in [27].

2.1.1. Coverage Factor (C)

Electrical grids consist of areas designated for consumption, such as residential, commercial, and industrial zones, as well as areas where no consumption is expected, including archaeological sites, parks, and other protected or non-inhabitable regions. Figure 2a illustrates various scenarios: grids with a coverage factor of zero, corresponding to uninhabitable terrains such as hills where no consumption is anticipated; grids with a factor of one, representing fully occupied areas dedicated entirely to consumption; and mixed-use grids with coverage factors less than one, indicating partial occupation or a combination of consumptive and non-consumptive areas.
The coverage factor was determined using the GIS database provided by the distribution company through the following steps. First, the GIS database was analyzed to extract cadastral shapes representing the entire area under the distributor’s coverage. Subsequently, shapes corresponding to the grids of segments S1, S2, and S3 were generated. An overlay operation was then performed between the cadastral shapes and the specified grids to identify polygons within each grid and calculate their respective areas. Finally, the coverage factor for each grid was determined by dividing the total area of the polygons by the area of the corresponding grid, as illustrated in Figure 2b for four cases of segment S1 (100 m side grids).

2.1.2. Constraints (R)

Grids encompassing areas where energy consumption is not anticipated—such as archaeological sites, water bodies (e.g., lakes and rivers), parks, hills, and other similar regions—are assigned a factor of 0. In contrast, grids representing areas with expected energy consumption are assigned a factor of 1. This classification can be specified manually by the user or automatically extracted from a Geographic Information System (GIS), as illustrated in Figure 3.

2.1.3. Attraction Poles (E.L)

Developed regions exert a significant underlying force that drives growth in surrounding areas. Populations tend to settle as close as possible to zones offering enhanced economic opportunities, access to services, quality of life, education, modernization, and technological advancements. This phenomenon, observable at various scales, is frequently modeled using gravity-center frameworks, also referred to as growth pole models. These models typically assume that a pole’s influence diminishes inversely with distance, with the strongest impact occurring in adjacent areas.
Attraction poles, such as city centers, airports, industrial parks, and similar entities, represent focal points of growth. In large metropolitan areas, multiple attraction poles often coexist and can be modeled at varying scales. In the proposal, the influence of the poles of attraction is represented mathematically through the relative growth (E.L) of the grids in segments S1, S2, and S3, as illustrated in Figure 4 for segment S1.

2.1.4. Convolution Kernel (M)

The convolution kernel is represented mathematically by a matrix of order [ 2 N 1 ] . The matrix illustrates the influence of the neighboring grids on the growth in the center, including its own.
M = 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1
For the spatial analysis, the kernels with N = 1, 2, 3, and 4 are analyzed and the metrics R2, MAE, RMSE, MAPE, and WAPE are evaluated to assess their efficiency in forecasting.

2.2. Machine Learning Methods

2.2.1. Clustering

Before applying the clustering algorithm, it is imperative to preprocess and normalize the data to ensure meaningful and robust results. Preprocessing and normalization techniques standardize the data, reduce noise and outliers, and eliminate irrelevant or redundant features. In this research, data preprocessing involved cleaning the dataset using an outlier detection and removal method, selecting the most relevant features for clustering based on the correlation coefficient method, and applying Z-score normalization to transform the features into a consistent scale.
The clustering algorithm used is K-means, recognized for its accuracy and efficiency [28]. The primary mechanism of K-means employs the Euclidean distance as a similarity measure, assigning data points to the nearest centroid by minimizing the distance within clusters. Additionally, the algorithm identifies discrete outliers and groups them into distinct clusters. The objective function for K-means is represented as:
min S E ( u i ) = min S i = 1 k Z c j S i Z c j u i 2
Let S denote the dataset, where each element ( Z c j ) corresponds to the historical trends of the electrical grids. The dataset is partitioned into k clusters, with each cluster characterized by a centroid ( μ i ). To identify the optimal number of clusters, the elbow method and silhouette analysis are applied, ensuring a robust and reliable clustering configuration.

2.2.2. Forecast

The Prophet method is proposed to determine the relative growth of the grids. Prophet is designed to automatically detect seasonality and trends while offering user-friendly parameters without requiring extensive expertise in time series analysis. Prophet implements an additive model with the following components:
E ( t ) = g ( t ) + s ( t ) + h ( t ) + ε ( t )
where:
  • E ( t ) represents energy consumption;
  • g ( t ) is the trend function modeling non-periodic changes in the mean value of the time series, describing long-term behavior;
  • s ( t ) is the seasonal component representing periodic changes of known periods (e.g., monthly), modeled using the Fourier series;
  • h ( t ) accounts for the effect of irregular events impacting the series (e.g., COVID-19);
  • ε ( t ) represents the random component or error term, capturing any changes not accounted for by the other components.

2.3. Convolution

Given a matrix A R n × m and a matrix M R 2 n + 1 × 2 m + 1 , the convolution of matrices A and M is defined as a new matrix B = A M , formulated by the expression:
B i , j = i = i ̲ i ¯ j = j ̲ j ¯ A i , j M i i + n + 1 , j j + m + 1
where i and j are index elements surrounding A i , j , bounded by the size of A as follows:
i ̲ = max { 1 , i n } , i ¯ = min { n , i + n }
j ̲ = max { 1 , j m } , j ¯ = min { m , j + m }
The matrix M is referred to as the kernel of the convolution. A simple example is illustrated in Figure 5a.

Weighted Convolution

To improve the spatial analysis, especially for areas with restrictions equal to 0 and future areas without energy consumption, it is proposed to incorporate a weighting factor, 1 / k , to the above equation. In the example in Figure 5b, the weighting factor is 1/8.
B i , j = 1 k i = i ̲ i ¯ j = j ̲ j ¯ A i , j M ( i i + n + 1 ) , ( j j + m + 1 )
where:
k = k i , j ( k i , j = 0 if A i , j = 0 and k i , j = 1 if A i , j > 0 )
This factor allows for greater emphasis on growth areas and aligns with the actual development patterns of cities.

3. Proposed Method

For the analysis of electricity forecasting in distribution systems, a geospatial demand forecasting method is proposed. This method employs a segmentation process that considers grid maps from the largest to the smallest scale, integrating machine learning and convolution methods. It is designed for application in large-scale systems and accounts for the overall growth of the system across short-, medium-, and long-term horizons. The proposed approach facilitates the processing of large-scale data and their integration with the global forecast, where the cell length at each stage corresponds to the elements of the electrical network. A simplified schematic representation of this method is presented in Figure 6.
1. 
Global forecasting: this refers to estimates made by the distributor at the total system level and serves as a reference for geospatial analysis. Over the years, total system-level forecasting studies have evolved significantly, with abundant literature highlighting diverse approaches. Each author emphasizes the importance of explanatory variables, their selection, and their application through statistical or artificial intelligence methods. This research does not aim to analyze or discuss these methods but focuses on geospatial forecasting.
2. 
Supply database: contains information on the consumption of all clients and their geographical location. This database will be used to determine consumption within the defined grids.
3. 
Geographic Information System (GIS): includes the development of the distribution network and the territorial cadastre within the area served by the distributor. This will form the basis for determining coverage factors and consumption constraints within the grids.
4. 
Global forecast estimate: using the data from point 2, the global forecast can be obtained using the Prophet method. For spatial analysis, a choice must be made between this result and the data from point 1. In this study, it will serve as an input variable. In the proposed geospatial analysis, the results of segment S 1 must be adjusted using the global forecast ( N = 1 ). Similarly, the results of segments S 2 and S 3 must be adjusted using the outcomes of segments S 1 and S 2 , respectively (see Figure 7).
5. 
Definition of number of segments (NS) and scales: the analysis considers different geospatial scales. Smaller grid sizes (e.g., 100 m, 50 m) often result in historical energy consumption data with low correlations, limiting the applicability of existing methods in the literature.
6. 
Segmentation process: this step involves determining the historical consumption of grids at various scales and preparing the database for clustering and relative growth estimation. Additionally, it includes the necessary matrix arrangements for geospatial analysis. This article proposes a three-stage segmentation approach: segment S1 with a scale of 1:10,000, segment S2 with a scale of 1:1000, and segment S3 with a scale of 1:100. To achieve this, the locations of energy supplies are mapped to the corresponding grids for each segment, using their geographic coordinates: S1 with a resolution of 10 4 m (C10,000: UTM/10,000), S2 with a resolution of 10 3 m (C1000: UTM/1000), and S3 with a resolution of 10 2 m (C100: UTM/100). The historical energy consumption within each grid (Zc) is calculated by summing the consumption of all supplies located within it, providing the foundation for clustering and forecasting processes.
7. 
Clustering process: conducted for each segment using the K-means algorithm to determine the optimal number of clusters.
Before applying the algorithm, detecting and removing outliers is crucial, evaluating features through the correlation coefficient, and performing normalization. The elbow and silhouette methods were utilized for optimal clustering of areas with complete historical data due to their simplicity, computational efficiency, and ease of interpretation. For regions with incomplete historical data, a specific group previously identified was assigned through a training process and confidence intervals. Compared with other evaluated methods, DBSCAN was not practical due to its computational inefficiency and the lack of consistent clustering. Additionally, the processing time for Gaussian Mixture Models was significantly longer, making their use less efficient. On the other hand, employing more complex algorithms such as Random Forest or neural networks did not offer significant benefits at this stage due to the increased computational load.
8. 
Relative growth estimation: for each cluster of grids resulting from the previous process, relative growth is estimated using the Prophet method.
Prophet was selected due to its outstanding balance between accuracy and simplicity in time series analysis, particularly in contexts similar to electrical demand forecasting. It is intuitive and robust against missing data, easily adjustable, and effective when data exhibit clear trends and seasonal effects. Prophet replaces more sophisticated models such as LSTM, which requires a larger data set, more tuning time, and careful hyperparameter configuration, or ARIMA and SARIMA models, which face limitations in handling series with multiple seasonalities or dynamic trends. In this study, the comparison of reported errors confirms that Prophet achieves competitive performance, and its selection is justified by the problem’s characteristics and the available resources.
9. 
Geospatial analysis: geospatial analysis involves the application of the convolution algorithm and its subsequent normalization or adjustment based on the results of the global forecast (EG). The fundamental concept behind the convolution of relative growth ( T . E ) with attraction poles (M) is to determine a reference growth (G) for geospatial forecasting, prior to normalization. Since the convolution process includes grids with restricted areas (R) and areas that are partially or fully covered (C), these parameters must be incorporated into the determination of G. The proposed formulation is expressed in the following equation:
G = C · R · ( T . E ) M
Subsequently, this reference growth is normalized to ensure that the globally forecasted growth ( Δ E G ) matches the total growth of the grids within each segment. The mathematical formulation used for normalization is expressed as follows:
E = E + Δ E G i j G i , j · G
where Δ E represents the growth matrix for the grids in segments S 1 , S 2 , and S 3 . Specifically:
Δ E = E E = i S j S E i , j E i , j
New consumption grids are identified when the forecast exceeds the historical average obtained based on the statistics of the distribution company.
10. 
Final outputs: the results of geospatial forecasting at the grid level and for each segment.

3.1. Simple Application

The initial consumption of an area composed of 12 grids, represented by the matrix E 3 × 4 , is 3000 MWh, and the details are as follows:
E = 450 300 0 150 600 300 300 0 300 450 150 0 MWh
A global growth of 50% is anticipated over a time horizon.

3.1.1. Spatial Forecasting with Global Growth Rate (EG)

Considering this growth, the projected global consumption would be 4500 MWh ( Δ E = 1500 MWh). In the scenario of scarce historical information at grid level, the resulting growth would be homogeneous, as shown below:
E = 675 450 0 225 900 450 450 0 450 675 2250 0 MWh

3.1.2. Spatial Forecasting with Relative Growth Rate (T)

In a real scenario, the growth in the grids is heterogeneous. In this context, for a scenario of relative growth T:
T = 80 % 80 % 0 % 80 % 60 % 60 % 60 % 0 % 40 % 40 % 40 % 0 %
The total projected consumption would be 4800 MWh. This result is 10% more than that obtained considering global growth ( Δ E r = 1800 MWh).
T · E = 360 240 0 120 360 180 180 0 120 180 60 0 MWh
Therefore, it is necessary to make the necessary adjustments to normalize it with the global forecast:
E = E + Δ E Δ E r ( T · E ) = 750 500 0 250 900 450 450 0 400 600 200 0 MWh

3.1.3. Geospatial Forecasting Using Convolution

Additionally, in a real scenario, new areas are served due to horizontal growth. In the example, these areas include the grids E ( 1 , 3 ) , E ( 2 , 4 ) , and E ( 3 , 4 ) . To solve this problem, the following parameters are considered in the example: M, C, and R.
M = 1 1 1 1 1 1 1 1 1
C = 100 % 100 % 80 % 80 % 100 % 80 % 90 % 90 % 80 % 80 % 90 % 0 %
R = 1 1 1 1 0 1 1 1 1 1 1 0
where R ( 0 , 1 ) represents an area that is saturated and not growing, and R ( 3 , 4 ) corresponds to an area occupied by a river.

Convolution

( T · E ) M = 285 264 180 150 240 210 160 120 210 180 150 120 MWh

Reference Growth ( W )

W = C · R · [ ( T · E ) M ] = 285 264 144 120 0 168 144 108 168 144 135 0 MWh
In total, this amounts to 1680 MWh, which represents a 6% increase over the global forecast.

Final Adjustment

Finally, an adjustment is made to these results to reach the total forecast:
E = E + Δ E i j R i , j · W = 704 536 129 257 600 450 429 96 450 579 270 0 MWh

3.2. Error Measurement

The spatial forecast error is measured using the following metrics: Coefficient of determination (R2), Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), Mean Absolute Percentage Error (MAPE), and Weighted Absolute Percentage Error (WAPE). These metrics are defined as follows:
ε ^ R 2 = 1 i = 1 n j = 1 m ( e i j ) 2 i = 1 n j = 1 m ( E i j E ^ ) 2
ε ^ MAE = 1 m n i = 1 n j = 1 m | e i j |
ε ^ MSE = 1 m n i = 1 n j = 1 m ( e i j ) 2
ε ^ RMSE = 1 m n i = 1 n j = 1 m ( e i j ) 2
ε ^ MAPE = 1 m n i = 1 n j = 1 m | % e i j |
ε ^ WAPE = i = 1 n j = 1 m | % e i j | i = 1 n j = 1 m E i j
where:
  • e i j denotes the error value at grid cell ( i , j ) ;
  • E i j denotes the actual value at grid cell ( i , j ) ;
  • E ^ represents the average value of the grid cells;
  • n and m are the dimensions of the spatial grid.
These metrics provide valuable insights into the accuracy and reliability of the spatial forecasting model, helping to assess its performance effectively.

4. Validation

In this section, the proposed method is validated in a Peruvian electrical distribution system, whose served area is illustrated in Figure 8. For validation, historical data from the period 2007–2017 are considered for training the models and the period 2018–2023 is projected, comparing the results obtained with the actual values recorded in the same period.
The number of users and historical consumption considered in the analysis is shown in Table 1.
The data in Table 1 show the impact on energy consumption in 2020 and 2021 due to the pandemic caused by COVID-19 and its recovery in 2022 and 2023. For validation, the real global growth recorded year-on-year in the period 2018–2023 is considered.

4.1. Segmentation and Clustering

Table 2 highlights two groups within segments S1, S2, and S3: grids with energy consumption records for the entire period from 2007 to 2023 (complete data) and grids that began recording consumption after 2007 (incomplete data).
The data in Table 2 reveal that, within segment S1, consumption was recorded in a new grid starting in 2009. In contrast, segments S2 and S3 incorporated 173 and 8319 new grids, respectively, during the period 2008–2023. Figure 9 highlights the new areas in segments S1, S2, and S3 in blue. In these grids, the initial consumption recorded represents on average 1% of the consumption in the surrounding areas.
The analysis results of the historical consumption behavior across the grids for the period 2007–2017 are presented in Table 3. This table categorizes the grids by segment, identifying those with complete and incomplete data. Additionally, it lists the number of outliers and specifies the sample size to be considered for clustering purposes.
The results of the clustering of grids in segments S1, S2, and S3 are shown in Table 4.

4.2. Parameters for Geospatial Analysis in Segment S1

The following tables present the base spatial energy (E), consumption groups, annual relative growth rate (T), coverage factors (C) and restrictions (R) for segment S1. A similar procedure is applied for segments S2 and S3.
The spatial analysis to forecast consumption in 2018–2023 is carried out sequentially each year. Table 5 contains the energy for the base year recorded in 2017 and the consumption groups obtained for segment S1. In this context, X and Y represent the UTM Easting and Northing coordinates, respectively, scaled to 1:10,000.
Regarding the last column labeled “36”, it is suggested to remove or eliminate it across all tables, specifically Tables 5, 7, 9 and 10.
Table 6 shows the relative growth rates of the consumption groups of segment S1 obtained for the period 2018–2023.
Table 7 presents the coverage and restriction factors obtained for the grids in segment S1. The tables emphasize the factors related to new areas that should be considered for horizontal growth. The areas highlighted in blue indicate regions where future energy consumption is anticipated. Conversely, in the remaining areas, future energy consumption would not be feasible, resulting in a restriction factor of zero. Regarding the coverage factors, an estimate of 2.0% has been assigned, as the majority of these areas are covered by rocky mountains.
Table 8 shows the energy consumption in the grids for the training and validation period.
Finally, for the spatial analysis, the kernels with N = 1, 2, 3, and 4 are analyzed and the metrics R 2 , MAE, RMSE, MAPE, and WAPE are evaluated to assess their efficiency in forecasting.

4.3. Geospatial Forecast Results in Segment S1

The analysis was conducted on a year-by-year basis using the proposed formulation through the convolution algorithm, covering the period from 2018 to 2023. Table 9 presents the geospatial forecast for 2018 within segment S1, utilizing the 3 × 3 core matrix. The table highlights the distribution of energy demand across geospatial coordinates, represented by rows (Y) and columns (X). The results reveal concentrated demand in specific grid cells, particularly along coordinates Y = 866 and Y = 867 , with peaks of 2949.2 GWh and 605.8 GWh, respectively. These high-demand regions correspond to urban or industrial zones, where consumption is typically more intense. In contrast, cells with minimal or zero values indicate sparsely populated or low-consumption areas. The total energy demand for the segment in 2018 amounts to 8910.3 GWh, reflecting the spatial variability inherent in the segment.
The geospatial forecast for 2023, presented in Table 10, provides a detailed distribution of energy demand across a 3 × 3 core matrix. The forecast highlights significant variability in energy consumption, with notable concentrations in specific geographic zones. The highest energy demand is observed in cell (866, 28), with a value of 3052.5 GWh, followed by cell (866, 27) at 1014.5 GWh and cell (865, 28) at 1762.4 GWh, indicating densely populated or high-consumption areas. Conversely, the majority of cells show negligible or zero consumption, reflecting sparsely populated or low-demand zones. This spatial distribution underscores the algorithm’s ability to identify hotspots of energy usage and allocate resources accordingly. The total projected demand for the region amounts to 9394.2 GWh, providing a comprehensive basis for infrastructure planning and optimization. These results demonstrate the utility of the geospatial forecasting methodology in capturing localized consumption patterns critical for effective energy distribution.
Figure 10 highlights, in blue, the new grid areas that recorded electricity consumption during the 2018–2023 period. Specifically, Figure 10a, Figure 10b, and Figure 10c provide details for segments S1, S2, and S3, respectively. In real-world scenarios, such as those illustrated in Figure 10b,c, smaller-scale segments reveal new consumption areas located far from urban zones (isolated regions). These areas are typically associated with industrial loads or regions that previously lacked electrical coverage, particularly in rural distribution systems.
Figure 11 presents the geospatial forecasting results for 2018–2023, depicted in red. Specifically, Figure 11a, Figure 11b, and Figure 11c illustrate the forecasts corresponding to segments S1, S2, and S3, respectively. In Figure 11b,c, it is evident that isolated areas from the real-case scenario were not accurately identified by the convolutional kernels analyzed, which included sizes of 3 × 3, 5 × 5, 7 × 7, and 9 × 9. For these cases, particularly in smaller-scale segments, it is recommended to explore the use of higher-order kernels to enhance their range and improve forecasting accuracy.

4.4. Calculation of Metrics

The metrics were determined year-on-year and for the analyzed projection scenarios as follows:
  • Spatial forecast considering global growth rates (global).
  • Spatial forecast considering growth rates by groups in segments S1, S2, and S3 (zonal).
  • Spatial forecasting using standard convolution (spatial [27]).
Additionally, convolution kernels of sizes 3 × 3 , 5 × 5 , 7 × 7 , and 9 × 9 were incorporated into the proposed methodology. In the measurement tables of the metrics MAE, RMSE, MAPE, and WAPE, the values with the lowest magnitude for each year are highlighted in blue for clarity and emphasis.
The results of the metrics obtained in segment S1 are summarized in Table 11.
The data analysis presented in Table 11 highlights the comparative performance of different methodologies and grid sizes regarding the metrics MAE, RMSE, MAPE, and WAPE. The proposed methodology consistently demonstrates superior accuracy compared with the global, zonal, and spatial methods across all metrics and years.
In terms of MAE and RMSE, the proposed 9 × 9 grid configuration achieves the lowest error values across all years. For instance, in 2018, the 9 × 9 grid records an MAE of 3.5 GWh and an RMSE of 6.6 GWh, outperforming other grid configurations such as 3 × 3 , 5 × 5 , and 7 × 7 . Similar trends are observed in subsequent years, where the 9 × 9 grid consistently yields the smallest error margins, demonstrating its effectiveness in capturing demand variations with high precision.
The MAPE values further corroborate the effectiveness of the proposed 9 × 9 grid. It achieves the lowest percentages in multiple years, particularly in 2019 (12.3%) and 2020 (18.8%). These results highlight the ability of the 9 × 9 configuration to minimize relative prediction errors, making it a robust choice for accurate energy demand forecasting.
The MAPE is an indicator that is frequently used in demand forecasting because it is easily interpretable. However, in spatial analysis it can be biased when there are divisions by zero, outliers, or due to the scale of the actual values. For example, if a grid with a consumption of 1000 GWh has an error of 1% (10 GWh) and another with a consumption of 10 GWh has an error of 10% (1 GWh), the MAPE of the set would be 5.5%.
The WAPE that weights the errors by the volume of demand is a highly recommended measure since it minimizes the impact of grids with very variable demands. In the example above, the WAPE of the set would be 1.09% = [(10 + 1)/(1000 + 10)], which is more reasonable on a global scale.
The WAPE metric also underlines the advantages of the 9 × 9 grid. It achieves the smallest WAPE percentages, notably 1.6% in 2018 and 6.7% in 2023, indicating its reliability in maintaining low-weighted aggregate prediction errors. Overall, the 9 × 9 grid consistently outperforms other configurations, making it the most effective choice within the proposed methodology.
Finally, Table 12 presents the average values of the MAPE and WAPE metrics obtained for segments S1, S2, and S3 during the periods 2018–2019 and 2018–2023, which include the years affected by the COVID-19 pandemic. The scenario with the lowest metrics is highlighted in blue. The table compares results obtained using standard convolution with a 3 × 3 kernel [27] and weighted convolution for kernels of sizes 3 × 3, 5 × 5, 7 × 7, and 9 × 9. The Global and Zonal scenarios are not included, as these do not forecast new consumption areas, such as those shown in Figure 11 (marked in red).
The metrics MAPE and WAPE obtained through weighted convolution outperform those obtained through standard convolution, except for the estimate for the period 2018–2023 in segment S3. In segment S1, the proposed method reduces the MAPE by up to 7.7% and the WAPE by up to 1.3% during the period 2018–2019. For the period 2018–2023, the MAPE is reduced by up to 5.6% and the WAPE by up to 1.0%.
Similarly, the MAPE and WAPE metrics for segment S2 increased by over 3% during the period 2018–2019 and by over 5% during 2018–2023. In segment S3, these metrics increased by more than 5% during 2018–2019 and by over 10% during 2018–2023.

5. Long-Term Forecast

This section presents a proposal for spatial energy forecasting over a 20-year horizon, extending to 2043. The results of the global forecast, generated using the Prophet method, are summarized in Table 13. The forecast indicates a steady annual increase in energy consumption, starting at 9659 GWh in 2024 and reaching 14,750 GWh by 2043. The annual growth rates (T.C.) exhibit a gradual decline over the forecast period, decreasing from 2.9% in 2024 to 1.9% in 2043. As previously indicated, this article aims to explore geospatial forecasting in alignment with the global growth of the system. In this context, the results produced by the Prophet model are analyzed; however, in practical applications, the input data would likely be derived from more advanced and efficient methods or techniques. Additionally, alternative scenarios, both pessimistic and optimistic, may also be considered for a more comprehensive analysis.
The relative growth of consumption groups within segment S1 is presented in Table 14. This table provides a detailed overview of the annual growth rates for seven consumption groups from 2024 to 2043. The analysis reveals distinct trends, with Group 1 consistently exhibiting positive growth, albeit at a declining rate, starting from 4.9% in 2024 and tapering to 2.2% in 2043. In contrast, Group 2 shows minimal or negative growth throughout the period, with rates fluctuating between 0.1% and −1.0%, highlighting challenges in this segment. Groups 3, 4, and 5 maintain moderate but steady growth rates, with notable peaks in the earlier years, such as 4.8%, 7.5%, and 6.8%, respectively, in 2024, before gradually decreasing over time. Groups 6 and 7 demonstrate lower growth rates overall, with values generally remaining below 3.6% and 5.8% in the initial years and tapering to less than 2.0% by 2043. These trends suggest a general deceleration in growth across most groups, emphasizing the need for targeted strategies to address consumption dynamics, particularly for underperforming groups such as Group 2.
The remaining parameters are held constant for the geospatial analysis.
The spatial forecast presented in Table 15 illustrates the projected energy demand within segment S1 using a 3 × 3 convolution matrix for a 20-year horizon. The data reveal significant variations in energy consumption across different spatial coordinates, with high-demand zones concentrated in specific areas while others exhibit negligible or no demand. This variation emphasizes the importance of geospatial forecasting in identifying critical zones for targeted infrastructure development and efficient resource allocation.
A detailed examination of Table 15 highlights that rows 866 and 865, particularly in columns 28 and 29, represent areas of peak energy demand, with values reaching 3416.7 GWh and 3772.2 GWh, respectively. These zones are likely indicative of urban or industrial hubs that will continue to drive energy consumption growth in the future. Additionally, localized demand spikes, such as the 745.2 GWh forecast in row 867, column 29, suggest emerging consumption centers that may require strategic interventions to ensure adequate energy supply. Conversely, many regions in the forecast exhibit minimal or zero demand, reflecting underdeveloped or sparsely populated areas where significant growth is not expected within the projection horizon. However, incremental increases in previously low-demand regions, such as the 46.9 GWh forecast in row 868, column 29, indicate the gradual expansion of electrical coverage and development in these zones.
The forecast projects the addition of five new service areas within segment S1 over the 20-year horizon, a marked acceleration compared with the historical trend. Between 2007 and 2023, the distribution company expanded its serviced area by only one grid, achieved in 2019. This forecasted growth highlights the increasing electricity demand and underscores the need for proactive infrastructure planning to meet future consumption requirements.
Five (5) new areas, highlighted in blue, were identified due to the spatial analysis. This outcome is optimistic, considering the current level of electrical coverage provided by the distribution company and the historical precedent recorded during the 2007–2023 period, where only one new area was served for 17 years.
However, with the 9 × 9 core matrix, the projection for a 20-year horizon anticipates the inclusion of two (2) new areas in segment S1, with estimated consumption values highlighted in blue, as shown in Table 16. This result aligns more closely with the historical horizontal growth of the distribution company, which expanded its coverage by only one (1) grid square in 2019 during the 2007–2023 period.
The spatial forecast for 2043, presented in Table 16, highlights the predicted energy consumption across various grid segments using the 9 × 9 matrix. The results indicate significant variability in predicted consumption, with certain grid cells exhibiting high demand, while others remain at minimal or zero levels. This variability underscores the importance of spatial granularity in energy demand forecasting, as it enables the identification of critical consumption areas and potential zones for infrastructure development. Key findings from the forecast include notable energy concentrations in specific segments, such as cells (866, 28), (866, 29), and (865, 28), which show the highest consumption levels of 3607.7 GWh, 1158.2 GWh, and 2582.2 GWh, respectively. These high-demand zones are predominantly situated in areas characterized by urban or industrial growth. In contrast, several cells display zero or negligible demand, indicating either uninhabited regions or areas with limited infrastructure and minimal consumption activity. Highlighted values in blue, such as 16.1 GWh in cell (864, 31) and 17.3 GWh in cell (857, 34), represent smaller yet strategically significant growth zones. These areas may require targeted infrastructure enhancements to address future energy demand effectively and support anticipated development.
Additionally, the energy distribution pattern suggests horizontal growth in specific segments, aligning with historical expansion trends documented by the distribution company. Furthermore, the use of the 9 × 9 core matrix demonstrates its capability to accurately predict long-term energy consumption patterns, including the identification of two (02) new areas in segment S1 over the 20-year horizon. This result is consistent with the historical horizontal growth observed by the distribution company, emphasizing the reliability and robustness of the proposed forecasting approach in accommodating dynamic urban expansion and evolving consumption trends.
Finally, Figure 12 shows the geospatial forecasting results for the year 2043, depicted in red. Figure 12a, Figure 12b, and Figure 12c correspond to segments S1, S2, and S3, respectively. These results were obtained using the 9 × 9 convolutional kernel.

6. Conclusions

The proposed geospatial analysis algorithm demonstrates remarkable versatility and efficacy in addressing both system-wide energy forecasting and segmented consumption analysis. By integrating relative growth trends across consumption areas of varying scales, the algorithm enables precise modeling and strategic planning. Its ability to incorporate new consumption zones, constrained only by geographic limitations, ensures adaptability to dynamic urban expansion and evolving development needs.
In the evaluated case study, which utilized 17 years of historical data (11 years for training and 6 years for validation), the 9 × 9 core matrix exhibited superior performance for segments with scales of 1:10,000 and 1:1000. For smaller-scale segments at a 1:100 ratio, higher-order core matrices yielded better results. However, due to the smaller scale and inherent historical distortions in consumption patterns, additional data would be required to enhance accuracy in long-term forecasting for these cases.
The proposed methodology, along with the developed algorithms for consumption segmentation, learning models, and convolution operations, is fully implementable by electrical distribution companies, even those with limited historical records. Moreover, for the analyzed distribution system, the proposed geospatial energy consumption forecasting model extends over a 20-year horizon, offering a practical and scalable solution for long-term planning and system optimization.

7. Recommendations

The proposed model’s primary limitation is the dynamic nature of coverage factors, which require greater precision for smaller-scale grids (e.g., 100 m resolution). To enhance long-term geospatial forecasting, geographic systems and cadastral data must be periodically updated.
Incorporating additional spatial analysis parameters, such as physical or economic proximity factors, can further refine forecasting models. Growth patterns from logistic and Gompertz curves (S-curves) could also be integrated at various scales using optimization or machine learning techniques.
The analysis of satellite images to detect night lights offers potential applications, in particular for assessing demand in areas without an electricity grid. These data can be incorporated into the proposed algorithm as an additional parameter and guide the location of new “isolated” grids in the forecast and their integration into the existing electricity grid.
Incorporating real-time data from IoT sensors can significantly enhance geospatial demand analysis. Data such as electricity consumption, solar resources, wind resources, temperature, and other environmental factors can make spatial analyses more dynamic and responsive.
For more heterogeneous or larger-scale datasets, it is recommended to evaluate the suitability of alternative methods, such as DBSCAN and GMM for clustering, as well as models like LSTMs to determine relative growth. Defining the number of segments, grid sizes, and convolutional kernel is essential to ensure the effectiveness of the proposed methodology.
The proposed methodology can also be applied to other essential services such as gas and water. However, adaptations are necessary to accommodate different types of data and the specific dynamics of each resource. These modifications are crucial to achieve effective integration into the planning of multi-energy systems.
Include the analysis of the demand characterization factors for the geospatial forecast of the maximum demand in the different components of the electrical infrastructure. This analysis can be addressed using machine learning techniques (load factor) and the Monte Carlo method (simultaneity factor).

Author Contributions

Conceptualization, H.C. and Y.M.; methodology, H.C. and Y.M.; validation, H.C. and Y.M.; investigation, H.C. and Y.M.; writing—review and editing, H.C. and Y.M.; visualization, H.C. and Y.M.; supervision, Y.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Council for Scientific and Technological Development (CNPq) under Grant No. 03/2021 and Process No. 304779/2021-0.

Data Availability Statement

The original contributions presented in the study are included in the article, further inquiries can be directed to the corresponding author.

Acknowledgments

The authors gratefully acknowledge the invaluable support provided by the Faculty of Mechanical Engineering at the National University of Engineering, Lima, Peru, and the Department of Electrical Engineering at the Federal University of Paraíba, Brazil.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hammad, M.A.; Jereb, B.; Rosi, B.; Dragan, D. Methods and Models for Electric Load Forecasting: A Comprehensive Review. Logist. Supply Chain. Sustain. Glob. Chall. 2020, 11, 51–76. [Google Scholar] [CrossRef]
  2. Panda, S.K.; Jagadev, A.K.; Mohanty, S.N. Forecasting Methods in Electric Power Sector. Int. J. Energy Optim. Eng. (IJEOE) 2018, 7, 1–21. [Google Scholar] [CrossRef]
  3. Singh, A.K.; Ibraheem; Khatoon, S.; Muazzam, M.; Chaturvedi, D.K. Load forecasting techniques and methodologies: A review. In Proceedings of the 2012 2nd International Conference on Power, Control and Embedded Systems, Allahabad, India, 17–19 December 2012; pp. 1–10. [Google Scholar] [CrossRef]
  4. Klyuev, R.V.; Morgoev, I.D.; Morgoeva, A.D.; Gavrina, O.A.; Martyushev, N.V.; Efremenkov, E.A.; Mengxu, Q. Methods of Forecasting Electric Energy Consumption: A Literature Review. Energies 2022, 15, 8919. [Google Scholar] [CrossRef]
  5. Dashti, R.; Rouhandeh, M. Power distribution system planning framework (A comprehensive review). Energy Strategy Rev. 2023, 50, 101256. [Google Scholar] [CrossRef]
  6. Li, Q.; Yang, L.; Huang, S.; Liu, Y.; Guo, C. The Effects of Urban Sprawl on Electricity Consumption: Empirical Evidence from 283 Prefecture-Level Cities in China. Land 2023, 12, 1609. [Google Scholar] [CrossRef]
  7. Kuster, C.; Rezgui, Y.; Mourshed, M. Electrical load forecasting models: A critical systematic review. Sustain. Cities Soc. 2017, 35, 257–270. [Google Scholar] [CrossRef]
  8. Ahmad, N.; Ghadi, Y.; Adnan, M.; Ali, M. Load Forecasting Techniques for Power System: Research Challenges and Survey. IEEE Access 2022, 10, 71054–71090. [Google Scholar] [CrossRef]
  9. Mystakidis, A.; Koukaras, P.; Tsalikidis, N.; Ioannidis, D.; Tjortjis, C. Energy Forecasting: A Comprehensive Review of Techniques and Technologies. Energies 2024, 17, 1662. [Google Scholar] [CrossRef]
  10. Habbak, H.; Mahmoud, M.; Metwally, K.; Fouda, M.M.; Ibrahem, M.I. Load Forecasting Techniques and Their Applications in Smart Grids. Energies 2023, 16, 1480. [Google Scholar] [CrossRef]
  11. Verwiebe, P.A.; Seim, S.; Burges, S.; Schulz, L.; Müller-Kirchenbauer, J. Modeling Energy Demand—A Systematic Literature Review. Energies 2021, 14, 7859. [Google Scholar] [CrossRef]
  12. Kumar, R. A Comprehensive Study on Demand Forecasting Methods and Algorithms for Retail Industries. J. Univ. Shanghai Sci. Technol. 2021, 23, 409–420. [Google Scholar]
  13. Blagajac, S.; Krajcar, S.; Skrlec, D. A computational tool to support spatial electric load forecasting. In Proceedings of the 2000 10th Mediterranean Electrotechnical Conference. Information Technology and Electrotechnology for the Mediterranean Countries. Proceedings. MeleCon 2000 (Cat. No.00CH37099), Lemesos, Cyprus, 29–31 May 2000; Volume 3, pp. 1116–1119. [Google Scholar] [CrossRef]
  14. Ying, L.C.; Pan, M.C. Using adaptive network based fuzzy inference system to forecast regional electricity loads. Energy Convers. Manag. 2008, 49, 205–211. [Google Scholar] [CrossRef]
  15. Carreno, E.M.; Rocha, R.M.; Padilha-Feltrin, A. A Cellular Automaton Approach to Spatial Electric Load Forecasting. IEEE Trans. Power Syst. 2011, 26, 532–540. [Google Scholar] [CrossRef]
  16. Melo, J.D.; Carreno, E.M.; Padilha-Feltrin, A. Spatial load forecasting using a demand propagation approach. In Proceedings of the 2010 IEEE/PES Transmission and Distribution Conference and Exposition: Latin America (TD-LA), Sao Paulo, Brazil, 8–10 November 2010; pp. 196–203. [Google Scholar] [CrossRef]
  17. Melo, J.D.; Carreno, E.M.; Padilha-Feltrin, A. Multi-agent framework for spatial load forecasting. In Proceedings of the 2011 IEEE Power and Energy Society General Meeting, Detroit, MI, USA, 24–29 July 2011; pp. 1–8. [Google Scholar] [CrossRef]
  18. Melo, J.D.; Carreno, E.M.; Padilha-Feltrin, A. Considering Urban Dynamics in spatial electric load forecasting. In Proceedings of the 2012 IEEE Power and Energy Society General Meeting, San Diego, CA, USA, 22–26 July 2012; pp. 1–7. [Google Scholar] [CrossRef]
  19. Xiao, B.; Guo, P.; Mu, G.; Yan, G.; Li, P.; Cheng, H.; Li, J.; Bai, Y. A Spatial Load Forecasting Method Based on the Theory of Clustering Analysis. Phys. Procedia 2012, 24, 176–183. [Google Scholar] [CrossRef]
  20. Jain, R.K.; Smith, K.M.; Culligan, P.J.; Taylor, J.E. Forecasting energy consumption of multi-family residential buildings using support vector regression: Investigating the impact of temporal and spatial monitoring granularity on performance accuracy. Appl. Energy 2014, 123, 168–178. [Google Scholar] [CrossRef]
  21. Shin, J.H.; Yi, B.J.; Kim, Y.I.; Lee, H.G.; Ryu, K.H. Spatiotemporal Load-Analysis Model for Electric Power Distribution Facilities Using Consumer Meter-Reading Data. IEEE Trans. Power Deliv. 2011, 26, 736–743. [Google Scholar] [CrossRef]
  22. Wu, H.C.; Lu, C.N. A Data Mining Approach for Spatial Modeling in Small Area Load Forecast. IEEE Power Eng. Rev. 2002, 22, 60. [Google Scholar] [CrossRef]
  23. Wu, J.; He, S.; Yang, Y. Application of Particularity Decision Method in Spatial Load Forecasting. In Proceedings of the 2010 Second WRI Global Congress on Intelligent Systems, Wuhan, China, 16–17 December 2010; Volume 2, pp. 173–176. [Google Scholar] [CrossRef]
  24. Vasquez-Arnez, R.L.; Jardini, J.A.; Casolari, R.; Magrini, L.C.; Semolini, R.; Pascon, J.R. A methodology for electrical energy forecast and its spatial allocation over developing boroughs. In Proceedings of the 2008 IEEE PES Transmission and Distribution Conference and Exposition, Chicago, IL, USA, 21–24 April 2008; pp. 1–6. [Google Scholar] [CrossRef]
  25. Willis, H.L. Power Distribution Planning Reference Book, 2nd ed.; CRC Press: Boca Raton, FL, USA, 2004. [Google Scholar] [CrossRef]
  26. Willis, H.L. Spatial Electric Load Forecasting, 2nd ed.; CRC Press: Boca Raton, FL, USA, 2002. [Google Scholar] [CrossRef]
  27. Vieira, D.; Silva, B.; Menezes, T.; Lisboa, A. Large scale spatial electric load forecasting framework based on spatial convolution. Int. J. Electr. Power Energy Syst. 2020, 117, 105582. [Google Scholar] [CrossRef]
  28. Liu, Z.; Pan, W.; Liu, J.; Wen, K.; Gong, J. A combined approach for forecasting regional long-term natural gas demand integrating K-means clustering, grey theory, and BP neural network. Oil Gas Storage Transp. 2024, 103–110. [Google Scholar]
Figure 1. Overview of network development. (a): Area of power substations and medium voltage network development; (b): medium voltage network development and grid mapping.
Figure 1. Overview of network development. (a): Area of power substations and medium voltage network development; (b): medium voltage network development and grid mapping.
Energies 18 00424 g001
Figure 2. Coverage factor. (a): Coverage factor in grids; (b): coverage factor calculation S3: 100 m.
Figure 2. Coverage factor. (a): Coverage factor in grids; (b): coverage factor calculation S3: 100 m.
Energies 18 00424 g002
Figure 3. Constraints factor.
Figure 3. Constraints factor.
Energies 18 00424 g003
Figure 4. Attraction poles.
Figure 4. Attraction poles.
Energies 18 00424 g004
Figure 5. Matrix convolution. (a): Standard convolution (unweighted kernel); (b): weighted convolution.
Figure 5. Matrix convolution. (a): Standard convolution (unweighted kernel); (b): weighted convolution.
Energies 18 00424 g005
Figure 6. Flowchart of the proposed method.
Figure 6. Flowchart of the proposed method.
Energies 18 00424 g006
Figure 7. Grid C10,000, C1000, and C100 in segments S1, S2, and S3.
Figure 7. Grid C10,000, C1000, and C100 in segments S1, S2, and S3.
Energies 18 00424 g007
Figure 8. Area served by the “Lima Sur” electrical distribution system.
Figure 8. Area served by the “Lima Sur” electrical distribution system.
Energies 18 00424 g008
Figure 9. Consumption grids. (a): Segment S1; (b): segment S2; (c): segment S3.
Figure 9. Consumption grids. (a): Segment S1; (b): segment S2; (c): segment S3.
Energies 18 00424 g009
Figure 10. New grids with real consumption in the period 2018–2023. (a): Segment S1; (b): segment S2; (c): segment S3.
Figure 10. New grids with real consumption in the period 2018–2023. (a): Segment S1; (b): segment S2; (c): segment S3.
Energies 18 00424 g010
Figure 11. Forecast of new grids with consumption in the period 2018–2023. (a): Segment S1; (b): segment S2; (c): segment S3.
Figure 11. Forecast of new grids with consumption in the period 2018–2023. (a): Segment S1; (b): segment S2; (c): segment S3.
Energies 18 00424 g011
Figure 12. Geospatial forecast for the year 2043: kernel 9 × 9 . (a): Segment S1; (b): segment S2; (c): segment S3.
Figure 12. Geospatial forecast for the year 2043: kernel 9 × 9 . (a): Segment S1; (b): segment S2; (c): segment S3.
Energies 18 00424 g012
Table 1. Global energy consumption and number of supplies during the period 2007–2023.
Table 1. Global energy consumption and number of supplies during the period 2007–2023.
YearEnergy Consumption (GWh)Number of Supplies
20074573739,832
20084942767,013
20095774795,316
20106285824,621
20116721859,094
20127162890,984
20137547924,800
20148120957,574
20158239984,299
201686531,012,234
201786331,041,790
201889101,074,496
201992331,109,807
202089301,130,554
202185861,176,210
202290951,215,024
202393941,255,141
Table 2. Number of grids in segments S1, S2, and S3.
Table 2. Number of grids in segments S1, S2, and S3.
YearGrids with ConsumptionNew Grids
S1: C10,000 S2: C1000 S3: C100 S1: C10,000 S2: C1000 S3: C100
20074188832,038
20084189532,47107433
20094190633,040011569
20104192033,578014538
20114193034,180010602
20124193834,62308443
20134194235,15804535
20144194835,59606438
20154195535,85907263
20164196636,197011338
20174198036,667014470
20184198637,08906422
201942100637,630120541
202042101637,950010320
202142102738,919011969
202242104039,577013658
202342106140,357021780
Table 3. Analysis of historical consumption in grids.
Table 3. Analysis of historical consumption in grids.
GridsComplete DataIncomplete Data
S1: C10,000 S2: C1000 S3: C100 S1: C10,000 S2: C1000 S3: C100
Total4188531,9430954773
Outliers8761578024637
Clusters3380930,3650714136
Table 4. Clustering results.
Table 4. Clustering results.
GroupsS1: C10,000S2: C1000S3: C100
Optimal6714
Outliers111
Total7815
Table 5. Base energy (E) and consumption groups.
Table 5. Base energy (E) and consumption groups.
Baseline Energy: 2017 Consumption Groups
Y↓/X→ 27 28 29 30 31 32 33 34 35 36 Y↓/X→ 27 28 29 30 31 32 33 34 35 36
8710.5 8711
8700.20.05.2 870157
8690.70.01.31.1 8695557
86840.43.31.65.2 86815225
86762.8585.1162.525.90.2 86716222
866963.52891.6204.922.4 8662221
865351.91560.1370.56.3 8652276
864201.6498.8226.2 864164
8639.3133.6 86377
86212.19.7 86211
8615.784.60.30.6 8611733
8605.016.4 86062
859135.3 8597
85819.86.9 85827
857 857
Table 6. Relative growth rate in consumption groups of segment S1 (T).
Table 6. Relative growth rate in consumption groups of segment S1 (T).
Group201820192020202120222023
16.4%6.0%5.7%5.4%5.1%4.8%
22.0%1.9%1.8%1.9%1.8%1.8%
39.0%8.3%7.6%7.1%6.6%6.2%
48.5%7.7%7.0%6.9%6.3%5.9%
5−0.1%−0.2%−0.6%0.2%−0.2%−0.2%
61.1%1.0%1.0%1.0%1.0%1.0%
74.9%4.0%3.3%4.4%3.6%3.5%
Table 7. Coverage factor (C) and constraints (R).
Table 7. Coverage factor (C) and constraints (R).
Coverage Factor Constraints
Y↓/X→ 27 28 29 30 31 32 33 34 35 36 Y↓/X→ 27 28 29 30 31 32 33 34 35 36
8710.2% 8711
8700.1%0.0%0.1% 870111
8692.0%0.2%0.0%0.2%0.2% 86911111
8682.4%3.7%1.2%0.3%0.2% 86811111
8674.6%25.2%12.6%4.2%0.1% 86711111
86611.7%48.3%21.6%5.7% 8661111
8653.4%58.0%15.7%2.6% 8651111
86414.8%41.0%7.6%2.0% 8641111
8630.6%6.8%2.0% 863111
8623.8%2.9%2.0% 862111
8612.1%8.0%0.1%0.2% 8611111
8600.5%2.7% 86011
8594.6% 8591
8582.0%4.2% 85811
8572.0% 8571
Table 8. Energy consumption in S1 grids (GWh).
Table 8. Energy consumption in S1 grids (GWh).
S1Model Training PeriodModel Validation
C10,00020072008200920102011201220132014201520162017201820192020202120222023
27-865225243263278289306318328341361352352362319285316345
27-866638682709763822867876904937986963973999904801872960
28-864819599113124135153166173188202213225239233244265
28-8659401012105511181191126913341400144415321560161816821798158316161703
28-86618141919213323062422253526622784278929192892291229662746257527382841
28-8673032323639434752566263697271828687
29-8632222223358991117161615
29-864203224231266321384422478475488499525545579560572559
29-8653337347366354344337465417396370402440327456484437
29-866117129139149160166175183191203205213220252227230242
29-867194230289353397453508565586616585622656632661710718
29-8681516161818
30-8612233333446667881214
30-8626677891011111212131317161616
30-86322225062778092125122131134144144135155173159
30-8644266718688112156175193212226242238212252281287
30-86523344566666778888
30-8661112131415161819202222242529282829
30-867103114123130139148154164167167163171179192187195197
31-86022223334455566667
31-861910111113162027406585100120112132138144
31-862233344445610141715182121
31-8671819202223232425262726272831282929
31-8683032333536374041414240414247424241
32-8581011121213141516171820191921212022
32-859141197105114119119132127130135140140136135164168
32-86089101112121315161616171922192021
32-86100000000000001111
32-86822222333333444458
32-86911111111111111111
32-87000000000000000000
33-8582223456667788991010
33-86100000001111111111
33-86700000000000000000
33-86811111111222222222
33-86900000000000000000
33-87000000000000000000
33-87100000001101100111
34-868555556665556779109
34-86911111111111112112
35-869111122932133111355433
35-87022322224555553344
Total45734942577462856721716275478120823986538633891092338930858690959394
Table 9. Geospatial forecast for 2018: core matrix 3 × 3.
Table 9. Geospatial forecast for 2018: core matrix 3 × 3.
Y↓/X→27282930313233343536
8710.00.00.00.00.00.00.50.00.00.0
8700.00.00.00.00.00.20.00.05.20.0
8690.00.00.00.00.00.70.01.41.10.0
8680.00.00.00.040.73.31.65.20.00.0
8670.068.3605.8165.026.20.00.20.00.00.0
866981.42949.2224.524.20.00.00.00.00.00.0
865357.51633.0387.57.70.00.00.00.00.00.0
8640.0213.5530.2230.30.00.00.00.00.00.0
8630.00.09.6136.10.00.00.00.00.00.0
8620.00.00.012.710.10.00.00.00.00.0
8610.00.00.05.985.00.30.60.00.00.0
8600.00.00.00.05.116.80.00.00.00.0
8590.00.00.00.00.0135.80.00.00.00.0
8580.00.00.00.00.020.17.60.00.00.0
8570.00.00.00.00.00.00.00.00.00.0
Total: 8910.3
Table 10. Geospatial forecast for 2023: core matrix 3 × 3 (GWh).
Table 10. Geospatial forecast for 2023: core matrix 3 × 3 (GWh).
Y↓/X→27282930313233343536
8710.00.00.00.00.00.00.50.00.00.0
8700.00.00.00.00.00.20.00.05.20.0
8690.00.00.00.00.00.70.01.41.10.0
8680.00.01.20.041.33.31.65.20.00.0
8670.077.7642.1169.426.90.00.20.00.00.0
8661014.53052.5259.327.10.00.00.00.00.00.0
865367.61762.4416.79.80.00.00.00.00.00.0
8640.0233.8581.0236.50.00.00.00.00.00.0
8630.00.010.1139.90.00.00.00.00.00.0
8620.00.00.013.510.70.00.00.00.00.0
8610.00.00.06.285.80.30.60.00.00.0
8600.00.00.00.05.217.30.00.00.00.0
8590.00.00.00.00.0136.50.00.00.00.0
8580.00.00.00.00.020.68.60.00.00.0
8570.00.00.00.00.00.00.00.00.00.0
Total: 9394.2
Table 11. Calculation of MAE, MSE, RMSE, and MAPE.
Table 11. Calculation of MAE, MSE, RMSE, and MAPE.
ScenarioGrid SizeMAE (GWh)RMSE (GWh)
201820192020202120222023201820192020202120222023
Global 5.38.822.425.025.520.813.122.752.858.859.353.3
Zonal 5.09.823.125.023.318.111.220.252.458.755.648.0
Spatial [27] 5.310.221.025.622.019.19.218.347.660.444.433.1
Proposed3 × 34.37.219.325.323.418.68.414.245.359.250.940.3
5 × 53.86.518.925.921.215.97.212.846.060.847.835.1
7 × 73.66.318.825.821.315.56.911.945.760.547.634.6
9 × 93.56.118.625.721.115.06.611.345.560.347.233.8
ScenarioGrid SizeMAPEWAPE
201820192020202120222023201820192020202120222023
Global 8.0%13.7%23.9%26.2%25.1%24.5%2.4%4.0%10.5%12.2%11.8%9.3%
Zonal 7.9%13.8%24.0%26.3%24.1%22.9%2.3%4.4%10.9%12.2%10.8%8.1%
Spatial [27] 15.2%20.8%20.9%27.9%23.9%24.7%2.5%4.6%9.9%12.5%10.2%8.5%
Proposed3 × 38.4%14.6%23.2%26.1%25.3%25.3%2.0%3.3%9.1%12.4%10.8%8.3%
5 × 58.7%14.5%22.4%26.7%24.5%24.4%1.8%2.9%8.9%12.6%9.8%7.1%
7 × 78.5%13.4%21.1%27.0%23.3%23.1%1.7%2.8%8.8%12.6%9.8%6.9%
9 × 98.4%12.3%18.8%27.4%20.8%19.0%1.6%2.8%8.8%12.6%9.8%6.7%
Table 12. MAPE and WAPE metrics in segments S1, S2, and S3: average for periods 2018–2019 and 2018–2023.
Table 12. MAPE and WAPE metrics in segments S1, S2, and S3: average for periods 2018–2019 and 2018–2023.
ScenarioGrid SizeMAPEWAPE
Segment S1 Segment S2 Segment S3 Segment S1 Segment S2 Segment S3
18–19 18–23 18–19 18–23 18–19 18–23 18–19 18–23 18–19 18–23 18–19 18–23
Spatial [27] 18.0%22.2%16.7%31.4%16.6%25.8%3.5%8.0%6.3%12.8%12.9%23.3%
Proposed3 × 311.5%20.5%13.9%26.2%15.9%26.9%2.6%7.6%6.1%12.7%12.6%23.2%
5 × 511.6%20.2%14.1%27.4%16.4%27.1%2.4%7.2%6.0%12.5%12.8%23.3%
7 × 711.0%19.4%14.3%27.7%16.7%26.8%2.3%7.1%5.8%12.3%12.8%23.2%
9 × 910.3%17.8%14.6%27.6%17.0%27.3%2.2%7.0%5.8%12.3%12.8%23.2%
Table 13. Global forecast, period 2024–2043.
Table 13. Global forecast, period 2024–2043.
AñoEnergy Consumption (GWh)T.C. AñoEnergy Consumption (GWh)T.C.
202496592.9% 203412,3392.2%
202599272.8% 203512,6072.2%
202610,1952.7% 203612,8742.1%
202710,4632.6% 203713,1422.1%
202810,7312.6% 203813,4102.0%
202910,9992.5% 203913,6782.0%
203011,2672.4% 204013,9462.0%
203111,5352.4% 204114,2141.9%
203211,8032.3% 204214,4821.9%
203312,0712.3% 204314,7501.9%
Table 14. Reference growth rate for segment S1 consumption groups.
Table 14. Reference growth rate for segment S1 consumption groups.
AñoS1 Consumer Groups
1234567
20244.9%0.0%4.8%7.5%6.8%3.6%5.8%
20254.2%0.1%4.0%6.8%6.4%2.8%4.2%
20263.6%−0.5%3.3%6.1%5.9%2.1%3.1%
20273.5%−0.5%3.2%5.7%5.6%2.1%3.0%
20283.0%−0.9%2.7%5.2%5.3%1.6%2.1%
20293.7%0.1%3.5%5.4%5.1%2.5%3.7%
20303.2%−0.5%2.9%4.9%4.8%2.0%2.7%
20313.1%−0.5%2.8%4.7%4.6%1.9%2.6%
20322.7%−0.9%2.4%4.3%4.3%1.4%1.8%
20333.2%0.1%3.1%4.4%4.2%2.3%3.3%
20342.8%−0.5%2.6%4.1%4.0%1.8%2.5%
20352.7%−0.5%2.6%3.9%3.8%1.8%2.4%
20362.4%−0.9%2.1%3.6%3.7%1.3%1.7%
20372.9%0.1%2.8%3.8%3.6%2.2%3.0%
20382.5%−0.5%2.4%3.5%3.4%1.7%2.2%
20392.5%−0.5%2.3%3.4%3.3%1.7%2.2%
20402.2%−1.0%1.9%3.2%3.2%1.3%1.5%
20412.6%0.1%2.6%3.3%3.2%2.0%2.8%
20422.3%−0.5%2.2%3.1%3.0%1.6%2.0%
20432.2%−0.5%2.1%3.0%2.9%1.6%2.0%
Table 15. Spatial forecast for the year 2043 in S1—3 × 3 matrix (GWh).
Table 15. Spatial forecast for the year 2043 in S1—3 × 3 matrix (GWh).
Y↓/X→27282930313233343536
8710.6
8700.50.34.6
8694.93.40.62.33.7
86846.945.711.23.010.2
86793.4745.2352.652.62.5
866959.63416.7290.9145.2
865347.83772.2722.768.9
864689.01602.0430.521.7
86361.2255.617.8
86233.633.02.0
86121.2164.25.61.1
86016.233.3
859189.0
85833.634.6
8570.9
14,754.2
Table 16. Spatial forecast for the year 2043 in S1—9 × 9 matrix (GWh).
Table 16. Spatial forecast for the year 2043 in S1—9 × 9 matrix (GWh).
Y↓/X→27282930313233343536
87121.3
8702.121.09.8
8693.828.08.78.4
86855.579.532.646.528.2
867168.81088.8364.674.243.4
8661158.23607.7549.4102.1
865401.92582.2653.541.4
864489.01296.4417.616.1
86345.5264.4
86248.745.9
86166.4340.349.745.5
86052.483.4
859211.5
85838.144.1
85717.3
14,754.2
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chávez, H.; Molina, Y. Geospatial Forecasting of Electric Energy in Distribution Systems Using Segmentation and Machine Learning with Convolutional Methods. Energies 2025, 18, 424. https://doi.org/10.3390/en18020424

AMA Style

Chávez H, Molina Y. Geospatial Forecasting of Electric Energy in Distribution Systems Using Segmentation and Machine Learning with Convolutional Methods. Energies. 2025; 18(2):424. https://doi.org/10.3390/en18020424

Chicago/Turabian Style

Chávez, Héctor, and Yuri Molina. 2025. "Geospatial Forecasting of Electric Energy in Distribution Systems Using Segmentation and Machine Learning with Convolutional Methods" Energies 18, no. 2: 424. https://doi.org/10.3390/en18020424

APA Style

Chávez, H., & Molina, Y. (2025). Geospatial Forecasting of Electric Energy in Distribution Systems Using Segmentation and Machine Learning with Convolutional Methods. Energies, 18(2), 424. https://doi.org/10.3390/en18020424

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop