1. Introduction
The year 2022 was marked by a significant increase in electricity prices. Electricity traders observed price increases at the level of several hundred EUR/MWh. The year 2021 also saw a rise in prices due to the COVID-19 pandemic. When we examine electricity prices in Germany, the number of instances where prices exceeded 100 EUR/MWh is shown in
Table 1. Over the entire observed period (2015 to 2024), the number of instances where the electricity price exceeded 100 EUR/MWh was recorded 14,654 times which is approximately 18% from history data, specifically:
Number price between 100–199.9 EUR/MWh 9052 times
Number price between 200–299.9 EUR/MWh 3020 times
Number price between 300–399.9 EUR/MWh 1304 times
Number price between 400–499.9 EUR/MWh 761 times
Number price between 500–599.9 EUR/MWh 356 times
Number price between 600–699.9 EUR/MWh 106 times
Number price between 700–799.9 EUR/MWh 48 times
Number price between 700–799.9 EUR/MWh 7 times
Number price between 800–899.9 EUR/MWh 0 times
All analyses in Chapters 1 and 2 were performed in Excel by Microsoft VBA.
It was not until 2021 that electricity prices began to rise significantly in Germany, starting around September 2021. Since 2021, the frequency of prices exceeding 100 EUR/MWh has not returned to pre-2021 levels. This increase is mainly due to various fundamental factors, including the COVID-19 pandemic, the war in Ukraine, and various political decisions. A detailed analysis of electricity price trends in 2022 indicates that one of the most significant factors influencing electricity prices in that year was the war in Ukraine. This conflict led to a substantial disruption of natural gas and oil supplies from Russia, a major supplier of energy resources to Europe. Sanctions imposed on Russia and restrictions on gas supplies caused a sharp rise in energy prices on global markets. Since natural gas is often used for electricity generation, the increase in its prices had a direct impact on the cost of electricity production.
Following the lifting of pandemic restrictions in 2021, there was a significant revival in economic activity, which increased energy demand. Industrial sectors sought to make up for lost time by ramping up production, resulting in higher electricity demand. This increased demand contributed to higher prices as supply could not keep pace with the growing demand. The COVID-19 pandemic caused significant disruptions in global supply chains, leading to shortages of critical components and materials needed for electricity production and distribution. The shortage of components, such as semiconductors and other technologies, affected the production and maintenance of energy equipment, subsequently impacting the stability and reliability of electricity supply.
Extreme climatic conditions also contributed to higher electricity prices. In 2022, several extreme heatwaves and droughts were recorded, affecting electricity production from renewable sources such as hydroelectric plants.
2. Analysis of Electricity Price Trends from 2015 to 2024
Recent electricity price data indicate increasing volatility in electricity prices. For instance, 2022 was recorded as the most volatile year in recent years. In Germany (
Figure 1), the highest recorded electricity price was EUR871/MWh, while the lowest was −EUR19.04/MWh, resulting in a price range of EUR890.04. The primary causes of commodity price fluctuations are various fundamental factors. In 2022, several such factors were identified [
1,
2,
3]:
Energy Crisis Due to the Russia–Ukraine Conflict: The war between Russia and Ukraine led to significant restrictions on the supply of natural gas and oil from Russia to Europe, causing an increase in the prices of these commodities and subsequently electricity, which is also generated from gas.
Increase in Gas Prices: Gas is a key input for electricity generation in gas-fired power plants. The rise in its price directly impacted the cost of electricity production, thereby affecting its market price.
Increased Demand for Energy: Following the COVID-19 pandemic, economic activities resumed, leading to an increased demand for energy, including electricity.
Supply Constraints of Renewable Sources: Unfavorable weather conditions during certain periods of 2022 limited electricity production from renewable sources such as wind and solar power plants.
Insufficient Capacity of Nuclear Power Plants: Countries like France had to reduce electricity production from nuclear power plants due to technical issues and maintenance, reducing the overall availability of electricity.
Regulations and Emission Allowances: The European Union’s policies on reducing CO2 emissions led to higher prices for emission allowances, increasing the cost of electricity production from fossil fuels.
When examining electricity prices, frequency charts were created for each year from 2015 to 2024 in Germany (
Figure 2). The
x-axis represents price ranges with a bin width of EUR10/MWh. The highest and lowest prices in a given year were identified to determine the number of bins, thus establishing the number of columns on the
x-axis. From the data analysis, the following conclusions can be drawn:
The results of this research indicate that the number of columns, and thus the number of price ranges, was significantly smaller from 2015 to 2020 compared to 2021 to 2024. In 2022, the highest number of price ranges was recorded in the observed period. As previously mentioned and shown in
Figure 1, this is due to the high volatility of electricity prices in 2022. The results also show that the number of price ranges from 2015 to 2021 was approximately 4. This means that electricity prices varied by up to 4 * EUR10/MWh = EUR40/MWh during this period. In 2022, this number was significantly higher. However, after 2022, the number of price ranges did not return to 4 but increased to approximately 16 for Germany. This suggests that the trend in electricity price evolution did not return to pre-2022 levels after 2022. As of 2024, the number of price ranges in Germany is approximately 8, although this considers data only up to the end of May.
Table 2 displays the minimum range of electricity prices for each year from 2015 to 2024 for the countries Czech Republic, Germany, Hungary, Poland, and Slovakia. It is important to note that the frequency within these minimum ranges is low. For instance, in 2018, the minimum range of electricity prices for Germany was −80 to −70 EUR/MWh. However, the price appeared in this range only three times. The analysis was conducted on hourly data, and thus, in Germany, the price appeared in the range of −80 to −70 EUR/MWh for only three hours. Similarly, the same applies to the maximum ranges of electricity prices; their frequency is low.
Based on
Figure 2, it is possible to observe that from 2015 to 2020, the distribution of electricity prices follows a Gaussian curve. However, 2021 and 2022 were different. The peak (the range with the highest values) was approximately in the center before 2021, and the distribution was relatively even. In 2021 and 2022, however, the peak shifted to the left towards lower prices. In these years, there are extremes towards higher ranges, indicating extreme electricity prices. These extremes are not very frequent, though. The main distribution of electricity prices in these years appears to the left towards lower prices. This is also due to the frequency (although low) of extreme electricity prices towards the right.
Figure 3 shows the graph of maximum and minimum electricity prices for the years 2015 to 2024 for the countries Czech Republic, Germany, Hungary, Poland, and Slovakia. For the reasons mentioned above, the greatest range between the maximum and minimum electricity prices was in 2022. Similarly, in 2021, the range was higher than in the years 2015 to 2020. In 2023 and 2024, the range returned to normal values as before 2021. From 2015 to 2020, electricity prices remained relatively stable in most of these countries, with a slight increase in some cases. However, around 2020, there was a significant surge in prices, culminating in a peak in 2022. This sharp increase likely reflects market disruptions caused by external factors such as geopolitical events, energy crises and so on. After 2022, electricity prices began to decline again in these countries, with the rate of decline varying across nations. In Germany, the prices experienced a dramatic drop after 2022, even falling below zero in 2023, indicating a period of negative electricity prices, possibly due to oversupply or specific market conditions. Poland recorded the highest price peak in 2022, with prices exceeding 1000 EUR/MWh, followed by a sharp decline in the subsequent years.
Table 3 shows the number of negative electricity prices for the individual years from 2015 to 2024 for the countries Czech Republic, Germany, Hungary, Poland, and Slovakia. Naturally, the lowest number of negative prices in recent years was in 2022, when high electricity prices were a problem in these and other countries. However, the table shows a trend of negative prices in the recent period. As other publications describe, the main reason for negative electricity prices is excessive production from photovoltaics and wind turbines. In the case of the Czech Republic, Germany, and Slovakia, a significant number of negative prices can be seen over the entire observed period. Moreover, the analyses were conducted on data up to the end of April 2024, when photovoltaic plants had not yet produced their highest amount of energy. Interestingly, Poland recorded its first negative prices in 2023 during the observed period. The reason might be that the amount of electricity produced from photovoltaics increased by 7.5 times from 2020 to 2023.
- -
Due to the volatility of electricity prices, there has been an increased emphasis on the possibilities of electricity price prediction and deeper analysis of electricity price data. In 1935, Benford’s Law was discovered, which describes the frequency with which the digits 1 through 9 appear as the leading significant digits in a wide range of data sets. While digit one appears in about one-third of cases, the probability for digit two is 17.6%, for digit three is 12.5%, and this proportion gradually decreases to the digit nine, which appears as the leading digit with a probability of 4.6%. Scientists discovered that this relationship was actually identified by astronomer Newcomb in 1881, but at that time, he could not convince many people of its validity. Both discoverers noticed while reading logarithmic tables in the library that the first pages were much dirtier than the others, indicating that their colleagues from various fields were more frequently looking up logarithmic values for numbers starting with one. Benford found that the law applies to various sets of data—population numbers, death rates, chemical and physical constants, baseball statistics, half-lives of radioactive isotopes, prime numbers, elements of the Fibonacci sequence, and many others. On the other hand, Benford’s Law does not apply to random sets with certain restrictions, such as lottery numbers, telephone numbers, dates, heights, and weights of groups of people. According to scientists, the law also applies to the second digit, but with less reliability; for further digits, the numbers have a random distribution with equal probability of occurrence. Scientists have not yet found a rule that can determine whether Benford’s Law applies to a given set of numbers or not [
4]. Benford’s Law has been applied to many different data sets [
5,
6,
7,
8,
9,
10,
11,
12]:
- -
To determine whether the lists of COVID-19 infection count, which claim to measure actual events, have been manipulated,
- -
Analysis of lightning data and evaluation of negative effects with precise lightning parameters in kA units,
- -
Detecting image tampering during resizing and compression,
- -
Identifying anomalies in the number of publications per researcher and the number of researchers per publication,
- -
Analyzing the distribution of initial letters in novels and similar studies,
- -
Evaluating the quality of economic data in enterprises,
- -
Application to Chinese texts,
- -
Detecting whether a given text is typed by a human or a machine (robot).
However, to date, there is no research that focuses on the application of Benford’s Law to electricity prices or other commodities. As previously mentioned, Benford’s Law has been applied to various types of data, not just in technical environments. This paper aims to investigate the following questions through research:
- (a)
Is a one-year sample of electricity price data sufficient?
- (b)
Considering that electricity prices can be negative, how should negative numbers be handled? In practice, the situation may arise where the electricity price is 41 EUR/MWh as well as −41 EUR/MWh. Will the research results utilizing Benford’s Law differ when considering negative numbers versus not considering negative numbers?
- (c)
Can Benford’s Law be used for potential electricity price prediction?
- (d)
Can the statistical results be utilized in other ways?
3. Methodology and Data
The aim of the research was to apply Benford’s Law to electricity prices and determine whether it can be effectively used for electricity price analysis. Since this area has not been previously explored, we established the following objectives:
To verify whether Benford’s Law can be applied to electricity prices.
To compare the results of the analysis for the years before 2020 and from 2020 onward, due to the differing conditions in the electricity market.
To determine the future direction of research based on the results obtained.
It may seem surprising that the first person to encounter this phenomenon was the American astronomer Simon Newcomb in 1881. He wrote a two-page article about it [
7], in which he noticed that the pages at the beginning of logarithmic tables were more worn and more frequently used than those towards the end. At that time, no one paid much attention to this article.
In the 1930s, Frank Benford observed the same phenomenon regarding the wear of logarithmic tables. Apparently unaware of Newcomb’s article, Benford published his own more extensive article in 1938 [
8], in which he collected 20,229 observations from various fields (human populations, values of constants, numbers in newspapers, lengths of rivers, addresses, etc.).
Both concluded the same: people using logarithmic tables encounter numbers starting with the digits 1 and 2 much more frequently than those starting with the digits 8 and 9.
According to the rules of the law, the first significant digit probability can be expressed. Benford found various datasets and stated a few dataset properties, which made the dataset best suited according to BL.
The definition of a significant number is as follows [
9]: every real number excluding zero and denoting it as
x, the first significant decimal number of the real number
x, denoted as
D1(
x), is the unique integer j ∈ {1, 2, …, 9} satisfying conditions [
10]
for some (unique) k ∈ Z. Additionally, for every number m ≥ 2, m ∈ N, the m-th significant decimal digit of
x is denoted as
Dm(
x). This is defined inductively as a unique integer number j ∈ {0, 1, …, 9} in the following formula [
7]:
for some unique number k ∈ Z; for convenience,
Dm(0): = 0 for all m ∈ N. By definition, the first significant digit
D1(
x) of x! = 0 is never zero. However, when considering the second, third, fourth and so on, these significant digits may be any integers including zeros, i.e., they are defined by the set of {0, 1, …, 9} decimal numbers.
Thus, Benford’s law is a statistical law describing the probability that a specific digit becomes the first significant digit of the number from a dataset. However, the dataset that follows BL must fulfill the following basic requirements:
1. The dataset must not be radically restricted in value range (e.g., the dataset of people’s height or IQ is a radical restriction because of a very small range of possible values) [
10].
2. The dataset must not be influenced by any kind of artificial effects caused by human actions aiming to change the values intentionally [
10].
3. The value range in the dataset must be large enough [
11]:
where max and min stand for maximal and minimal dataset values, respectively.
4. The dataset should be large enough.
The Formula (2) describes the calculation of the first significant digit probability according to BL [
12]:
where
Fd is the probability value. (In the case of decimal numbers, we have to use decimal logarithm), and
Then, for digit 1:
i.e., the probability that the decimal digit 1 is the first significant digit in approximately 30,103% of cases. The sum of all probabilities is 1, i.e., 100% probability for all decimal digits d ϵ D = (1, 2… 9), the digit 0 cannot be the first significant digit, and dataset samples with the value 0 (e.g., 0 or 0.000) are excluded from the dataset because they do not contain a significant digit.
From the equations, we can observe a descending trend of the probability. The probability distribution mentioned above is valid for decimal number systems; however, BL is not restricted to this system, and it is also applicable for other numeric systems, e.g., octal or hexadecimal numeric systems. Then, we can write the first significant digit probability equation in a more general form, assuming that the difference between 2 logarithms equals the logarithm of the ratio:
where
d denotes a particular digit from a particular numeric system, and n denotes the particular numeric system [
10].
Analogously to the first significant digit, it is possible to define the second, third, fourth, and potentially further significant digits. For instance, the second significant digit for any non-zero real number is defined as the digit to the right of the first significant digit. The third significant digit then directly follows the second significant digit to the right, and the fourth significant digit follows the third significant digit to the right. Therefore, the second and subsequent significant digits can take on 10 values (0, 1, 2, 3, …, 9). For example, the number 245, understood as the value of a ratio variable (thus, 245.0), has its first significant digit as 2, the second as 4, the third as 5, and the fourth as 0. The number 8.556 has 8 as its first significant digit, 5 as both its second and third significant digits, and 6 as the fourth significant digit. In
Table 4 we can see probability estimates for BL.
For data analysis, available data from the website energy-charts.com were used. To statistically process data using Benford’s Law, a sufficient amount of data is required. The analyses described in the next chapter were applied to data from 2015 to the present, specifically up to the end of April 2024. Data for Germany were selected for the analysis.
4. Results and Discussion
In the first step of the research, it was necessary to address the issue of the leading digit 0. Since Benford’s Law does not consider this digit, this problem needs to be resolved. Therefore, an initial experimental study was conducted for one market; specifically, the German market, which is one of the largest in Europe.
Table 5 shows the results of applying Benford’s Law to electricity prices in the German market. During the analysis of prices, it was found that in 2015, 2.7% of the values began with the digit 0. This can also be considered a statistical deviation. Therefore, electricity prices starting with the digit 0 were not considered further. Negative values are also not considered in
Table 5. Negative prices in 2015 accounted for approximately 1.3%, which is also a low value. Comparing the entire history of data from 2015 to May 2024, about 1.4% of numbers start with the digit 0. Negative prices in 2015 accounted for approximately 1.9%, which is also a low value. Combined, this is approximately 3.3%, which is still a low value. Considering only the digits 1 to 9, the results for the German market indicate that:
The digit 1 does not appear most frequently in 2015. It appears in exactly 11.08% of cases.
As mentioned above, the digit 0 occurred in 2.7% of cases in 2015. Prices starting with the digit 0 range from 0.00 to 0.99, which are prices close to 1 EUR/MWh. Therefore, the 2.7% occurrence of the digit 0 can be added to the percentage occurrence of the digit 1. This increases the occurrence of the digit 1 from 11.08% to 13.78%.
In
Table 5, the “Delta” value indicates the difference between the percentage occurrence of a digit according to Benford’s Law and the actual values applied to electricity prices. The analysis results show that for the digits 2, 3, and 4, the Delta value is negative. This means that the occurrence of these digits is higher than defined by Benford’s Law. A low Delta value means that the result is closest to the value from Benford’s Law.
The digit 5 has the lowest Delta value. Conversely, the digit 1 has the highest Delta value. As mentioned above, 2.7% of electricity prices start with the digit 0. If this number is added based on the explanation in point 2, the Delta value for the digit 1 is 16.32%.
Now, consider the scenario where we include negative electricity prices in the analysis in the following way: if the electricity price is −45 EUR/MWh, this price is added to the frequency count of the digit 4; if the price is −14.8 EUR/MWh, it is added to the frequency count of the digit 1, and so on. The results (
Table 6) show the following:
The most frequent digit is again digit 2, which appears in 33.25% of cases.
The digit 1 has the highest Delta value. Given the small number of negative prices in 2015, the results did not change significantly.
Similarly to the previous analysis, the frequency of the digits does not match the frequency predicted by Benford’s Law. The results could have been influenced by several factors. Firstly, in 2015, negative numbers did not occur as frequently as in other years, as shown in
Table 3.
In
Figure 4, the comparison of the frequency of the digit 1 for the years 2015 (blue curve) to 2024 with the frequency of 30.1% according to Benford’s Law (red curve) is shown. From the results displayed in
Table 5,
Table 7,
Table 8,
Table 9,
Table 10,
Table 11,
Table 12,
Table 13,
Table 14,
Table 15 and
Table 16, the following conclusions can be drawn:
From 2015 to 2020, the frequency of the digit 1 was relatively low and did not approach the expected frequency of 30.1%.
In 2021, the frequency of the digit 1 exceeded 20% and gradually increased. In 2024, it is currently below 20%, but the data only extend to the end of May.
In 2022 and 2023, the frequency of the digit 1 reached its highest values. This can be attributed to the fact that in these years, there was a high number of electricity prices above 100 EUR/MWh—see
Table 1. All prices ranging from 100 to 199.99 were counted under the digit 1.
In
Table 16, data for the entire period from 2015 to 2024 are displayed. The results show that the frequency of the digit 1 is 17.03%. This still does not approach the value of 30.1%. Among all digits, the highest Delta value is for the digit 1.
Figure 5 shows the comparison of the frequency of the digit 2 for the years 2015 (blue curve) to 2024 with a frequency of 17.6% according to Benford’s Law (red curve). The frequency of the digit 2 for the years 2015 to 2024 hovers around 17.6%. Considering all available data, the frequency of the digit 2 is 19.62%. The Delta value is −2.02%, which is relatively low. Therefore, it can be said that Benford’s Law can be applied to the digit 2.
The comparison of the frequency of the digit 3 is shown in
Figure 6. The results indicate that in recent years, low Delta values can be observed. For the entire data history from 2015 to 2024, the Delta value for the digit 3 is −9.36%, which is relatively high. Therefore, Benford’s Law cannot be applied to the digit 3.
Figure 7 represents the trend of the frequency of the digit 4 during the years 2015 to 2024. Values range from 2.52% to 27.93%, which are high values. The frequency value for the entire data set is 15.02%, which is also high. Similar to the case of digit 3, Benford’s Law cannot be applied to digit 4.
The frequency of the digit 5 ranges from 3.14% to 19.86% between 2015 and 2024 (
Figure 8). Considering all data from 2015, the frequency of the digit 5 is 8.48%. The values in individual years, as well as for the entire period, are high, and thus Benford’s Law cannot be applied to the digit 5.
The frequency of the digit 6 (
Figure 9) ranges from 1.12% to 17.18%, while according to Benford’s Law, the frequency of the digit 6 should be approximately 6.7%. The value of 17.18% is from 2024, where the data sample for the whole year is not yet available. If 2024 is excluded, the highest frequency of the digit 6 is 11.05%. The frequency of the digit 6 fluctuates significantly during this period. When applied to the entire data set, the frequency of the digit 6 is 4.61%, and the Delta value is 2.09%, which can be considered relatively low. Therefore, Benford’s Law could be applied to the digit 6 for the entire data set, but not for each year individually.
The trend of the frequency of the digit 7 (
Figure 10) is similar to that of the digit 6. The lowest frequency is 0.25%, and the highest is 16.88%. Considering that 2024 (where the frequency is 16.88%) is not yet complete, the highest value is 9.67%. Similarly, for the digit 7, when applied to the entire data set, the frequency is 3.79%, and the Delta value is 2.01%, which is relatively low. Therefore, Benford’s Law could be applied to the digit 7 for the entire data set, but not for each year individually.
The frequency of the digit 8 ranges from 0.21% to 13.34% (
Figure 11). Excluding 2024, the maximum value is 10.84%. For the entire data history, the frequency of the digit 8 is 3.46%, while according to Benford’s Law, it should be 5.1%. Therefore, as in the previous cases, Benford’s Law could be applied to the entire data set, but not to each year individually.
The lowest frequency of the digit 9 is 0.39%, while the highest is 14.21% (
Figure 12). For the entire data history, the frequency of the digit 9 is 3.3%, while according to Benford’s Law, it should be 4.6%. Therefore, as in the previous cases, Benford’s Law could be applied to the entire data set, but not to each year individually.
During this period, electricity prices were characterized by a relatively stable trend, influenced mainly by long-term factors such as the increasing share of renewable energy sources, low gas prices, and stable geopolitical conditions. Electricity prices followed a Gaussian distribution, with the main peak of the distribution being symmetric and located in the center. This indicates that there were no significant price fluctuations during this period.
In contrast, the period 2020–2022 was marked by significant changes in the electricity market, reflected in the price developments. The main factors were the COVID-19 pandemic, which caused a sharp drop in demand, the subsequent economic recovery leading to rising prices, and particularly the energy crisis due to the Russia–Ukraine conflict, which resulted in extremely volatile electricity prices. During this period, there was a notable increase in the number of price extremes, either very high or even negative. These changes caused the price distribution to become asymmetric, with the main peak of the distribution shifting towards lower prices, but with distinct extremes on both sides of the spectrum.
We are increasingly observing negative electricity prices on exchanges. Several analysts agree that the reason for this lies in photovoltaic power plants and wind turbines. Whether this is the case might be revealed by further research, which could determine a possible correlation between production from photovoltaics and wind turbines and electricity prices. Looking at current electricity prices, it is apparent that in countries where electricity is generated from conventional sources (nuclear, coal, etc.), negative prices do not appear as frequently. This is also due to the fact that the share of photovoltaic and wind power plants is not as significant. The issue of applying Benford’s law to negative prices would require further research. The range of negative electricity prices is not significant. Therefore, similar to the case of positive prices, the research would focus on analyzing the second digit in negative prices.
5. Conclusions
The application of Benford’s Law to electricity prices has not been previously studied. The results presented in this article indicate that Benford’s Law can be an effective tool for analyzing electricity prices, but with certain limitations and specifics. The analysis showed that for electricity prices in the German market from 2015 to May 2024, not all digits occurred as predicted by Benford’s Law. The most significant deviations were observed for the digit 1, whose frequency was significantly lower than Benford’s Law predicts.
One of the main reasons for these deviations is the fact that Benford’s Law does not account for prices starting with the digit 0, as well as negative prices that occur in the electricity market. These anomalies were removed in our analysis, which may have impacted the overall results. Prices ranging from 0 to 0.99 could be considered as starting with the digit 1 since the difference in price between 0 and 1 is minimal. On the other hand, prices starting with the digit 1 can range from 0 EUR/MWh to 19.99 EUR/MWh and 100 EUR/MWh to 199.99 EUR/MWh, which is not as common. This is also the reason why the frequencies of the digits 2, 3, and 4 are the highest. This is due to the frequent occurrence of prices ranging from 20 EUR/MWh to 49.99 EUR/MWh.
A significant part of our analysis was also monitoring the frequency of individual digits in various price ranges. We found that electricity prices above 100 EUR/MWh and negative prices affect the frequency of individual digits and should be included in future analyses for more accurate results.
The direction for future research could be realized in:
Extending the scope of analysis: It would be beneficial to extend the analysis to other electricity markets in Europe and other parts of the world to determine whether the observed deviations are specific to the German market or represent a global phenomenon.
Detailed analysis of prices above 100 EUR/MWh: As mentioned in the conclusion, prices above 100 EUR/MWh significantly impact digit frequency. Future research should focus on a more detailed examination of this impact and identifying patterns within these high prices.
Including negative prices: Negative electricity prices, although relatively rare (2–4%), can have a significant impact on the analysis. Future studies should include these prices and examine how they affect the overall frequency of digits according to Benford’s Law. This aspect needs further attention as negative prices are becoming increasingly common across almost all markets, not just in Germany.
Using Benford’s Law for prediction: Research should also focus on the potential use of Benford’s Law for predicting electricity prices. Although this area is not yet sufficiently explored, the potential for predictive analysis could be significant. Even if the application of Benford’s Law to electricity prices did not yield the desired results, the statistical frequency results of individual digits could be useful for further research, such as determining the probability of price ranges starting with a specific digit. Further research could focus on the analysis of the second digit. The results indicate that the most frequent digits are 1 (17.03%), 2 (19.62%), and 3 (21.86%). Considering that in 58.51% of cases, the price ranges between 1 EUR/MWh and 39.99 EUR/MWh, analyzing the second digit could potentially increase the accuracy of electricity price predictions.
Other statistical methods: In addition to Benford’s Law, other statistical methods should be explored for analyzing electricity prices. This could provide a more comprehensive view of the dynamics of the electricity market.
The results of the research and analysis of electricity prices in Germany suggest that a one-year data sample shows higher Delta values compared to analyses focused on a larger data history, specifically in this case, data from 2015 to 2024.
These directions for future research could contribute to a deeper understanding of electricity prices and the development of more effective tools for their analysis and prediction. Commodity prices are relatively specific, as it is unlikely (for example, in the case of electricity prices) that they would fall in the range of 0 EUR/MWh to 19.99 EUR/MWh and 100 EUR/MWh to 199.99 EUR/MWh, starting with the digit 1, in 30.1% of cases. According to Benford’s Law, the digit 1 should appear in 30.1% of cases.
In conclusion, despite the results, it can be stated that the application of Benford’s Law to electricity prices requires deeper analysis. As mentioned earlier, one possible approach is to exclude data from years that were atypical. However, since the onset of the pandemic, electricity prices have been unstable, and the years 2021 to 2024 are not representative of typical market conditions. These global factors have caused high volatility in the electricity market, which is likely to persist. Given that the application did not yield the expected results, future research should consider focusing on the second digit, which is not as clearly defined as the first digit. The second digit may appear more frequently in any form from 0 to 9. As previously noted, the first digit is significantly influenced by the electricity price itself.
Benford’s Law, which describes the distribution of leading digits in various data sets, can offer valuable insights when analyzing electricity prices. Although we observed some deviations from the expected values according to Benford’s Law, the statistical analysis of digit frequency can reveal interesting patterns that could form the basis for further research and potentially for price prediction. For instance, if we know that a certain price range has a high probability of starting with a specific leading digit, this information could be used in modeling or forecasting future prices.
Our results suggest that while Benford’s Law did not fully predict the distribution of electricity prices, the frequency of occurrence of individual digits provides fundamental data that could be utilized in predictive models. Specifically, we identified that certain digits, such as digit 1, occur at a frequency that differs from what is expected by Benford’s Law, particularly in years with extreme prices. This information can be used to identify likely price intervals in the future, which is a crucial step in developing predictive algorithms. In conclusion, it is important to emphasize that while this research focused on the analysis of electricity prices using Benford’s Law, similar analytical approaches could potentially be applied to other commodities, such as gas. Like electricity, gas is subject to significant market and geopolitical factors that influence its price. We anticipate that applying Benford’s Law to this commodity could reveal similar patterns and deviations as observed with electricity. However, this aspect requires further research to verify its suitability and effectiveness in predicting the prices of other commodities. By expanding this application, the results of our research could be applied more broadly and contribute to a better understanding of price dynamics across various markets.