Review and Evaluation of Multi-Agent Control Applications for Energy Management in Buildings
Abstract
:1. Introduction
1.1. Motivation
1.2. Literature Analysis Approach
- Article Selection: Primarily utilizing databases like Scopus and Google Scholar, this research involved a preliminary review of over 500 papers through abstracts, from which the most pertinent were chosen for in-depth examination.
- Keyword Research: Extensive keyword research was conducted, encompassing phrases like “Model-free Decentralized control in IBEMS”, “Adaptive Decentralized control in IBEMS”, “Multi-agent Control in IBEMS”, “Distributed Control in BEMS”, and specific terms related to each subsystem. This strategy ensures the capture of the diverse challenges and dynamics in optimizing IBEMS performance
- Data Collection: Data from each publication were categorized, focusing on the multi-agent control techniques used for IBEMS management and its application context. Considerations included the advantages, limitations, and practical implications, especially in optimal IBEMS management scenarios.
- Quality Assessment: Each selected study underwent a thorough quality evaluation based on citation count, the academic contribution of the authors, and the research methodologies used. This helped determine the relative significance and impact of each piece of research. It should be noted that the papers from the last decade that have been elected scored more than 10 citations according to Scopus. Such integration ensured a large number of integrated papers and also directed the evaluation and the final conclusions towards the field of multi-agent control in IBEMSs.
- Data Synthesis: Finally, the gathered insights were organized into distinct categories, facilitating straightforward comparisons and a clear understanding of the research landscape in model-based and model-free multi-agent control for IBEMS.
1.3. Previous Work
1.4. Novelty and Contribution
1.5. Paper Structure
2. Integrated Building Energy Management Systems
2.1. Primary IBEMS Subsystems
- HVACs: Essential for maintaining indoor climate by regulating temperature, humidity, and air quality, using components like furnaces, air conditioners, and heat pumps.
- DHW: Provides hot water for household and commercial use, typically involving water heaters and storage tanks to ensure a steady supply.
- LS: Lighting systems, including LEDs and smart controls, provide appropriate illumination, enhancing safety and comfort.
- RES: Systems such as solar panels and wind turbines generate power from sustainable sources, offering a clean energy alternative.
- ESS: Store excess energy from a RES for later use, ensuring a consistent energy supply during high demand or low generation periods.
- EVs: Integrated into a BEMS as mobile energy storage, EVs can be charged during low-demand periods and supply power during peak times, aiding in load management.
2.2. Multi-Agent Control Types for IBEMS
- Centralized Multi-Agent Control (CE): In this approach, a single central agent makes decisions based on information from all other agents, enabling globally optimal solutions and highly efficient energy management. However, it is prone to scalability issues and risks a single point of failure.
- Decentralized Multi-Agent Control (DE): Here, multiple agents operate independently, making decisions based on local observations. This offers robustness and scalability, as the system is resilient to individual agent failures. The main challenge is the potential for suboptimal global performance due to limited coordination.
- Cooperative Multi-Agent Control (CO): Agents in this approach share information and coordinate actions to achieve common goals, balancing local and global optimization. While this enhances overall efficiency, it requires reliable communication networks and complex algorithms, which can add cost and complexity.
- Non-Cooperative Multi-Agent Control (Non-CO): This approach is characterized by agents operating independently without coordination, offering simplicity and ease of implementation. However, it can lead to inefficiencies and conflicts, as agents may work at cross-purposes, reducing overall system performance.
2.3. General Description of Multi-Agent Control Processes in IBEMS Applications
- Model-based Control
- 1.
- Data Acquisition: Collects real-time and historical data to capture the current state and improve model accuracy.
- 2.
- System Modeling: Develops and refines models based on data, crucial for predicting future states.
- 3.
- Future State Prediction: Uses models to predict future states and optimize control actions across subsystems.
- 4.
- Control Strategy Execution: Executes optimized actions and makes real-time adjustments based on feedback.
- 5.
- Performance Evaluation and Feedback: Monitors performance and refines models for continuous improvement.
- Model-free Control
- 1.
- Data Acquisition: Collects real-time and historical data for pattern recognition and learning.
- 2.
- Learning Algorithm Initialization: Selects and initializes model-free algorithms like reinforcement learning.
- 3.
- Real-Time Learning and Adaptation: Continuously adapts decisions based on new data and learning outcomes.
- 4.
- Control Strategy Execution: Executes control actions independently in each subsystem based on learning outcomes.
- 5.
- Performance Evaluation and Feedback: Monitors performance and refines algorithms for ongoing improvement.
3. Mathematical Concept Multi-Agent Control Methodologies for IBEMS
3.1. Reinforcement Learning
- S is the set of possible states. Each state represents a specific configuration of the building environment, such as temperature levels, energy consumption, and occupant activities.
- A is the set of actions available to the agent. An action could represent decisions like adjusting the HVAC system or controlling lighting levels.
- is the state transition probability function, which describes the likelihood of moving to a new state after taking action a in state s. This probability models the dynamics of the environment.
- is the reward function, which assigns a numerical value to each state–action pair. The reward serves as feedback to the agent, encouraging actions that lead to desirable states (e.g., high energy efficiency or occupant comfort) and discouraging undesirable ones.
- is the discount factor, which determines the weight given to future rewards. A lower prioritizes immediate rewards, while a higher encourages the agent to plan for long-term benefits.
- Value-based: Methods like Q-learning and Deep Q-Networks (DQNs) estimate action–state values, suitable for discrete actions but limited in complex, continuous tasks.
- Policy-based: Methods like Proximal Policy Optimization (PPO) optimize policies directly, effective for continuous control but computationally intensive.
- Actor–critic: Combining value- and policy-based methods, techniques like Soft Actor–Critic (SAC) balance exploration and exploitation, ideal for complex, multi-agent environments but requiring careful tuning.
3.2. Model Predictive Control
- is the state vector at time t, representing the internal state of agent i’s subsystem (e.g., temperature in a zone or energy usage of a system);
- is the control input vector;
- is the state transition matrix, governing how the state evolves over time in the absence of control inputs;
- is the control input matrix, which defines how control actions influence the state evolution.
3.3. Evolutionary Algorithms
- is the velocity of particle i at time t;
- w is the inertia weight that controls the exploration–exploitation trade-off;
- and are cognitive and social coefficients, respectively;
- and are random numbers drawn from the uniform distribution in ;
- is the particle’s personal best position;
- is the global best position found by the swarm.
- 1.
- Selection: Individuals are selected based on their fitness, with higher-fitness individuals more likely to reproduce.
- 2.
- Crossover: Selected individuals are paired to exchange genetic material, combining their chromosomes (solution encodings) to produce offspring.
- 3.
- Mutation: Random changes are introduced to offspring to maintain diversity and prevent premature convergence.
3.4. Alternating Direction Method of Multipliers
- represents the local decision variables of agent i;
- z is a global variable shared among the agents;
- is the local cost function for agent i;
- is a global cost function;
- and are matrices that define the coupling between local and global variables;
- is a constant vector representing constraints.
- 1.
- Update : Each agent solves its local optimization problem based on its current state and the global variable z.Here, is a penalty parameter that controls the convergence speed, and is the Lagrange multiplier associated with the constraints.
- 2.
- Update z: A central coordinator updates the global variable z by solving
- 3.
- Update Lagrange multipliers : Each agent updates its local multiplier:
4. Review on Multi-Agent Control for Energy Management Systems
- Ref.: illustrating the reference application in the first column;
- Year: illustrating the publication year of for each research application;
- Type: defining whether the applied multi-agent control strategy concerns the centralized (CE); decentralized (DE); cooperative (CO); or non-cooperative (Non-CO) interaction type between the agents;
- Method: illustrating the specific RL; MPC; ADMM; evolutionary algorithms; Lyapunov optimization; hybrid; or other multi-agent algorithmic methodologies applied in each work;
- IBEMS: illustrating the specific integrated equipment that each IBEMS integrates considering the most common devices in the building setting: HVAC, RES, ESS, LS, EVs, DHW, and Other;
- Residential: defining if the testbed application concerns a Residential Building Control Application with an “x”;
- Commercial: defining if the testbed application concerns a Commercial Building Control Application with an “x”;
- Simulation: defining if the testbed application concerns a Simulative Building Control Application with an “x”;
- Real-life: defining if the testbed application concerns a Real-world or Real-life Building Control Application with an “x”;
- Citations: portrays the number of citations—according to Scopus—of each work.
Ref. | Year | Type | Method | IBEMS | Residential | Commercial | Simulation | Real-Life | Citations |
---|---|---|---|---|---|---|---|---|---|
[55] | 2015 | CO | Q-learning | RES/ESS | x | x | 37 | ||
[56] | 2017 | CO | BL | RES/DHW/ESS | x | x | 218 | ||
[57] | 2017 | DE/CO | eJAL | HVAC/LS/Other | x | x | 50 | ||
[58] | 2018 | DE/CO | Q-learning | RES/ESS/Other | x | x | 179 | ||
[59] | 2019 | DE/CO | DQN | RES/ESS/Other | x | x | 34 | ||
[60] | 2020 | DE/CO | DQN | HVAC | x | x | x | 77 | |
[61] | 2020 | CO | DQN | HVAC | x | x | 42 | ||
[62] | 2020 | CO | A2C | HVAC/RES/ESS | x | x | 112 | ||
[63] | 2020 | DE | Q-learning | HVAC/RES/EVs | x | x | 209 | ||
[64] | 2020 | DE | ISA | HVAC/RES/Other | x | x | 43 | ||
[65] | 2021 | CO | MAAC | HVAC | x | x | 175 | ||
[66] | 2022 | CO | DQN | HVAC | x | x | 21 | ||
[67] | 2022 | CO | MAAC | HVAC | x | x | 38 | ||
[68] | 2022 | CO | D3QN | HVAC/RES/ESS | x | x | 50 | ||
[69] | 2022 | CO | Q-learning | HVAC | x | x | 23 | ||
[70] | 2022 | CE/CO | BDQ | HVAC | x | x | x | 48 | |
[71] | 2022 | CO | SAC | RES/ESS | x | x | x | 21 | |
[72] | 2022 | DE | ACKTR | RES/ESS/Other | x | x | 8 | ||
[73] | 2022 | CO | TD3 | RES/ESS/Other | x | x | x | 17 | |
[74] | 2023 | DE/CO | PPO | RES/ESS | x | x | x | 11 | |
[75] | 2023 | DE/CO | DDPG | HVAC/RES/ESS | x | x | x | 30 | |
[76] | 2023 | DE/CO | MAAC | HVAC | x | x | 18 | ||
[77] | 2023 | Non-CO/DE | SAC | RES/ESS | x | x | 11 | ||
[78] | 2023 | CO | DQN | HVAC | x | x | 15 |
Ref. | Year | Agent | Method | IBEMS | Residential | Commercial | Simulation | Real-Life | Citations |
---|---|---|---|---|---|---|---|---|---|
[79] | 2014 | DE | N/A | HVAC | x | x | 16 | ||
[80] | 2017 | DE | E-MPC | HVAC | x | x | 66 | ||
[81] | 2017 | DE | Grey-Box | HVAC | x | x | 12 | ||
[82] | 2018 | DE/CO | Game | HVAC | x | x | x | 13 | |
[83] | 2018 | CO | N/A | HVAC | x | x | 34 | ||
[84] | 2018 | DE | EBDC | RES/EVs | x | x | x | 71 | |
[85] | 2019 | DE | DP | HVAC/RES/EVs | x | x | 11 | ||
[86] | 2020 | DE | QP | HVAC/ESS | x | x | 12 | ||
[87] | 2020 | DE | Naive | HVAC | x | x | 44 | ||
[88] | 2020 | DE | Hierarchical | HVAC/RES/ESS | x | x | x | 18 | |
[89] | 2020 | DE | DP | HVAC | x | x | 31 | ||
[90] | 2021 | DE | N/A | HVAC | x | x | x | 28 | |
[91] | 2022 | CO | DD | HVAC/ESS/Other | x | x | x | 29 |
Ref. | Year | Agent | Method | IBEMS | Residential | Commercial | Simulation | Real-Life | Citations |
---|---|---|---|---|---|---|---|---|---|
[92] | 2015 | CO | PSO | HVAC/LS/ESS | x | x | 71 | ||
[93] | 2016 | CO | GA | HVAC/LS/RES/ESS | x | x | 69 | ||
[94] | 2017 | CO | PSO | HVAC/LS/RES/ESS | x | x | 20 | ||
[95] | 2018 | CO | GA | HVAC/LS/RES/ESS | x | x | 43 | ||
[96] | 2023 | DE/CO | PSO | HVAC/RES/ESS/EVs | x | x | 16 |
Ref. | Year | Agent | Method | IBEMS | Residential | Commercial | Simulation | Real-Life | Citations |
---|---|---|---|---|---|---|---|---|---|
[97] | 2016 | CO | ADMM | HVAC | x | x | 72 | ||
[98] | 2017 | CO | J-ADMM | HVAC | x | x | 34 | ||
[99] | 2019 | CO | NC-ADMM | RES | x | x | 73 | ||
[100] | 2020 | CO | J-ADMM | HVAC | x | x | 54 | ||
[101] | 2021 | DE/CO | J-ADMM | HVAC | x | x | 21 | ||
[102] | 2021 | DE/CO | DC-ADMM | HVAC/RES/ESS/EVs | x | x | x | 80 | |
[103] | 2022 | CO | H-ADMM | HVAC | x | x | 11 |
Ref. | Year | Agent | Method | IBEMS | Residential | Commercial | Simulation | Real-Life | Citations |
---|---|---|---|---|---|---|---|---|---|
[104] | 2014 | Non-CO | LCMA | RES/EES/Other | x | x | 147 | ||
[105] | 2014 | Non-CO | Lyapunov | HVAC | x | x | 87 | ||
[106] | 2017 | Non-CO | Lyapunov | HVAC | x | x | 50 |
Ref. | Year | Agent | Method | IBEMS | Residential | Commercial | Simulation | Real-Life | Citations |
---|---|---|---|---|---|---|---|---|---|
[107] | 2014 | CO | ULDC/FLC | HVAC | x | x | 162 | ||
[108] | 2016 | DE | PSO/ANN | HVAC | x | x | 73 | ||
[109] | 2018 | CO | CBR/ANN | HVAC | x | x | 105 | ||
[110] | 2018 | CO | ADMM/MPC | HVAC | x | x | x | 31 | |
[111] | 2020 | CE | RL/ANN | HVAC | x | x | 60 | ||
[112] | 2020 | CE | RL/ANN | HVAC | x | x | 96 | ||
[113] | 2021 | CO | PSO/MPC | HVAC/RES/ESS | x | x | x | 30 | |
[114] | 2021 | CO | PSO/MPC | HVAC | x | x | 19 | ||
[115] | 2022 | CO | RL/FLC | HVAC/LS/Other | x | x | 29 | ||
[116] | 2022 | CO | ADMM/MPC | HVAC | x | x | 19 |
Ref. | Year | Agent | Method | IBEMS | Residential | Commercial | Simulation | Real-Life | Citations |
---|---|---|---|---|---|---|---|---|---|
[117] | 2014 | DE | Consensus | HVAC/EVs/Other | x | x | 225 | ||
[118] | 2016 | CO | N/A | HVAC | x | x | x | 46 | |
[119] | 2016 | CO | SoC | RES/ESS | x | x | x | 58 | |
[120] | 2018 | CE/CO | L4GPCAO | HVAC | x | x | 51 | ||
[121] | 2019 | DE/CO | TBSA | HVAC | x | x | 85 | ||
[122] | 2019 | DE | CHPD | HVAC/RES/ESS | x | x | 44 | ||
[123] | 2020 | CO | DAC | HVAC | x | x | 41 | ||
[124] | 2021 | DE/CO | Consensus | RES/ESS/Other | x | x | x | 49 | |
[125] | 2021 | CE/DE/CO | RPC/APT | HVAC/DHW/RES/ESS | x | x | 10 | ||
[126] | 2021 | CO | Hierarchical | RES/ESS/Other | x | x | x | 25 | |
[127] | 2021 | DE/CO | Blockchain | HVAC/LS/Other | x | x | 35 | ||
[128] | 2022 | CO | N/A | HVAC/RES/ESS | x | x | 22 |
4.1. Model-Free Control Strategies
Reinforcement Learning
4.2. Model-Based Control Strategies
4.2.1. Model Predictive Control (MPC)
4.2.2. Model-Based Control with Evolutionary-Based Optimization
4.2.3. Model-Based Control with ADMM-Based Optimization
4.2.4. Model-Based Control with Lyapunov-Based Optimization
4.3. Hybrid Strategies
4.4. Other Model-Free and Model-Based Strategies
5. Evaluation
5.1. Evaluation per Multi-Agent Algorithmic Methodology
5.1.1. Reinforcement Learning
5.1.2. Model Predictive Control
5.1.3. Model-Based Control with Evolutionary-Based Optimization
5.1.4. Model-Based Control with ADMM-Based Optimization
5.1.5. Model-Based Control with Lyapunov-Based Optimization
5.1.6. Hybrid Strategies
5.2. Evaluation per Agent Type
5.3. Evaluation per BEMS Type
5.4. Evaluation per Building Type
5.5. Evaluation per Application Type
5.6. Evaluation per Simulation Tool
6. Conclusions
- Prevalence of Algorithmic Control Methodologies: RL remains the most prominent algorithmic approach for multi-agent control in IBEMS, with methods such as Q-learning, Deep Q-Networks, and actor–critic approaches (e.g., MAAC) being the most frequently used. Such strategies excelled at handling complex environments with large state spaces, and their effectiveness in optimizing energy management and occupant comfort makes them a top choice for multi-agent scenarios. However, it is observed that, overall, model-based control strategies are more commonly used in the literature than model-free ones. In particular, MPC and other model-based approaches, such as ADDM-based and PSO-based, have been shown to offer strong predictive capabilities and real-time adaptability, making them essential for managing dynamic environments where anticipatory control is required. The dominance of model-based control methodologies, however, highlights an opportunity for modern research to delve deeper into model-free methodologies, which offer more flexible and adaptive solutions that are potentially less reliant on predefined system models. This trend could also be supported by the rise of IoT technologies, enabling more data-driven, real-time control without the need for complex system modeling.
- Rising Adoption of Hybrid Methodologies: Hybrid control schemes, integrating methodologies like RL, MPC, EAs, and ANN/FLC, are becoming increasingly popular due to their ability to leverage the strengths of multiple techniques. Such methods offer enhanced flexibility and adaptability, particularly in dynamically changing environments where multiple control objectives must be optimized simultaneously. According to the evaluation, hybrid methodologies hold strong potential for the future, not only due to their ability to synergize strengths from different algorithmic paradigms but also because they provide a framework for adaptive control in increasingly complex and uncertain environments. To this end, as the complexity of IBEMS frameworks continues to grow—requiring robust coordination and real-time adaptability—hybrid methodologies will play a crucial role in optimizing energy management, improving system efficiency, and enhancing occupant comfort.
- Prevalence of Cooperative Agent Type Strategies: According to the evaluation, cooperative agents represent a particularly advantageous scheme in multi-agent control systems, especially in RL and evolutionary algorithms. Cooperation enables agents to share critical information and align their objectives, which is crucial in interdependent systems like HVAC, RES, and ESS. By coordinating actions, cooperative agents can optimize energy management more effectively than decentralized agents, which operate independently and may struggle in complex environments with high interdependencies. The ability to exchange experiences accelerates learning in RL, leading to better collective outcomes. Similarly, in evolutionary algorithms, cooperation prevents premature convergence by promoting diverse exploration of the solution space. In contrast, purely decentralized approaches may underperform in scenarios requiring high coordination, highlighting the superiority of cooperative methods in managing complex, distributed energy systems.
- Constraints on Multi-Equipment Integration in IBEMS: The evaluation highlights HVAC systems as the predominant focus of multi-agent control in IBEMS due to their high energy consumption. RES, often combined with ESS, also feature prominently, contributing to sustainable building energy management. A key trend identified concerns the relatively low frequency of applications involving more than four types of systems. While multi-system interactions involving HVAC, RES, ESS, and EVs remain limited, such integrations are essential in real-world applications for achieving comprehensive energy efficiency. As IBEMS grow more complex with diverse subsystems, the need for advanced multi-agent control methods capable of managing this complexity, ensuring real-time adaptability, and facilitating effective inter-agent coordination, becomes increasingly critical.
- Limited Real-life Implementations: Despite the potential of multi-agent control systems, real-world applications are still limited due to the high cost, complexity, and logistical challenges of deployment. However, it is noticeable that real-life implementations tend to favor novel approaches–like Hybrid or Other approaches—because they offer greater flexibility and customization to specific building conditions. Unlike conventional approaches, customized approaches may provide a better handling of real-world complexities, uncertainties, and system-specific constraints, making them more practical for deployment in diverse and dynamic environments. Such a fact is also supported by numerous literature works, where comparisons often highlight that novel approaches outperform conventional methods. Another key observation is that real-life implementations predominantly focus on commercial buildings, underscoring the need for more extensive research and deployment at the residential level. Given the unique energy demands and dynamics of residential environments, there is a clear opportunity to expand multi-agent control systems to optimize energy use and sustainability in homes, which remains underexplored in comparison to commercial real-world applications.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
AC | Actor–critic |
ACKTR | Actor–critic using Kronecker-Factored Trust Region |
ADMM | Alternating Direction Method of Multipliers |
ANN | Artificial Neural Network |
BDQ | Branching Dueling Q-network |
BOC | Building Optimization and Control |
CQL | Coordinated Q-learning |
CO | Cooperative |
D3QN | Dueling Double Deep Q-Network |
DCCMARL | Deep Clustering of Cooperative Multi-agent Reinforcement Learning |
DE | Decentralized |
DHW | Domestic Hot Water systems |
DQN | Deep Q-Networks |
DR | Demand Response |
EA | Evolutionary Algorithm |
E-MPC | Economic-MPC |
ESS | Energy Storage Systems |
EVs | Electric Vehicles |
FLC | Fuzzy logic control |
GA | Genetic Algorithm |
HVAC | Heating, Ventilation, and Air conditioning |
IBEMS | Integrated Building Energy Management Systems |
IoT | Internet of Things |
J-ADMM | Jacobian ADMM |
L4GPCAO | Local-for-Global Parameterized Cognitive Adaptive Optimization |
LCMA | Lyapunov-based Cost Minimization Algorithm |
LSTM | Long short-term memory networks |
MAAC | Multi-Actor Attention–Critic |
MARDL | Multi-agent reinforcement learning |
MDP | Markov Decision Process |
MLP | Multi-Layer Perceptron |
MPC | Model Predictive Control |
N/A | Not identified |
Non-CO | Non-cooperative |
NR | Newton–Raphson |
PPO | Proximal Policy Optimization |
PSO | Particle Swarm Optimization |
QP | Quadratic Programming |
RES | Renewable Energy Sources |
SAC | Soft Actor–Critic |
Smart-TBSA | Smart-Token-Based Scheduling Algorithm |
SoC | State of Charge |
SoS | System of Systems |
SQP | Sequential Quadratic Programming |
TD3 | Twin-Delayed Deep Deterministic |
References
- Santamouris, M.; Feng, J. Recent progress in daytime radiative cooling: Is it the air conditioner of the future? Buildings 2018, 8, 168. [Google Scholar] [CrossRef]
- Zhang, W.; Wu, Y.; Calautit, J.K. A review on occupancy prediction through machine learning for enhancing energy efficiency, air quality and thermal comfort in the built environment. Renew. Sustain. Energy Rev. 2022, 167, 112704. [Google Scholar] [CrossRef]
- Zhou, Y.; Liu, J. Advances in emerging digital technologies for energy efficiency and energy integration in smart cities. Energy Build. 2024, 315, 114289. [Google Scholar] [CrossRef]
- Papada, L.; Balaskas, A.; Katsoulakos, N.; Kaliampakos, D.; Damigos, D. Fighting energy poverty using user-driven approaches in mountainous Greece: Lessons learnt from a living lab. Energies 2021, 14, 1525. [Google Scholar] [CrossRef]
- Minelli, F.; Ciriello, I.; Minichiello, F.; D’Agostino, D. From Net Zero Energy Buildings to an energy sharing model-The role of NZEBs in Renewable Energy Communities. Renew. Energy 2024, 223, 120110. [Google Scholar] [CrossRef]
- Dhabliya, D.; Gopalakrishnan, S.; Mudigonda, A.; Omirbayevna, T.G.; Rajalakshmi, K.; Kulshreshtha, K.; Shnain, A.H.; KrishnaBhargavi, Y. Utilizing Big Data and environmentally-focused innovations to create smart, sustainable cities by integrating energy management, energy-efficient buildings, pollution mitigation, and urban circulation. In Proceedings of the 2023 International Conference for Technological Engineering and Its Applications in Sustainable Development (ICTEASD), Al-Najaf, Iraq, 14–15 November 2023; IEEE: New York, NY, USA, 2023; pp. 402–408. [Google Scholar]
- Michailidis, P.; Michailidis, I.; Gkelios, S.; Kosmatopoulos, E. Artificial Neural Network Applications for Energy Management in Buildings: Current Trends and Future Directions. Energies 2024, 17, 570. [Google Scholar] [CrossRef]
- Hannan, M.A.; Faisal, M.; Ker, P.J.; Mun, L.H.; Parvin, K.; Mahlia, T.M.I.; Blaabjerg, F. A review of internet of energy based building energy management systems: Issues and recommendations. IEEE Access 2018, 6, 38997–39014. [Google Scholar] [CrossRef]
- Ahmed, I.; Asif, M.; Alhelou, H.H.; Khalid, M.; Khalid, M. A review on enhancing energy efficiency and adaptability through system integration for smart buildings. J. Build. Eng. 2024, 89, 109354. [Google Scholar]
- Michailidis, P.; Pelitaris, P.; Korkas, C.; Michailidis, I.; Baldi, S.; Kosmatopoulos, E. Enabling optimal energy management with minimal IoT requirements: A legacy A/C case study. Energies 2021, 14, 7910. [Google Scholar] [CrossRef]
- Ahn, J. An Adaptive Control Model for Thermal Environmental Factors to Supplement the Sustainability of a Small-Sized Factory. Sustainability 2023, 15, 16619. [Google Scholar] [CrossRef]
- Minelli, F.; D’Agostino, D.; Migliozzi, M.; Minichiello, F.; D’Agostino, P. PhloVer: A Modular and integrated tracking photovoltaic shading Device for sustainable large urban spaces—preliminary Study and prototyping. Energies 2023, 16, 5786. [Google Scholar] [CrossRef]
- Fotopoulou, M.C.; Drosatos, P.; Petridis, S.; Rakopoulos, D.; Stergiopoulos, F.; Nikolopoulos, N. Model predictive control for the energy Management in a District of buildings equipped with building integrated photovoltaic systems and batteries. Energies 2021, 14, 3369. [Google Scholar] [CrossRef]
- Papantoniou, S.; Mangili, S.; Mangialenti, I. Using intelligent building energy management system for the integration of several systems to one overall monitoring and management system. Energy Procedia 2017, 111, 639–647. [Google Scholar] [CrossRef]
- Tsaousoglou, G.; Efthymiopoulos, N.; Makris, P.; Varvarigos, E. Multistage energy management of coordinated smart buildings: A multiagent Markov decision process approach. IEEE Trans. Smart Grid 2022, 13, 2788–2797. [Google Scholar] [CrossRef]
- Xia, W.; Goh, J.; Cortes, C.A.; Lu, Y.; Xu, X. Decentralized coordination of autonomous AGVs for flexible factory automation in the context of Industry 4.0. In Proceedings of the 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), Hong Kong, China, 20–21 August 2020; IEEE: New York, NY, USA, 2020; pp. 488–493. [Google Scholar]
- Tan, K.K.; Putra, A.S. Drives and Control for Industrial Automation; Springer Science & Business Media: London, UK, 2010. [Google Scholar]
- Tsintotas, K.A.; Kansizoglou, I.; Konstantinidis, F.K.; Mouroutsos, S.G.; Syrakoulis, G.C.; Psarommatis, F.; Aloimonos, Y.; Gasteratos, A. Active vision: A promising technology for achieving zero-defect manufacturing. Procedia Comput. Sci. 2024, 232, 2821–2830. [Google Scholar] [CrossRef]
- Michailidis, I.T.; Manolis, D.; Michailidis, P.; Diakaki, C.; Kosmatopoulos, E.B. Autonomous self-regulating intersections in large-scale urban traffic networks: A Chania City case study. In Proceedings of the 2018 5th International Conference on Control, Decision and Information Technologies (CoDIT), Thessaloniki, Greece, 10–13 April 2018; IEEE: New York, NY, USA, 2018; pp. 853–858. [Google Scholar]
- Michailidis, I.T.; Manolis, D.; Michailidis, P.; Diakaki, C.; Kosmatopoulos, E.B. A decentralized optimization approach employing cooperative cycle-regulation in an intersection-centric manner: A complex urban simulative case study. Transp. Res. Interdiscip. Perspect. 2020, 8, 100232. [Google Scholar] [CrossRef]
- Baskar, L.; De Schutter, B.; Hellendoorn, H. Decentralized traffic control and management with intelligent vehicles. In Proceedings of the 9th TRAIL Congress, Delft, The Netherlands, 21 November 2006; Volume 1, p. 3. [Google Scholar]
- Karatzinis, G.D.; Michailidis, P.; Michailidis, I.T.; Kapoutsis, A.C.; Kosmatopoulos, E.B.; Boutalis, Y.S. Coordinating heterogeneous mobile sensing platforms for effectively monitoring a dispersed gas plume. Integr. Comput.-Aided Eng. 2022, 29, 411–429. [Google Scholar] [CrossRef]
- Figetakis, E.; Bello, Y.; Refaey, A.; Shami, A. Decentralized semantic traffic control in avs using rl and dqn for dynamic roadblocks. arXiv 2024, arXiv:2406.18741. [Google Scholar]
- Cervo, A.; Goldoni, G.; Fantuzzi, C.; Borsari, R. Decentralized line equipment detection and production control by multi-agent technology. In Proceedings of the IECON 2019-45th Annual Conference of the IEEE Industrial Electronics Society, Lisbon, Portugal, 14–17 October 2019; IEEE: New York, NY, USA, 2019; Volume 1, pp. 2940–2945. [Google Scholar]
- Ramadan, R.; Huang, Q.; Zalhaf, A.S.; Bamisile, O.; Li, J.; Mansour, D.E.A.; Lin, X.; Yehia, D.M. Energy Management in Residential Microgrid Based on Non-Intrusive Load Monitoring and Internet of Things. Smart Cities 2024, 7, 1907–1935. [Google Scholar] [CrossRef]
- Hamid, M.N.A.; Banakhr, F.A.; Mohamed, T.H.; Ali, S.M.; Mahmoud, M.M.; Mosaad, M.I.; Albla, A.A.H.; Hussein, M.M. Adaptive Frequency Control of an Isolated Microgrids Implementing Different Recent Optimization Techniques. Int. J. Robot. Control Syst. 2024, 4, 1000–1012. [Google Scholar] [CrossRef]
- Mussi, M.; Pellegrino, L.; Pindaro, O.F.; Restelli, M.; Trovò, F. A Reinforcement Learning controller optimizing costs and battery State of Health in smart grids. J. Energy Storage 2024, 82, 110572. [Google Scholar] [CrossRef]
- Rajani, D.; Gopal, J.V.; Saxena, A.; Salman, Z.N.; Jain, A. Enhancing Electric Vehicle Charging with Dynamic Adaptation: A Machine Learning Approach to Improving Grid Alignment Precision. In Proceedings of the 2024 International Conference on Communication, Computer Sciences and Engineering (IC3SE), Gautam Buddha Nagar, India, 9–11 May 2024; IEEE: New York, NY, USA, 2024; pp. 702–707. [Google Scholar]
- Sivakumar, S.; Yadav, M.R.; Bharathi, A.; Akila, D.; Dineshkumar, P.; Banupriya, V. Reinforcement Learning Driven Smart Charging Algorithms to Enhance Battery Lifespan and Grid Sustainability. In Proceedings of the 2024 International Conference on Advances in Modern Age Technologies for Health and Engineering Science (AMATHE), Shivamogga, India, 16–17 May 2024; IEEE: New York, NY, USA, 2024; pp. 1–7. [Google Scholar]
- Michailidis, P.; Michailidis, I.; Vamvakas, D.; Kosmatopoulos, E. Model-Free HVAC Control in Buildings: A Review. Energies 2023, 16, 7124. [Google Scholar] [CrossRef]
- Gao, C.; Wang, D. Comparative study of model-based and model-free reinforcement learning control performance in HVAC systems. J. Build. Eng. 2023, 74, 106852. [Google Scholar] [CrossRef]
- Biagioni, D.; Zhang, X.; Adcock, C.; Sinner, M.; Graf, P.; King, J. From model-based to model-free: Learning building control for demand response. arXiv 2022, arXiv:2210.10203. [Google Scholar]
- Lazaridis, C.R.; Michailidis, I.; Karatzinis, G.; Michailidis, P.; Kosmatopoulos, E. Evaluating Reinforcement Learning Algorithms in Residential Energy Saving and Comfort Management. Energies 2024, 17, 581. [Google Scholar] [CrossRef]
- Gautam, M. Deep Reinforcement learning for resilient power and energy systems: Progress, prospects, and future avenues. Electricity 2023, 4, 336–380. [Google Scholar] [CrossRef]
- Xin, X.; Zhang, Z.; Zhou, Y.; Liu, Y.; Wang, D.; Nan, S. A comprehensive review of predictive control strategies in Heating, Ventilation, and Air-conditioning (HVAC): Model-free VS Model. J. Build. Eng. 2024, 94, 110013. [Google Scholar] [CrossRef]
- Vamvakas, D.; Michailidis, P.; Korkas, C.; Kosmatopoulos, E. Review and evaluation of reinforcement learning frameworks on smart grid applications. Energies 2023, 16, 5326. [Google Scholar] [CrossRef]
- Kurte, K.; Amasyali, K.; Munk, J.; Zandi, H. Comparative analysis of model-free and model-based HVAC control for residential demand response. In Proceedings of the 8th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Coimbra, Portugal, 17–18 November 2021; pp. 309–313. [Google Scholar]
- Mahela, O.P.; Khosravy, M.; Gupta, N.; Khan, B.; Alhelou, H.H.; Mahla, R.; Patel, N.; Siano, P. Comprehensive overview of multi-agent systems for controlling smart grids. CSEE J. Power Energy Syst. 2020, 8, 115–131. [Google Scholar]
- Naylor, S.; Gillott, M.; Lau, T. A review of occupant-centric building control strategies to reduce building energy use. Renew. Sustain. Energy Rev. 2018, 96, 1–10. [Google Scholar] [CrossRef]
- Merabet, G.H.; Essaaidi, M.; Haddou, M.B.; Qolomany, B.; Qadir, J.; Anan, M.; Al-Fuqaha, A.; Abid, M.R.; Benhaddou, D. Intelligent building control systems for thermal comfort and energy-efficiency: A systematic review of artificial intelligence-assisted techniques. Renew. Sustain. Energy Rev. 2021, 144, 110969. [Google Scholar] [CrossRef]
- Kathirgamanathan, A.; De Rosa, M.; Mangina, E.; Finn, D.P. Data-driven predictive control for unlocking building energy flexibility: A review. Renew. Sustain. Energy Rev. 2021, 135, 110120. [Google Scholar] [CrossRef]
- Al Dakheel, J.; Del Pero, C.; Aste, N.; Leonforte, F. Smart buildings features and key performance indicators: A review. Sustain. Cities Soc. 2020, 61, 102328. [Google Scholar] [CrossRef]
- Latif, M.; Nasir, A. Decentralized stochastic control for building energy and comfort management. J. Build. Eng. 2019, 24, 100739. [Google Scholar] [CrossRef]
- Wang, Z.; Hong, T. Reinforcement learning for building controls: The opportunities and challenges. Appl. Energy 2020, 269, 115036. [Google Scholar] [CrossRef]
- Váňa, Z.; Cigler, J.; Širokỳ, J.; Žáčeková, E.; Ferkl, L. Model-based energy efficient control applied to an office building. J. Process. Control. 2014, 24, 790–797. [Google Scholar] [CrossRef]
- Moroşan, P.D.; Bourdais, R.; Dumur, D.; Buisson, J. Building temperature regulation using a distributed model predictive control. Energy Build. 2010, 42, 1445–1452. [Google Scholar] [CrossRef]
- Eser, S.; Stoffel, P.; Kümpel, A.; Müller, D. Distributed model predictive control of a nonlinear building energy system using consensus ADMM. In Proceedings of the 2022 30th Mediterranean Conference on Control and Automation (MED), Athens, Greece, 28 June–1 July 2022; pp. 902–907. [Google Scholar]
- Yang, R.; Wang, L. Multi-zone building energy management using intelligent control and optimization. Sustain. Cities Soc. 2013, 6, 16–21. [Google Scholar] [CrossRef]
- Smitha, S.; Chacko, F.M. Intelligent energy management in smart and sustainable buildings with multi-agent control system. In Proceedings of the 2013 International Mutli-Conference on Automation, Computing, Communication, Control and Compressed Sensing (iMac4s), Kottayam, India, 22–23 March 2013; pp. 190–195. [Google Scholar]
- Buşoniu, L.; Babuška, R.; De Schutter, B. Multi-agent reinforcement learning: An overview. In Innovations in Multi-Agent Systems and Applications-1; Springer: Berlin/Heidelberg, Germany, 2010; pp. 183–221. [Google Scholar]
- Morari, M.; Garcia, C.E.; Prett, D.M. Model predictive control: Theory and practice. IFAC Proc. Vol. 1988, 21, 1–12. [Google Scholar] [CrossRef]
- Doukas, H.; Patlitzianas, K.D.; Iatropoulos, K.; Psarras, J. Intelligent building energy management system using rule sets. Build. Environ. 2007, 42, 3562–3569. [Google Scholar] [CrossRef]
- Yang, Y.; Hu, G.; Spanos, C.J. HVAC energy cost optimization for a multizone building via a decentralized approach. IEEE Trans. Autom. Sci. Eng. 2020, 17, 1950–1960. [Google Scholar] [CrossRef]
- Boyd, S.; Parikh, N.; Chu, E.; Peleato, B.; Eckstein, J. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends ® Mach. Learn. 2011, 3, 1–122. [Google Scholar]
- Raju, L.; Sankar, S.; Milton, R. Distributed optimization of solar micro-grid using multi agent reinforcement learning. Procedia Comput. Sci. 2015, 46, 231–239. [Google Scholar] [CrossRef]
- Anvari-Moghaddam, A.; Rahimi-Kian, A.; Mirian, M.S.; Guerrero, J.M. A multi-agent based energy management solution for integrated buildings and microgrid system. Appl. Energy 2017, 203, 41–56. [Google Scholar] [CrossRef]
- Hurtado, L.A.; Mocanu, E.; Nguyen, P.H.; Gibescu, M.; Kamphuis, R.I. Enabling cooperative behavior for building demand response based on extended joint action learning. IEEE Trans. Ind. Inform. 2017, 14, 127–136. [Google Scholar] [CrossRef]
- Kofinas, P.; Dounis, A.I.; Vouros, G.A. Fuzzy Q-Learning for multi-agent decentralized energy management in microgrids. Appl. Energy 2018, 219, 53–67. [Google Scholar] [CrossRef]
- Prasad, A.; Dusparic, I. Multi-agent deep reinforcement learning for zero energy communities. In Proceedings of the 2019 IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe), Bucharest, Romania, 29 September–2 October 2019; pp. 1–5. [Google Scholar]
- Gupta, A.; Badr, Y.; Negahban, A.; Qiu, R.G. Energy-efficient heating control for smart buildings with deep reinforcement learning. J. Build. Eng. 2021, 34, 101739. [Google Scholar] [CrossRef]
- Nagarathinam, S.; Menon, V.; Vasan, A.; Sivasubramaniam, A. Marco-multi-agent reinforcement learning based control of building hvac systems. In Proceedings of the Eleventh ACM International Conference on Future Energy Systems, Virtual, 22–26 June 2020; pp. 57–67. [Google Scholar]
- Lee, S.; Choi, D.H. Federated reinforcement learning for energy management of multiple smart homes with distributed energy resources. IEEE Trans. Ind. Inform. 2020, 18, 488–497. [Google Scholar] [CrossRef]
- Xu, X.; Jia, Y.; Xu, Y.; Xu, Z.; Chai, S.; Lai, C.S. A multi-agent reinforcement learning-based data-driven method for home energy management. IEEE Trans. Smart Grid 2020, 11, 3201–3211. [Google Scholar] [CrossRef]
- Vazquez-Canteli, J.R.; Henze, G.; Nagy, Z. MARLISA: Multi-agent reinforcement learning with iterative sequential action selection for load shaping of grid-interactive connected buildings. In Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Virtual, 18–20 November 2020; pp. 170–179. [Google Scholar]
- Yu, L.; Sun, Y.; Xu, Z.; Shen, C.; Yue, D.; Jiang, T.; Guan, X. Multi-agent deep reinforcement learning for HVAC control in commercial buildings. IEEE Trans. Smart Grid 2020, 12, 407–419. [Google Scholar] [CrossRef]
- Fu, Q.; Chen, X.; Ma, S.; Fang, N.; Xing, B.; Chen, J. Optimal control method of HVAC based on multi-agent deep reinforcement learning. Energy Build. 2022, 270, 112284. [Google Scholar] [CrossRef]
- Yu, L.; Xu, Z.; Zhang, T.; Guan, X.; Yue, D. Energy-efficient personalized thermal comfort control in office buildings based on multi-agent deep reinforcement learning. Build. Environ. 2022, 223, 109458. [Google Scholar] [CrossRef]
- Shen, R.; Zhong, S.; Wen, X.; An, Q.; Zheng, R.; Li, Y.; Zhao, J. Multi-agent deep reinforcement learning optimization framework for building energy system with renewable energy. Appl. Energy 2022, 312, 118724. [Google Scholar] [CrossRef]
- Blad, C.; Bøgh, S.; Kallesøe, C.S. Data-driven offline reinforcement learning for HVAC-systems. Energy 2022, 261, 125290. [Google Scholar] [CrossRef]
- Lei, Y.; Zhan, S.; Ono, E.; Peng, Y.; Zhang, Z.; Hasama, T.; Chong, A. A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings. Appl. Energy 2022, 324, 119742. [Google Scholar] [CrossRef]
- Pinto, G.; Kathirgamanathan, A.; Mangina, E.; Finn, D.P.; Capozzoli, A. Enhancing energy management in grid-interactive buildings: A comparison among cooperative and coordinated architectures. Appl. Energy 2022, 310, 118497. [Google Scholar] [CrossRef]
- Chu, Y.; Wei, Z.; Sun, G.; Zang, H.; Chen, S.; Zhou, Y. Optimal home energy management strategy: A reinforcement learning method with actor-critic using Kronecker-factored trust region. Electr. Power Syst. Res. 2022, 212, 108617. [Google Scholar] [CrossRef]
- Gao, Y.; Matsunami, Y.; Miyata, S.; Akashi, Y. Multi-agent reinforcement learning dealing with hybrid action spaces: A case study for off-grid oriented renewable building energy system. Appl. Energy 2022, 326, 120021. [Google Scholar] [CrossRef]
- Pigott, A.; Crozier, C.; Baker, K.; Nagy, Z. Gridlearn: Multiagent reinforcement learning for grid-aware building energy management. Electr. Power Syst. Res. 2022, 213, 108521. [Google Scholar] [CrossRef]
- Qiu, D.; Xue, J.; Zhang, T.; Wang, J.; Sun, M. Federated reinforcement learning for smart building joint peer-to-peer energy and carbon allowance trading. Appl. Energy 2023, 333, 120526. [Google Scholar] [CrossRef]
- Xie, J.; Ajagekar, A.; You, F. Multi-agent attention-based deep reinforcement learning for demand response in grid-responsive buildings. Appl. Energy 2023, 342, 121162. [Google Scholar] [CrossRef]
- Nweye, K.; Sankaranarayanan, S.; Nagy, Z. MERLIN: Multi-agent offline and transfer learning for occupant-centric operation of grid-interactive communities. Appl. Energy 2023, 346, 121323. [Google Scholar] [CrossRef]
- Homod, R.Z.; Yaseen, Z.M.; Hussein, A.K.; Almusaed, A.; Alawi, O.A.; Falah, M.W.; Abdelrazek, A.H.; Ahmed, W.; Eltaweel, M. Deep clustering of cooperative multi-agent reinforcement learning to optimize multi chiller HVAC systems for smart buildings energy management. J. Build. Eng. 2023, 65, 105689. [Google Scholar] [CrossRef]
- Lauro, F.; Longobardi, L.; Panzieri, S. An adaptive distributed predictive control strategy for temperature regulation in a multizone office building. In Proceedings of the 2014 IEEE International Workshop on Intelligent Energy Systems (IWIES), San Diego, CA, USA, 8 October 2014; IEEE: New York, NY, USA, 2014; pp. 32–37. [Google Scholar]
- Pedersen, T.H.; Hedegaard, R.E.; Petersen, S. Space heating demand response potential of retrofitted residential apartment blocks. Energy Build. 2017, 141, 158–166. [Google Scholar] [CrossRef]
- Pedersen, T.H.; Hedegaard, R.E.; Knudsen, M.D.; Petersen, S. Comparison of centralized and decentralized model predictive control in a building retrofit scenario. Energy Procedia 2017, 122, 979–984. [Google Scholar] [CrossRef]
- Abobakr, S.A.; Sadid, W.H.; Zhu, G. A game-theoretic decentralized model predictive control of thermal appliances in discrete-event systems framework. IEEE Trans. Ind. Electron. 2018, 65, 6446–6456. [Google Scholar] [CrossRef]
- Sangi, R.; Müller, D. A novel hybrid agent-based model predictive control for advanced building energy systems. Energy Convers. Manag. 2018, 178, 415–427. [Google Scholar] [CrossRef]
- Yang, Y.; Jia, Q.S.; Guan, X.; Zhang, X.; Qiu, Z.; Deconinck, G. Decentralized EV-based charging optimization with building integrated wind energy. IEEE Trans. Autom. Sci. Eng. 2018, 16, 1002–1017. [Google Scholar] [CrossRef]
- Zhuo, W.; Savkin, A.V.; Meng, K. Decentralized optimal control of a microgrid with solar PV, BESS and thermostatically controlled loads. Energies 2019, 12, 2111. [Google Scholar] [CrossRef]
- Lyons, B.; O’Dwyer, E.; Shah, N. Model reduction for Model Predictive Control of district and communal heating systems within cooperative energy systems. Energy 2020, 197, 117178. [Google Scholar] [CrossRef]
- El Geneidy, R.; Howard, B. Contracted energy flexibility characteristics of communities: Analysis of a control strategy for demand response. Appl. Energy 2020, 263, 114600. [Google Scholar] [CrossRef]
- Wang, J.; Garifi, K.; Baker, K.; Zuo, W.; Zhang, Y.; Huang, S.; Vrabie, D. Optimal renewable resource allocation and load scheduling of resilient communities. Energies 2020, 13, 5683. [Google Scholar] [CrossRef]
- Saletti, C.; Gambarotta, A.; Morini, M. Development, analysis and application of a predictive controller to a small-scale district heating system. Appl. Therm. Eng. 2020, 165, 114558. [Google Scholar] [CrossRef]
- Wu, Y.; Mäki, A.; Jokisalo, J.; Kosonen, R.; Kilpeläinen, S.; Salo, S.; Liu, H.; Li, B. Demand response of district heating using model predictive control to prevent the draught risk of cold window in an office building. J. Build. Eng. 2021, 33, 101855. [Google Scholar] [CrossRef]
- Lefebure, N.; Khosravi, M.; de Badyn, M.H.; Bünning, F.; Lygeros, J.; Jones, C.; Smith, R.S. Distributed model predictive control of buildings and energy hubs. Energy Build. 2022, 259, 111806. [Google Scholar] [CrossRef]
- Hurtado, L.; Nguyen, P.; Kling, W. Smart grid and smart building inter-operation using agent-based particle swarm optimization. Sustain. Energy Grids Networks 2015, 2, 32–40. [Google Scholar] [CrossRef]
- Shaikh, P.H.; Nor, N.B.M.; Nallagownden, P.; Elamvazuthi, I.; Ibrahim, T. Intelligent multi-objective control and management for smart energy efficient buildings. Int. J. Electr. Power Energy Syst. 2016, 74, 403–409. [Google Scholar] [CrossRef]
- Altayeva, A.; Omarov, B.; Suleimenov, Z.; Im Cho, Y. Application of multi-agent control systems in energy-efficient intelligent building. In Proceedings of the 2017 Joint 17th World Congress of International Fuzzy Systems Association and 9th International Conference on Soft Computing and Intelligent Systems (IFSA-SCIS), Otsu, Japan, 27–30 June 2017; IEEE: New York, NY, USA, 2017; pp. 1–5. [Google Scholar]
- ur Rehman, H.; Hirvonen, J.; Sirén, K. Performance comparison between optimized design of a centralized and semi-decentralized community size solar district heating system. Appl. Energy 2018, 229, 1072–1094. [Google Scholar] [CrossRef]
- Ghazimirsaeid, S.S.; Jonban, M.S.; Mudiyanselage, M.W.; Marzband, M.; Martinez, J.L.R.; Abusorrah, A. Multi-agent-based energy management of multiple grid-connected green buildings. J. Build. Eng. 2023, 74, 106866. [Google Scholar] [CrossRef]
- Cai, J.; Kim, D.; Jaramillo, R.; Braun, J.E.; Hu, J. A general multi-agent control approach for building energy system optimization. Energy Build. 2016, 127, 337–351. [Google Scholar] [CrossRef]
- Hou, X.; Xiao, Y.; Cai, J.; Hu, J.; Braun, J.E. Distributed model predictive control via proximal Jacobian ADMM for building control applications. In Proceedings of the 2017 American Control Conference (ACC), Seattle, DA, USA, 24–26 May 2017; IEEE: New York, NY, USA, 2017; pp. 37–43. [Google Scholar]
- Carli, R.; Dotoli, M. Decentralized control for residential energy management of a smart users’ microgrid with renewable energy exchange. IEEE/CAA J. Autom. Sin. 2019, 6, 641–656. [Google Scholar] [CrossRef]
- Li, W.; Wang, S. A multi-agent based distributed approach for optimal control of multi-zone ventilation systems considering indoor air quality and energy use. Appl. Energy 2020, 275, 115371. [Google Scholar] [CrossRef]
- Li, W.; Li, H.; Wang, S. An event-driven multi-agent based distributed optimal control strategy for HVAC systems in IoT-enabled smart buildings. Autom. Constr. 2021, 132, 103919. [Google Scholar] [CrossRef]
- Lyu, C.; Jia, Y.; Xu, Z. Fully decentralized peer-to-peer energy sharing framework for smart buildings with local battery system and aggregated electric vehicles. Appl. Energy 2021, 299, 117243. [Google Scholar] [CrossRef]
- Wang, Z.; Zhao, Y.; Zhang, C.; Ma, P.; Liu, X. A general multi agent-based distributed framework for optimal control of building HVAC systems. J. Build. Eng. 2022, 52, 104498. [Google Scholar] [CrossRef]
- Guo, Y.; Pan, M.; Fang, Y.; Khargonekar, P.P. Decentralized coordination of energy utilization for residential households in the smart grid. IEEE Trans. Smart Grid 2013, 4, 1341–1350. [Google Scholar] [CrossRef]
- Zheng, L.; Cai, L. A distributed demand response control strategy using Lyapunov optimization. IEEE Trans. Smart Grid 2014, 5, 2075–2083. [Google Scholar] [CrossRef]
- Yu, L.; Xie, D.; Jiang, T.; Zou, Y.; Wang, K. Distributed real-time HVAC control for cost-efficient commercial buildings under smart grid environment. IEEE Internet Things J. 2017, 5, 44–55. [Google Scholar] [CrossRef]
- Jazizadeh, F.; Ghahramani, A.; Becerik-Gerber, B.; Kichkaylo, T.; Orosz, M. User-led decentralized thermal comfort driven HVAC operations for improved efficiency in office buildings. Energy Build. 2014, 70, 398–410. [Google Scholar] [CrossRef]
- Javed, A.; Larijani, H.; Ahmadinia, A.; Emmanuel, R.; Mannion, M.; Gibson, D. Design and implementation of a cloud enabled random neural network-based decentralized smart controller with intelligent sensor nodes for HVAC. IEEE Internet Things J. 2016, 4, 393–403. [Google Scholar] [CrossRef]
- González-Briones, A.; Prieto, J.; De La Prieta, F.; Herrera-Viedma, E.; Corchado, J.M. Energy optimization using a case-based reasoning strategy. Sensors 2018, 18, 865. [Google Scholar] [CrossRef] [PubMed]
- Joe, J.; Karava, P.; Hou, X.; Xiao, Y.; Hu, J. A distributed approach to model-predictive control of radiant comfort delivery systems in office spaces with localized thermal environments. Energy Build. 2018, 175, 173–188. [Google Scholar] [CrossRef]
- Lork, C.; Li, W.T.; Qin, Y.; Zhou, Y.; Yuen, C.; Tushar, W.; Saha, T.K. An uncertainty-aware deep reinforcement learning framework for residential air conditioning energy management. Appl. Energy 2020, 276, 115426. [Google Scholar] [CrossRef]
- Zou, Z.; Yu, X.; Ergan, S. Towards optimal control of air handling units using deep reinforcement learning and recurrent neural network. Build. Environ. 2020, 168, 106535. [Google Scholar] [CrossRef]
- Rochd, A.; Benazzouz, A.; Abdelmoula, I.A.; Raihani, A.; Ghennioui, A.; Naimi, Z.; Ikken, B. Design and implementation of an AI-based & IoT-enabled Home Energy Management System: A case study in Benguerir—Morocco. Energy Rep. 2021, 7, 699–719. [Google Scholar]
- Li, Z.; Zhang, J. Study on the distributed model predictive control for multi-zone buildings in personalized heating. Energy Build. 2021, 231, 110627. [Google Scholar] [CrossRef]
- Homod, R.Z.; Togun, H.; Hussein, A.K.; Al-Mousawi, F.N.; Yaseen, Z.M.; Al-Kouz, W.; Abd, H.J.; Alawi, O.A.; Goodarzi, M.; Hussein, O.A. Dynamics analysis of a novel hybrid deep clustering for unsupervised learning by reinforcement of multi-agent to energy saving in intelligent buildings. Appl. Energy 2022, 313, 118863. [Google Scholar] [CrossRef]
- Mork, M.; Xhonneux, A.; Müller, D. Nonlinear Distributed Model Predictive Control for multi-zone building energy systems. Energy Build. 2022, 264, 112066. [Google Scholar] [CrossRef]
- Chen, C.; Wang, J.; Kishore, S. A distributed direct load control approach for large-scale residential demand response. IEEE Trans. Power Syst. 2014, 29, 2219–2228. [Google Scholar] [CrossRef]
- Dai, Y.; Jiang, Z.; Shen, Q.; Chen, P.; Wang, S.; Jiang, Y. A decentralized algorithm for optimal distribution in HVAC systems. Build. Environ. 2016, 95, 21–31. [Google Scholar] [CrossRef]
- Hollinger, R.; Diazgranados, L.M.; Braam, F.; Erge, T.; Bopp, G.; Engel, B. Distributed solar battery systems providing primary control reserve. IET Renew. Power Gener. 2016, 10, 63–70. [Google Scholar] [CrossRef]
- Michailidis, I.T.; Schild, T.; Sangi, R.; Michailidis, P.; Korkas, C.; Fütterer, J.; Müller, D.; Kosmatopoulos, E.B. Energy-efficient HVAC management using cooperative, self-trained, control agents: A real-life German building case study. Appl. Energy 2018, 211, 113–125. [Google Scholar] [CrossRef]
- Png, E.; Srinivasan, S.; Bekiroglu, K.; Chaoyang, J.; Su, R.; Poolla, K. An internet of things upgrade for smart and scalable heating, ventilation and air-conditioning control in commercial buildings. Appl. Energy 2019, 239, 408–424. [Google Scholar] [CrossRef]
- Xue, Y.; Li, Z.; Lin, C.; Guo, Q.; Sun, H. Coordinated dispatch of integrated electric and district heating systems using heterogeneous decomposition. IEEE Trans. Sustain. Energy 2019, 11, 1495–1507. [Google Scholar] [CrossRef]
- Lymperopoulos, G.; Ioannou, P. Building temperature regulation in a multi-zone HVAC system using distributed adaptive control. Energy Build. 2020, 215, 109825. [Google Scholar] [CrossRef]
- Alhasnawi, B.N.; Jasim, B.H.; Sedhom, B.E.; Hossain, E.; Guerrero, J.M. A new decentralized control strategy of microgrids in the internet of energy paradigm. Energies 2021, 14, 2183. [Google Scholar] [CrossRef]
- Ahrens, M.; Kern, F.; Schmeck, H. Strategies for an adaptive control system to improve power grid resilience with smart buildings. Energies 2021, 14, 4472. [Google Scholar] [CrossRef]
- Jonban, M.S.; Romeral, L.; Akbarimajd, A.; Ali, Z.; Ghazimirsaeid, S.S.; Marzband, M.; Putrus, G. Autonomous energy management system with self-healing capabilities for green buildings (microgrids). J. Build. Eng. 2021, 34, 101604. [Google Scholar] [CrossRef]
- Kolahan, A.; Maadi, S.R.; Teymouri, Z.; Schenone, C. Blockchain-based solution for energy demand-side management of residential buildings. Sustain. Cities Soc. 2021, 75, 103316. [Google Scholar] [CrossRef]
- Gupta, S.K.; Ghose, T.; Chatterjee, K. Coordinated control of Incentive-Based Demand Response Program and BESS for frequency regulation in low inertia isolated grid. Electr. Power Syst. Res. 2022, 209, 108037. [Google Scholar] [CrossRef]
- Michailidis, I.T.; Sangi, R.; Michailidis, P.; Schild, T.; Fuetterer, J.; Mueller, D.; Kosmatopoulos, E.B. Balancing energy efficiency with indoor comfort using smart control agents: A simulative case study. Energies 2020, 13, 6228. [Google Scholar] [CrossRef]
- Michailidis, P.; Michailidis, I.T.; Gkelios, S.; Karatzinis, G.; Kosmatopoulos, E.B. Neuro-distributed cognitive adaptive optimization for training neural networks in a parallel and asynchronous manner. Integr.-Comput.-Aided Eng. 2024, 31, 1–23. [Google Scholar] [CrossRef]
- Tran, N.; Nguyen, T. Artificial Neural Networks for Modeling Pollutant Removal in Wastewater Treatment: A Review. Galore Int. J. Appl. Sci. Humanit. 2024, 8, 88–98. [Google Scholar]
- Nguyen, D.H.; Funabashi, T. Decentralized control design for user comfort and energy saving in multi-zone buildings. Energy Procedia 2019, 156, 172–176. [Google Scholar] [CrossRef]
Approach | Ref. | Achievement |
---|---|---|
Value-based | [55] | CQL reduced grid power consumption by 15% and increased solar utility by 10–12% |
[58] | Hybrid Q-learning achieved 20% energy savings | |
[63] | Q-learning for HVAC control achieved 15% energy savings | |
[69] | Q-learning with LSTM reduced heating costs by 19.4% | |
[59] | DQN achieved a 40–60 kWh improvement in energy sharing | |
[60] | DQN-based controller improved thermal comfort by 15–30% | |
[61] | DQL agents reduced energy consumption by 17% | |
[66] | DQN optimized HVACs, improving energy efficiency by 11.1% | |
[78] | DCCMARL reduced energy consumption by 49% | |
[68] | D3QN reduced uncomfortable duration by 84% | |
[70] | BDQ reduced cooling energy by 14% | |
Policy-based | [56] | Ontology-driven EMS reduced microgrid costs by 5% |
[57] | eJAL algorithm reduced overload duration by 16.3% | |
[64] | MARLISA reduced peak load by 15% and ramping by 35% | |
[73] | Hierarchical RL improved off-grid operations by 64.93% | |
[74] | PPO reduced overvoltages by 34% | |
Actor–critic | [65] | MAAC reduced energy costs by 56.50–75.25% |
[67] | MAAC reduced energy consumption by 0.7–4.18% | |
[75] | Fed-JPC algorithm reduced costs and emissions by 5.87% and 8.02% | |
[76] | MAAC reduced net load demand by 6% | |
[71] | SAC reduced costs by 7% and peak demand by 14% | |
[77] | SAC reduced electricity consumption by 20% and costs by 30% | |
[62] | A2C reduced electricity by 20% and emissions by 20% | |
[72] | ACKTR reduced energy cost by 25.37% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Michailidis, P.; Michailidis, I.; Kosmatopoulos, E. Review and Evaluation of Multi-Agent Control Applications for Energy Management in Buildings. Energies 2024, 17, 4835. https://doi.org/10.3390/en17194835
Michailidis P, Michailidis I, Kosmatopoulos E. Review and Evaluation of Multi-Agent Control Applications for Energy Management in Buildings. Energies. 2024; 17(19):4835. https://doi.org/10.3390/en17194835
Chicago/Turabian StyleMichailidis, Panagiotis, Iakovos Michailidis, and Elias Kosmatopoulos. 2024. "Review and Evaluation of Multi-Agent Control Applications for Energy Management in Buildings" Energies 17, no. 19: 4835. https://doi.org/10.3390/en17194835
APA StyleMichailidis, P., Michailidis, I., & Kosmatopoulos, E. (2024). Review and Evaluation of Multi-Agent Control Applications for Energy Management in Buildings. Energies, 17(19), 4835. https://doi.org/10.3390/en17194835