Submit to this Journal Review for this Journal Propose a Special Issue

Article Menu

Share Help Cite Discuss in SciProfiles

Open AccessArticle

Peer-Review Record

Reinforcement Learning-Based Decentralized Safety Control for Constrained Interconnected Nonlinear Safety-Critical Systems

Entropy 2023, 25(8), 1158; https://doi.org/10.3390/e25081158

by Chunbin Qin¹

, Yinliang Wu¹, Jishi Zhang^2,* and Tianzeng Zhu¹

Reviewer 1:

Xu-dong Zhao

Reviewer 2: Anonymous

Reviewer 3: Anonymous

Entropy 2023, 25(8), 1158; https://doi.org/10.3390/e25081158

Submission received: 30 May 2023 / Revised: 21 June 2023 / Accepted: 1 July 2023 / Published: 2 August 2023

(This article belongs to the Special Issue Information Theory for Interpretable Machine Learning)

Round 1

Reviewer 1 Report

See the attachment.

Comments for author File: Comments.pdf

None.

Author Response

Please see the attachment

Author Response File: Author Response.pdf

Reviewer 2 Report

1. According to Line 362, Remark 6, the authors proposed a technique based on RL to solve DSC. However, essential elements such as RL's markov decision property, bellman equation, Q-learning, and monte carlo are not presented. There isn't enough information on how it is RL-based one. Proposal for technology based on RL is not detailed, so more informatin is required.

2. In the introduction section, the authors proposed a RL based DSC to overcome the shortcomings in references [9]-[13]. The most important difference between the aforementioned papers and the author's paper is presented as "However, the above [9–13] mainly focused on a single continuous-time/ discrete-time nonlinear optimal security control. The security control of interconnected systems has not been fully resolved.".

However, the method to overcome these disadvantages was not explained in detail in the contribution description (lines 84-97). A more specific explanation should be added to that section.

3. Comparative studies and discussions with other DSC methods may also be added.

4. The statement to support that there are numerous control constraints on the 65th line lacks reasonable evidence. It appears that there is a need for relevant references.

5. In equations 2 and 3, there is no explanation provided for the function P_i(x). Additional clarification is required regarding the nature of this function.

6. Line 135, it is necessary to specify the exact target or condition instead of using the expression "above condition."

The manuscript seems to be fine. But to improve the quality, it is recommended to take the grammar check-up and language polishing.

Author Response

Please see the attachment

Author Response File: Author Response.pdf

Reviewer 3 Report

The authors propose a decentralized safety control of constrained interconnected nonlinear systems, using a strategy based on the reinforcement learning theory. The concept appears very interesting, but some doubts arise from the revision. The authors are invited to answer all the following comments.

1. The authors should better explain the innovations of the proposed approach, since there are many works on Barrier Functions, constraints, etc.

2. The authors should improve the technical literature with some recent works (see for example, DOI: 10.1109/TAC.2022.3197562, https://doi.org/10.1146/annurev-control-042920-020211, DOI: 10.1109/LRA.2021.3070252, DOI: 10.1109/ACCESS.2022.3150926, https://doi.org/10.1016/j.epsr.2022.108609 and the references therein) .

3. The authors should also explain the term "decentralized" used for the control approach, and the meaning that they give to it.

4. The UUB stability could be worrying, especially in real applications. Is it possible to extend the results to asymptotic stability? (see for example the interesting results in DOI: 10.1109/TAC.2022.3197562). Otherwise, it could be interesting a sensitivity analysis on the magnitude of the residual set of convergence, in connection with the control tasks.

5. Is it possible to reduce the settling time in the simulation scenario?

6. At line 158, page 5, there is a wrong formatting for the couple (a,A). A careful rereading is recommended.

7. It is mandatory a simulative comparison analysis with some decentralized approaches proposed in the technical literature, in order to better prove the advantages of your proposal

Minor editing are required for the English language.

Author Response

Please see the attachment.

Author Response File: Author Response.pdf

Round 2

Reviewer 2 Report

All of my comments have been well addressed, and the paper can be accepted.

The manuscript seems to be fine. However, some sentences and expressions still need to be checked again.

Reviewer 3 Report

The authors replied to all my questions. Accordingly, the paper can be accepted now.

Article Menu

Reinforcement Learning-Based Decentralized Safety Control for Constrained Interconnected Nonlinear Safety-Critical Systems

Further Information

Guidelines

MDPI Initiatives

Follow MDPI