Submit to this Journal Review for this Journal Propose a Special Issue

Article Menu

Share Help Cite Discuss in SciProfiles

Open AccessArticle

Peer-Review Record

DS-Trans: A 3D Object Detection Method Based on a Deformable Spatiotemporal Transformer for Autonomous Vehicles

Remote Sens. 2024, 16(9), 1621; https://doi.org/10.3390/rs16091621

by Yuan Zhu¹, Ruidong Xu¹

, Chongben Tao², Hao An¹, Huaide Wang¹, Zhipeng Sun³ and Ke Lu^1,*

Reviewer 1: Anonymous

Reviewer 2: Anonymous

Reviewer 3: Anonymous

Remote Sens. 2024, 16(9), 1621; https://doi.org/10.3390/rs16091621

Submission received: 26 February 2024 / Revised: 26 April 2024 / Accepted: 29 April 2024 / Published: 30 April 2024

(This article belongs to the Special Issue Advances in Remote Sensing of Solving Challenges in Autonomous Driving and Safety Analysis)

Round 1

Reviewer 1 Report

Comments and Suggestions for Authors

See attachment.

Comments for author File: Comments.pdf

Author Response

Please see the attachment.

Author Response File: Author Response.pdf

Reviewer 2 Report

Comments and Suggestions for Authors

Dear authors,

Thank you for your efforts in producing this paper. In general, the paper is well written, and I have only few questions or suggestions listed below:

In section 4.1.3 - I suggest providing a more detailed description of the specific sensors utilized in the measurement.

Section 4.2 (lines 417 – 421) – some of the listed categories are more easily detectable by your method, while some of them are less detectable, how you can explain it?

The processing part of the algorithm is done in real-time or post-processing? And also, the data before the processing is somehow pre-processed or it is used as “measured”? Perhaps implementing some pre-processing filters could improve measurements taken in rainy conditions by eliminating “false reflections”.

Regards

Comments on the Quality of English Language

Minor editing of English language required.

Author Response

Please see the attachment.

Author Response File: Author Response.pdf

Reviewer 3 Report

Comments and Suggestions for Authors

The manuscript entitled “DS-Trans: An 3D Object Detection Method Based on Deformable Spatiotemporal Transformer for Autonomous Vehicle” is work that designed a new Transformer-based deep learning algorithm for 3D object detection from multi-frame point clouds. It performs better on the test data set than the other popular algorithms. The main contribution of this work is the investigation of the model structure, which is not significantly innovative but effective. My concerns are as follows:

1) It is recommended to compare the number of trainable parameters and computational complexity with other models.

2) L378, please double check the title of Figure 5.

3) L404, Section 4.1.3, please specify the number and distribution of self-collected samples and whether they are publicly available.

4) L562, Table V or Table 5? L572, Table 6 or Table VI? Please have a check.

5) L595, Table 4, it seems that with the increase of FFN, the performance of the proposed model increases, so why not continue to increase it, such as increasing it to 10, please further analyze their correlation.

Author Response

Please see the attachment.

Author Response File: Author Response.pdf

Round 2

Reviewer 3 Report

Comments and Suggestions for Authors

After revision, the manuscript has been improved. The only issue I still concerned is the correlation between the model performance and its FFN. As explained in the cover letter, the authors should add such explanations or experiment results in the paper.

Author Response

Thank you for your suggestion. We have added more explanations into the 4.5.2 section.

Article Menu

DS-Trans: A 3D Object Detection Method Based on a Deformable Spatiotemporal Transformer for Autonomous Vehicles

Further Information

Guidelines

MDPI Initiatives

Follow MDPI