2. Contributions
The first paper [
1] proposes a driver behavior analysis system using one spatial stream ConvNet to extract the spatial features and one temporal stream ConvNet to capture the driver’s motion information. The two-dimensional (2D) ConvNet is used to construct the spatial and temporal ConvNet streams, and they were pre-trained by the large-scale ImageNet. In order to integrate different modalities, the feature-level fusion methodology was applied, and a fusion network was designed to integrate the spatial and temporal features for further classification. Moreover, a self-compiled dataset of 10 actions in the vehicle was established. Results showed that the proposed system can increase the accuracy rate by nearly 30% compared to the two-stream CNN model with a score-level fusion.
The second paper [
2] proposes an integrated deep network consisting of a detection and identification module for person search. In real situations, person search is a very challenging problem because of the large appearance variation caused by occlusion, background clutter, pose variations, etc., and most existing works take cropped pedestrian images either from manual labelling or a perfect detection assumption. Thus, a framework for the practical surveillance scenarios in which the scene images are captured is designed. Additionally, the detection module based on the Faster R-CNN is used to detect persons in a scene image. For identifying and extracting discriminative features, a multi-class CNN network is trained with the auto-detected bounding boxes from the detection module instead of the manually cropped data. The distance metric is then learned from the discriminative features output by the identification module. According to the experimental results of the test performed in the scene images, the multi-class CNN network for the identification module can provide a 62.7% accuracy rate, which is higher than that for the two-class CNN network.
The third paper [
3] focuses on the ontology-based tourist knowledge representation and recommendation method. The designed model is used to search for popular attractions from the online travel reviews (OTR) content and construct a tourist knowledge structure for these travelers. When the tourists do not need to know the keywords of the popular attraction name, they only need to acquire their current location; the ORT content will then recommend the next attraction to the traveler, which helps the traveler make the correct travel decision. The evaluation result showed that the method proposed in this study can help travelers to quickly make travel decisions and is better than the traditional searching methods.
The fourth paper [
4] builds an effective system, called the personality-driven course decision assistant, to help students determine the courses they should select by mining and filtering learners’ personality patterns. For learner pattern mining, the relationships between the students’ learning results and the referred personalities are discovered to provide the learners with valuable information before learning commences. For filtering learner personality patterns, students with similar personality patterns are filtered to predict the potential learning results. Through the actual system, a number of subjective and objective evaluations were conducted, and the evaluation results reveal that the proposed system is highly effective and reliable.
The fifth paper [
5] proposes an electric energy consumption prediction model utilizing the combination of convolutional neural network (CNN) and bi-directional long short-term memory (Bi-LSTM), which is named the EECP-CBL model, to predict electric energy consumption. In this framework, two CNNs in the first module extract the important information from several variables in the individual household electric power consumption (IHEPC) dataset. Then, the Bi-LSTM module with two Bi-LSTM layers uses the above information as well as the trends of time series in two directions, including the forward and backward states, to make predictions. The obtained values in the Bi-LSTM module is passed to the last module that consists of two fully connected layers for finally predicting the electric energy consumption in the future. The experimental results indicate that the designed framework outperforms the state-of-the-art approaches in terms of several performance metrics for electric energy consumption prediction on several variations of the IHEPC dataset in real-time, short-term, medium-term, and long-term timespans.
The sixth paper [
6] proposes a deep-learning-based defective bean inspection scheme (DL-DBIS), together with a generative-adversarial network (GAN)-structured automated labeled data augmentation method (GALDAM) for enhancing the proposed scheme, so that the automation degree of bean removal with robotic arms can be further improved for coffee industries. The proposed scheme is aimed at providing an effective model to a deep-learning-based object detection module for accurately identifying defects among dense beans. The proposed GALDAM can be used to greatly reduce labor costs because data labeling is the most labor-intensive work in this type of solution. Testing results of a case study reveal that the proposed scheme can efficiently and effectively generate models for identifying defective beans with accuracy and precision values up to 80%.
The seventh paper [
7] proposes a model to forecast the foreign exchange rate. It incorporates event sentiments to accurately predict the exchange rate. Moreover, as the currency market is heavily dependent upon highly volatile factors such as gold and crude oil prices, these sensitive factors for exchange rate forecasting is then considered. The study also shows the importance of incorporating the investor sentiment of local and foreign macro-level events for accurate forecasting of the exchange rate. The results show that this deep-learning-based model is a better predictor of foreign currency exchange rate in comparison with the statistical techniques normally employed for prediction. It also presents evidence that the exchange rate of all three countries is more exposed to events happening in the US.
The eighth paper [
8] proposes a proactive content-loading algorithm for improving per-user personalized preferences using multinomial softmax classification. The proposed UAS technique predicts a partial video skip due to long-time viewing by the user. This prediction prevents unnecessary network usage and guarantees maximum continuity by minimizing the video interruption and delay. Based on experimental results, the proposed algorithm has a personalized per-user content waiting time that is significantly lower than that of competing algorithms.
The ninth paper [
9] proposes a big data analytics framework (named the multiple item support frequent patterns (MISFP)-growth algorithm) that uses Hadoop-based parallel computing to achieve high-efficiency mining of itemsets with multiple item supports (MIS). To facilitate decision makers in setting MIS, the authors also proposed the concept of classification of item (COI), which classifies items of higher homogeneity into the same class, by which the items inherit class support as their item support. The experimental results show an approximately 38% reduction in the execution time on parallel architectures. Furthermore, according to the experimental results, the enhanced performance of the proposed algorithm indicates that it could have big data analytics applications.
The tenth paper [
10] proposes a method that can rapidly mine frequent patterns under varying network bandwidths. This fast distributed mining in changing network bandwidth (FDCNB)-mining method is based on the effective structure for distributed computing environments. Additionally, the authors added a monitor node to observe variable network bandwidths in cloud or distributed computing environments, which is beneficial for the kernel node to select the optimal computing node. Through empirical evaluation, the proposed method is shown to deliver excellent performance in terms of execution efficiency and load balancing.
The eleventh paper [
11] proposes a hybrid CSA (HCSA) to minimize the makespans of PFSPs. First, to make the CSA suitable for solving the PFSP, the smallest position value rule is applied to convert continuous numbers into job sequences. The HCSA then uses a Nawaz–Enscore–Ham (NEH) technique to create a population with the required levels of quality and diversity. A local search is designed to enhance the quality of the solutions and avoid premature convergence. In addition, a simulated annealing approach enhances the local search of a method based on a variable neighborhood search. The tests indicate that the performance of the proposed HCSA is significantly superior to that of other algorithms.