Next Issue
Volume 8, August
Previous Issue
Volume 8, June
 
 

Big Data Cogn. Comput., Volume 8, Issue 7 (July 2024) – 10 articles

Cover Story (view full-size image): Interest and engagement with trustworthy AI in healthcare are rising, as shown by its growing use in diagnostics and patient management. However, translating theoretical frameworks into actual practices remains limited, evidenced by a few dedicated papers. The present study reports the first scoping review on the topic that is specific to decision-making systems in the biomedical domain and attempts to consolidate existing practices as they appear in the academic literature on the subject. The main findings show how the implementation of trustworthy AI principles is inconsistent, particularly in explainability, technical robustness, safety, privacy, and human oversight. This highlights the need for a more comprehensive and integrated approach to trustworthy AI in healthcare. View this paper
  • Issues are regarded as officially published after their release is announced to the table of contents alert mailing list.
  • You may sign up for e-mail alerts to receive table of contents of newly released issues.
  • PDF is the official format for papers published in both, html and pdf forms. To view the papers in pdf format, click on the "PDF Full-text" link, and use the free Adobe Reader to open them.
Order results
Result details
Select all
Export citation of selected articles as:
15 pages, 1955 KiB  
Article
Breast Cancer Detection and Localizing the Mass Area Using Deep Learning
by Md. Mijanur Rahman, Md. Zihad Bin Jahangir, Anisur Rahman, Moni Akter, MD Abdullah Al Nasim, Kishor Datta Gupta and Roy George
Big Data Cogn. Comput. 2024, 8(7), 80; https://doi.org/10.3390/bdcc8070080 - 16 Jul 2024
Cited by 1 | Viewed by 2946
Abstract
Breast cancer presents a substantial health obstacle since it is the most widespread invasive cancer and the second most common cause of death in women. Prompt identification is essential for effective intervention, rendering breast cancer screening a critical component of healthcare. Although mammography [...] Read more.
Breast cancer presents a substantial health obstacle since it is the most widespread invasive cancer and the second most common cause of death in women. Prompt identification is essential for effective intervention, rendering breast cancer screening a critical component of healthcare. Although mammography is frequently employed for screening purposes, the manual diagnosis performed by pathologists can be laborious and susceptible to mistakes. Regrettably, the majority of research prioritizes mass classification over mass localization, resulting in an uneven distribution of attention. In response to this problem, we suggest a groundbreaking approach that seeks to identify and pinpoint cancers in breast mammography pictures. This will allow medical experts to identify tumors more quickly and with greater precision. This paper presents a complex deep convolutional neural network design that incorporates advanced deep learning techniques such as U-Net and YOLO. The objective is to enable automatic detection and localization of breast lesions in mammography pictures. To assess the effectiveness of our model, we carried out a thorough review that included a range of performance criteria. We specifically evaluated the accuracy, precision, recall, F1-score, ROC curve, and R-squared error using the publicly available MIAS dataset. Our model performed exceptionally well, with an accuracy rate of 93.0% and an AUC (area under the curve) of 98.6% for the detection job. Moreover, for the localization task, our model achieved a remarkably high R-squared value of 97%. These findings highlight that deep learning can boost the efficiency and accuracy of diagnosing breast cancer. The automation of breast lesion detection and classification offered by our proposed method bears substantial benefits. By alleviating the workload burden on pathologists, it facilitates expedited and accurate breast cancer screening processes. As a result, the proposed approach holds promise for improving healthcare outcomes and bolstering the overall effectiveness of breast cancer detection and diagnosis. Full article
Show Figures

Figure 1

24 pages, 603 KiB  
Article
Trends and Challenges towards Effective Data-Driven Decision Making in UK Small and Medium-Sized Enterprises: Case Studies and Lessons Learnt from the Analysis of 85 Small and Medium-Sized Enterprises
by Abdel-Rahman H. Tawil, Muhidin Mohamed, Xavier Schmoor, Konstantinos Vlachos and Diana Haidar
Big Data Cogn. Comput. 2024, 8(7), 79; https://doi.org/10.3390/bdcc8070079 - 12 Jul 2024
Viewed by 2782
Abstract
The adoption of data science brings vast benefits to Small and Medium-sized Enterprises (SMEs) including business productivity, economic growth, innovation and job creation. Data science can support SMEs to optimise production processes, anticipate customers’ needs, predict machinery failures and deliver efficient smart services. [...] Read more.
The adoption of data science brings vast benefits to Small and Medium-sized Enterprises (SMEs) including business productivity, economic growth, innovation and job creation. Data science can support SMEs to optimise production processes, anticipate customers’ needs, predict machinery failures and deliver efficient smart services. Businesses can also harness the power of artificial intelligence (AI) and big data, and the smart use of digital technologies to enhance productivity and performance, paving the way for innovation. However, integrating data science decisions into an SME requires both skills and IT investments. In most cases, such expenses are beyond the means of SMEs due to their limited resources and restricted access to financing. This paper presents trends and challenges towards effective data-driven decision making for organisations based on a 3-year long study which covered more than 85 UK SMEs, mostly from the West Midlands region of England. In particular, this study attempts to find answers to several key research questions around data science and AI adoption among UK SMEs, and the advantages of digitalisation and data-driven decision making, as well as the challenges hindering their effective utilisation of these technologies. We also present two case studies that demonstrate the potential of digitisation and data science, and use these as examples to unveil challenges and showcase the wealth of currently available opportunities for SMEs. Full article
(This article belongs to the Special Issue Applied Data Science for Social Good)
Show Figures

Figure 1

22 pages, 1736 KiB  
Article
Demystifying Mental Health by Decoding Facial Action Unit Sequences
by Deepika Sharma, Jaiteg Singh, Sukhjit Singh Sehra and Sumeet Kaur Sehra
Big Data Cogn. Comput. 2024, 8(7), 78; https://doi.org/10.3390/bdcc8070078 - 9 Jul 2024
Viewed by 1404
Abstract
Mental health is indispensable for effective daily functioning and stress management. Facial expressions may provide vital clues about the mental state of a person as they are universally consistent across cultures. This study intends to detect the emotional variances through facial micro-expressions using [...] Read more.
Mental health is indispensable for effective daily functioning and stress management. Facial expressions may provide vital clues about the mental state of a person as they are universally consistent across cultures. This study intends to detect the emotional variances through facial micro-expressions using facial action units (AUs) to identify probable mental health issues. In addition, convolutional neural networks (CNN) were used to detect and classify the micro-expressions. Further, combinations of AUs were identified for the segmentation of micro-expressions classes using K-means square. Two benchmarked datasets CASME II and SAMM were employed for the training and evaluation of the model. The model achieved an accuracy of 95.62% on CASME II and 93.21% on the SAMM dataset, respectively. Subsequently, a case analysis was done to identify depressive patients using the proposed framework and it attained an accuracy of 92.99%. This experiment revealed the fact that emotions like disgust, sadness, anger, and surprise are the prominent emotions experienced by depressive patients during communication. The findings suggest that leveraging facial action units for micro-expression detection offers a promising approach to mental health diagnostics. Full article
Show Figures

Figure 1

27 pages, 3727 KiB  
Article
AMIKOMNET: Novel Structure for a Deep Learning Model to Enhance COVID-19 Classification Task Performance
by Muh Hanafi
Big Data Cogn. Comput. 2024, 8(7), 77; https://doi.org/10.3390/bdcc8070077 - 9 Jul 2024
Cited by 1 | Viewed by 1196
Abstract
Since early 2020, coronavirus has spread extensively throughout the globe. It was first detected in Wuhan, a province in China. Many researchers have proposed various models to solve problems related to COVID-19 detection. As traditional medical approaches take a lot of time to [...] Read more.
Since early 2020, coronavirus has spread extensively throughout the globe. It was first detected in Wuhan, a province in China. Many researchers have proposed various models to solve problems related to COVID-19 detection. As traditional medical approaches take a lot of time to detect the virus and require specific laboratory tests, the adoption of artificial intelligence (AI), including machine learning, might play an important role in handling the problem. A great deal of research has seen the adoption of AI succeed in the early detection of COVID-19 using X-ray images. Unfortunately, the majority of deep learning adoption for COVID-19 detection has the shortcomings of high error detection and high computation costs. In this study, we employed a hybrid model using an auto-encoder (AE) and a convolutional neural network (CNN) (named AMIKOMNET) with a small number of layers and parameters. We implemented an ensemble learning mechanism in the AMIKOMNET model using Adaboost with the aim of reducing error detection in COVID-19 classification tasks. The experimental results for the binary class show that our model achieved high effectiveness, with 96.90% accuracy, 95.06% recall, 94.67% F1-score, and 96.03% precision. The experimental result for the multiclass achieved 95.13% accuracy, 94.93% recall, 95.75% F1-score, and 96.19% precision. The adoption of Adaboost in AMIKOMNET for the binary class increased the effectiveness of the model to 98.45% accuracy, 96.16% recall, 95.70% F1-score, and 96.87% precision. The adoption of Adaboost in AMIKOMNET in the multiclass classification task also saw an increase in performance, with an accuracy of 96.65%, a recall of 94.93%, an F1-score of 95.76%, and a precision of 96.19%. The implementation of AE to handle image feature extraction combined with a CNN used to handle dimensional image feature reduction achieved outstanding performance when compared to previous work using a deep learning platform. Exploiting Adaboost also increased the effectiveness of the AMIKOMNET model in detecting COVID-19. Full article
(This article belongs to the Special Issue Big Data System for Global Health)
Show Figures

Figure 1

13 pages, 2451 KiB  
Systematic Review
The State of the Art of Artificial Intelligence Applications in Eosinophilic Esophagitis: A Systematic Review
by Martina Votto, Carlo Maria Rossi, Silvia Maria Elena Caimmi, Maria De Filippo, Antonio Di Sabatino, Marco Vincenzo Lenti, Alessandro Raffaele, Gian Luigi Marseglia and Amelia Licari
Big Data Cogn. Comput. 2024, 8(7), 76; https://doi.org/10.3390/bdcc8070076 - 9 Jul 2024
Viewed by 1406
Abstract
Introduction: Artificial intelligence (AI) tools are increasingly being integrated into computer-aided diagnosis systems that can be applied to improve the recognition and clinical and molecular characterization of allergic diseases, including eosinophilic esophagitis (EoE). This review aims to systematically evaluate current applications of AI, [...] Read more.
Introduction: Artificial intelligence (AI) tools are increasingly being integrated into computer-aided diagnosis systems that can be applied to improve the recognition and clinical and molecular characterization of allergic diseases, including eosinophilic esophagitis (EoE). This review aims to systematically evaluate current applications of AI, machine learning (ML), and deep learning (DL) methods in EoE characterization and management. Methods: We conducted a systematic review using a registered protocol published in the International Prospective Register of Systematic Reviews (CRD42023451048). The risk of bias and applicability of eligible studies were assessed according to the prediction model study risk of bias assessment tool (PROBAST). We searched PubMed, Embase, and Web of Science to retrieve the articles. The literature review was performed in May 2023. We included original research articles (retrospective or prospective studies) published in English in peer-reviewed journals, studies whose participants were patients with EoE, and studies assessing the application of AI, ML, or DL models. Results: A total of 120 articles were found. After removing 68 duplicates, 52 articles were reviewed based on the title and abstract, and 34 were excluded. Eleven full texts were assessed for eligibility, met the inclusion criteria, and were analyzed for the systematic review. The AI models developed in three studies for identifying EoE based on endoscopic images showed high score performance with an accuracy that ranged from 0.92 to 0.97. Five studies developed AI models that histologically identified EoE with high accuracy (87% to 99%). We also found two studies where the AI model identified subgroups of patients according to their clinical and molecular features. Conclusions: AI technologies could promote more accurate evidence-based management of EoE by integrating the results of molecular signature, clinical, histology, and endoscopic features. However, the era of AI application in medicine is just beginning; therefore, further studies with model validation in the real-world environment are required. Full article
(This article belongs to the Special Issue Machine Learning Applications and Big Data Challenges)
Show Figures

Figure 1

21 pages, 2914 KiB  
Article
Multimodal Quanvolutional and Convolutional Neural Networks for Multi-Class Image Classification
by Yuri Gordienko, Yevhenii Trochun and Sergii Stirenko
Big Data Cogn. Comput. 2024, 8(7), 75; https://doi.org/10.3390/bdcc8070075 - 8 Jul 2024
Viewed by 1326
Abstract
By utilizing hybrid quantum–classical neural networks (HNNs), this research aims to enhance the efficiency of image classification tasks. HNNs allow us to utilize quantum computing to solve machine learning problems, which can be highly power-efficient and provide significant computation speedup compared to classical [...] Read more.
By utilizing hybrid quantum–classical neural networks (HNNs), this research aims to enhance the efficiency of image classification tasks. HNNs allow us to utilize quantum computing to solve machine learning problems, which can be highly power-efficient and provide significant computation speedup compared to classical operations. This is particularly relevant in sustainable applications where reducing computational resources and energy consumption is crucial. This study explores the feasibility of a novel architecture by leveraging quantum devices as the first layer of the neural network, which proved to be useful for scaling HNNs’ training process. Understanding the role of quanvolutional operations and how they interact with classical neural networks can lead to optimized model architectures that are more efficient and effective for image classification tasks. This research investigates the performance of HNNs across different datasets, including CIFAR100 and Satellite Images of Hurricane Damage by evaluating the performance of HNNs on these datasets in comparison with the performance of reference classical models. By evaluating the scalability of HNNs on diverse datasets, the study provides insights into their applicability across various real-world scenarios, which is essential for building sustainable machine learning solutions that can adapt to different environments. Leveraging transfer learning techniques with pre-trained models such as ResNet, EfficientNet, and VGG16 demonstrates the potential for HNNs to benefit from existing knowledge in classical neural networks. This approach can significantly reduce the computational cost of training HNNs from scratch while still achieving competitive performance. The feasibility study conducted in this research assesses the practicality and viability of deploying HNNs for real-world image classification tasks. By comparing the performance of HNNs with classical reference models like ResNet, EfficientNet, and VGG-16, this study provides evidence of the potential advantages of HNNs in certain scenarios. Overall, the findings of this research contribute to advancing sustainable applications of machine learning by proposing novel techniques, optimizing model architectures, and demonstrating the feasibility of adopting HNNs for real-world image classification problems. These insights can inform the development of more efficient and environmentally friendly machine learning solutions. Full article
(This article belongs to the Special Issue Machine Learning and AI Technology for Sustainable Development)
Show Figures

Figure 1

21 pages, 6223 KiB  
Article
Generative Artificial Intelligence: Analyzing Its Future Applications in Additive Manufacturing
by Erik Westphal and Hermann Seitz
Big Data Cogn. Comput. 2024, 8(7), 74; https://doi.org/10.3390/bdcc8070074 - 6 Jul 2024
Viewed by 3081
Abstract
New developments in the field of artificial intelligence (AI) are increasingly finding their way into industrial areas such as additive manufacturing (AM). Generative AI (GAI) applications in particular offer interesting possibilities here, for example, to generate texts, images or computer codes with the [...] Read more.
New developments in the field of artificial intelligence (AI) are increasingly finding their way into industrial areas such as additive manufacturing (AM). Generative AI (GAI) applications in particular offer interesting possibilities here, for example, to generate texts, images or computer codes with the help of algorithms and to integrate these as useful supports in various AM processes. This paper examines the opportunities that GAI offers specifically for additive manufacturing. There are currently relatively few publications that deal with the topic of GAI in AM. Much of the information has only been published in preprints. There, the focus has been on algorithms for Natural Language Processing (NLP), Large Language Models (LLMs) and generative adversarial networks (GANs). This summarised presentation of the state of the art of GAI in AM is new and the link to specific use cases is this first comprehensive case study on GAI in AM processes. Building on this, three specific use cases are then developed in which generative AI tools are used to optimise AM processes. Finally, a Strengths, Weaknesses, Opportunities and Threats (SWOT) analysis is carried out on the general possibilities of GAI, which forms the basis for an in-depth discussion on the sensible use of GAI tools in AM. The key findings of this work are that GAI can be integrated into AM processes as a useful support, making these processes faster and more creative, as well as to make the process information digitally recordable and usable. This current and future potential, as well as the technical implementation of GAI into AM, is also presented and explained visually. It is also shown where the use of generative AI tools can be useful and where current or future potential risks may arise. Full article
Show Figures

Figure 1

20 pages, 274 KiB  
Review
Trustworthy AI Guidelines in Biomedical Decision-Making Applications: A Scoping Review
by Marçal Mora-Cantallops, Elena García-Barriocanal and Miguel-Ángel Sicilia
Big Data Cogn. Comput. 2024, 8(7), 73; https://doi.org/10.3390/bdcc8070073 - 1 Jul 2024
Viewed by 1388
Abstract
Recently proposed legal frameworks for Artificial Intelligence (AI) depart from some frameworks of concepts regarding ethical and trustworthy AI that provide the technical grounding for safety and risk. This is especially important in high-risk applications, such as those involved in decision-making support systems [...] Read more.
Recently proposed legal frameworks for Artificial Intelligence (AI) depart from some frameworks of concepts regarding ethical and trustworthy AI that provide the technical grounding for safety and risk. This is especially important in high-risk applications, such as those involved in decision-making support systems in the biomedical domain. Frameworks for trustworthy AI span diverse requirements, including human agency and oversight, technical robustness and safety, privacy and data governance, transparency, fairness, and societal and environmental impact. Researchers and practitioners who aim to transition experimental AI models and software to the market as medical devices or to use them in actual medical practice face the challenge of deploying processes, best practices, and controls that are conducive to complying with trustworthy AI requirements. While checklists and general guidelines have been proposed for that aim, a gap exists between the frameworks and the actual practices. This paper reports the first scoping review on the topic that is specific to decision-making systems in the biomedical domain and attempts to consolidate existing practices as they appear in the academic literature on the subject. Full article
Show Figures

Figure 1

24 pages, 7762 KiB  
Article
Semantic Non-Negative Matrix Factorization for Term Extraction
by Aliya Nugumanova, Almas Alzhanov, Aiganym Mansurova, Kamilla Rakhymbek and Yerzhan Baiburin
Big Data Cogn. Comput. 2024, 8(7), 72; https://doi.org/10.3390/bdcc8070072 - 27 Jun 2024
Viewed by 1045
Abstract
This study introduces an unsupervised term extraction approach that combines non-negative matrix factorization (NMF) with word embeddings. Inspired by a pioneering semantic NMF method that employs regularization to jointly optimize document–word and word–word matrix factorizations for document clustering, we adapt this strategy for [...] Read more.
This study introduces an unsupervised term extraction approach that combines non-negative matrix factorization (NMF) with word embeddings. Inspired by a pioneering semantic NMF method that employs regularization to jointly optimize document–word and word–word matrix factorizations for document clustering, we adapt this strategy for term extraction. Typically, a word–word matrix representing semantic relationships between words is constructed using cosine similarities between word embeddings. However, it has been established that transformer encoder embeddings tend to reside within a narrow cone, leading to consistently high cosine similarities between words. To address this issue, we replace the conventional word–word matrix with a word–seed submatrix, restricting columns to ‘domain seeds’—specific words that encapsulate the essential semantic features of the domain. Therefore, we propose a modified NMF framework that jointly factorizes the document–word and word–seed matrices, producing more precise encoding vectors for words, which we utilize to extract high-relevancy topic-related terms. Our modification significantly improves term extraction effectiveness, marking the first implementation of semantically enhanced NMF, designed specifically for the task of term extraction. Comparative experiments demonstrate that our method outperforms both traditional NMF and advanced transformer-based methods such as KeyBERT and BERTopic. To support further research and application, we compile and manually annotate two new datasets, each containing 1000 sentences, from the ‘Geography and History’ and ‘National Heroes’ domains. These datasets are useful for both term extraction and document classification tasks. All related code and datasets are freely available. Full article
Show Figures

Figure 1

18 pages, 414 KiB  
Article
ReJOOSp: Reinforcement Learning for Join Order Optimization in SPARQL
by Benjamin Warnke, Kevin Martens, Tobias Winker, Sven Groppe, Jinghua Groppe, Prasad Adhiyaman, Sruthi Srinivasan and Shridevi Krishnakumar
Big Data Cogn. Comput. 2024, 8(7), 71; https://doi.org/10.3390/bdcc8070071 - 27 Jun 2024
Viewed by 1256
Abstract
The choice of a good join order plays an important role in the query performance of databases. However, determining the best join order is known to be an NP-hard problem with exponential growth with the number of joins. Because of this, nonlearning approaches [...] Read more.
The choice of a good join order plays an important role in the query performance of databases. However, determining the best join order is known to be an NP-hard problem with exponential growth with the number of joins. Because of this, nonlearning approaches to join order optimization have a longer optimization and execution time. In comparison, the models of machine learning, once trained, can construct optimized query plans very quickly. Several efforts have applied machine learning to optimize join order for SQL queries outperforming traditional approaches. In this work, we suggest a reinforcement learning technique for join optimization for SPARQL queries, ReJOOSp. SPARQL queries typically contain a much higher number of joins than SQL queries and so are more difficult to optimize. To evaluate ReJOOSp, we further develop a join order optimizer based on ReJOOSp and integrate it into the Semantic Web DBMS Luposdate3000. The evaluation of ReJOOSp shows its capability to significantly enhance query performance by achieving high-quality execution plans for a substantial portion of queries across synthetic and real-world datasets. Full article
Show Figures

Figure 1

Previous Issue
Next Issue
Back to TopTop