An Information Management Model for Addressing Residents’ Complaints through Artificial Intelligence Techniques
Abstract
:1. Introduction
2. Building Defects
3. Natural Language Processing and Machine Learning
4. Research Method
4.1. Process Mapping
4.2. Development of the Defect Classification System
4.3. Development of Word Menu
4.4. Training of Algorithmic Classifiers
- Data split into two sets: one was used to train the models and the other to test the performance of fitted models. Three proportions were used for training and testing: (i) 70% and 30%; (ii) 75% and 25%; and (iii) 80% and 20%.
- Text vectorization: the Document Feature Matrix which convert text documents into a matrix was applied. The rows are the texts and the columns are the words in the texts. “Bag of words” (BOW) and “Term frequency and inverse document frequency” (TF-IDF) are very common Document Feature Matrix models in NLP. In BOW, word frequency is used as a variable and word sorting is not considered, while TF-IDF measures the importance of a term in a text by assigning a weight [34].
- Tuning of hyperparameters: a Random Search and cross-validation with five folds was implemented to avoid overfitting and underfitting [37]. Five folds were chosen to increase the probability that all categories would be included in the folds as the database had categories with low frequency.
- Evaluation of performance: accuracy, precision, recall, and F1 score were the measures used to evaluate the performance of the models. Accuracy is the percentage of correct predictions made by the model [37]. Precision measures the rate of true positives to all positive predictions, while recall measures the rate between true positives and the sum of true positive and false negative predictions [38]. Lastly, the F1 score is a harmonic mean between precision and recall [38].
- Feature engineering: synthetic features (words) were added to the records to improve performance. These were selected according to (i) bi-terms that appeared at least ten times in the records (e.g., “bathroom suites” and “single bedroom”); (ii) most frequent words in the low-frequency categories, and (iii) words indicated by the Keyness score. The Keyness score is a word importance indicator calculated using the chi-squared test, which evaluates whether a term occurs more often in a class than in the whole database [61].
- Resampling to unbalanced classes: when there are large differences in class size, the algorithms often result in biases in favor of high-frequency classes, treating low-frequency classes as noise [62]. The complaint database contains some categories with a low occurrence frequency but of great severity. Therefore, an oversampling approach was used, randomly duplicating the data from the minority class until the data number of the majority class was reached.
4.5. Evaluation of Artifacts
5. Results
5.1. Assessment of the Existing Customer Complaint Service
- Receipt of complaints: the sector named Customer Relationship Unit received customer complaints by phone or through the company website. The complaint was recorded in a descriptive text. Then, the sector checked whether the defect type was still covered by the warranty period, and a technical inspection was scheduled. Staff from the Customer Relationship Unit did not have any technical background in construction, and errors in complaint analysis often occurred, such as sending problems to the warranty service team that were no longer covered by the period. This issue overloaded the technical staff from the Warranty Service Department who were involved in building inspections. Another problem was that records were incomplete and had inconsistent descriptions of defects, making it difficult for the technical staff to clearly understand the problem before the visit.
- Technical inspection: the initial investigation of the defect causes was carried out during a visit by a technician with general knowledge about building technologies. A paper form was used during this inspection to describe the defect type and its origin (e.g., design, construction, use phase). As this form had descriptive fields, the information collected was often incomplete. For instance, no information was collected about the lack of preventive maintenance or improper product use, which could provide relevant feedback to the quality system. During the visits, customers often took the opportunity to report other quality problems. If the causes of the defect could not be identified, a new inspection was scheduled to carry out other tests, frequently held by a specialist (e.g., electrician or plumber). Finally, the person or crew in charge of carrying out the repair service was identified and the repair service was scheduled.
- Repair service: After the repair, additional data were collected and recorded in the company’s information system, such as defect classification, team name, and number of work hours. The company classified the defects according to 58 categories of building parts, such as plumbing system and brick wall, and 477 categories of defect types. The criteria adopted in those classifications were ambiguous and not detailed enough to enable a clear understanding of the causes. Moreover, only 51.36% of the defect categories had been used to classify complaints in the database. Finally, other defects reported verbally by the customer during the inspection were usually not recorded by the company, and other data collected often required manual pre-processing and categorizing to allow systematic analysis for feedback purposes.
- Feedback process: The quality problems identified by the Warranty Service Department were reported to other sectors (e.g., design, production, material supply, etc.). This process was not based on the complaint database due to its limitations, but according to the maintenance team’s perceptions of the most frequently reported quality problems.
5.2. Proposed Model
- Receipt of complaint: the customer can choose between a set of words that best describe the existing problem. The words chosen are classified automatically by a machine learning algorithm according to a defect classification system. Thus, the system recommends to the warranty service team which defect type should be investigated during the inspection, which can lead to a reduction in service lead time. If the menu does not include the problem perceived by the customer, a complementary text field is provided to specify the complaint. These texts can be further incorporated into the word menu as new options, increasing the amount of information available in the system.
- Inspection and repair: the technical staff is in charge of confirming the classifications suggested by the system and collecting additional data on the defect during the inspection. More than one cause can be identified for a single defect. If the proposed category is considered wrong, the technician must choose another category or suggest a new one. A critical review of additional causes of defects must be conducted so that unnecessary categories are not stored in the system.
- Feedback process: the data collected must be processed and analyzed, generating quality indicators and knowledge for the company, such as indicators on the frequency of claims for different building elements, and the causes of building defects. This information can be used to support decision-making and provide feedback for future projects.
5.2.1. Defect Classification System
5.2.2. Word Menu
5.2.3. Recommendation System
6. Discussion
6.1. Solution Evaluation
- Ease of use: during the complaint receipt stage, the choice of words made by the customer was easy to understand as the taxonomy used in the menu is based on the language naturally spoken by customers. Moreover, the options were organized logically, starting from macro (location) to micro level (defect type) information. Accordingly, the ease of lodging a complaint can reduce the negative effect of a repair service on customers. From the point of view of the warranty service teams, the defect classifications are also based on a logical and organized structure that allows ease of use.
- Use the model in other contexts: the word menu and the recommendation system had a wide range of defect types and were developed based on data from 30 projects. This indicates that it is potentially applicable to other housing or building companies and to a range of building elements. Nonetheless, the content of the database depends on the design type and building technology adopted in each region. The algorithms had high classification performance levels, and the hardware’s processing capacity did not need to be high to classify quality problems effectively, according to the runtime of the algorithms.
- Improvements in data collection: the word menu improved data collection by avoiding the loss of essential data to understand the problem, i.e., data that are often forgotten or not provided by the user. The same occurs for the proposed defect classification system, which has five levels of information detail, including the cause of building defects. The set of words covers many types of complaints due to the way the database was built. However, the word menu and recommendation system need to be updated periodically as new building technologies and other types of defects emerge. Thus, new data must be used from time to time to train the ML algorithms.
- Process automation: the recommendation system automatically classifies problems and indicates the type of defect claimed, eliminating the steps of complaint analysis performed by the customer relationship service. Often, staff from that unit do not have much knowledge about building defects. Consequently, wrong data input overloads warranty service teams. Moreover, there is the possibility of eliminating some steps in warranty services. For instance, the first inspection is carried out by a building technician who often requests specialized professionals to perform tests, such as plumbers and electricians, resulting in long investigation times. This step can be eliminated as the type of defect is identified more accurately. As a result, there is a reduction of the negative impact of the repair service on customer satisfaction. These benefits can also help reduce warranty service costs.
- Contributions to the feedback process: the enhanced database can be used to identify the most important building defects, e.g., by associating problems with features of projects. In addition, different levels of the classification system can be used according to the type of assessment. For instance, metrics related to defects in the building systems level are relevant for managers to identify the most critical ones. In contrast, the analysis of “defect type” and “cause” can be directed to the operational levels in which the origin of the defect can be eliminated. Therefore, the defect classification system can be used to provide feedback and support decision-making in both product design and production management. Consequently, building quality tends to improve, which extends the service life of building components and reduces the occurrence of quality problems at handover.
6.2. Managerial Insights and Recommendations
6.3. Theoretical Contributions
7. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Plebankiewicz, E.; Malara, J. Analysis of defects in residential buildings reported during the warranty period. Appl. Sci. 2020, 10, 6123. [Google Scholar] [CrossRef]
- Love, P.E.D.; Teo, P.; Morrison, J. Revisiting quality failure costs in construction. J. Constr. Eng. Manag. 2018, 144, 05017020. [Google Scholar] [CrossRef]
- Park, J.; Seo, D. Defect Repair Cost and Home Warranty Deposit, Korea. Build 2022, 12, 1027. [Google Scholar] [CrossRef]
- Hopkin, T.; Lu, S.L.; Sexton, M.; Rogers, P. Learning from defects in the UK housing sector using action research: A case study of a housing association. Eng. Constr. Archit. Manag. 2019, 26, 1608–1624. [Google Scholar] [CrossRef]
- Carretero-Ayuso, M.J.; Rodríguez-Jiménez, C.E.; Bienvenido-Huertas, D.; Moyano, J.J. Interrelations between the types of damages and their original causes in the envelope of buildings. J. Build. Eng. 2021, 39, 102235. [Google Scholar] [CrossRef]
- Brito, J.N.D.S.; Formoso, C.T.; Echeveste, M.E.S. Analysis of complaint data in social house-building projects: A study in the Residential Leasing Program. Amb. Constr. 2011, 11, 151–166. [Google Scholar] [CrossRef]
- Asante, L.A.; Quansah, D.P.O.; Ayitey, J.; Kuusaana, E.D. The practice of Defect Liability Period in the real estate industry in Ghana. SAGE Open 2017, 7, 2158244017727038. [Google Scholar] [CrossRef] [Green Version]
- Hopkin, T.; Lu, S.; Rogers, P.; Sexton, M. Key stakeholders’ perspectives towards UK new-build housing defects. Int. J. Build. Pathol. 2017, 35, 110–123. [Google Scholar] [CrossRef]
- Levi, U. Contractual Defects and Statutory Defect Liability in Queensland. 2016. Available online: http://www.findlaw.com.au/arti-cles/5238/contractual-defects-and-statutory-defect-liability.aspx (accessed on 30 April 2020).
- Brazil Consumer Protection Code. Law nº 8.078. 11 September 1990; It Provides Information on Consumer Protection and Makes Other Provisions. Available online: https://www.planalto.gov.br/ccivil_03/leis/l8078compilado.htm (accessed on 12 September 2019).
- ABNT (Brazilian Association of Technical Standards). NBR 15575-1: Residential Buildings up to Five Floors—Performance Part 1: General Requirements; ABNT: Rio de Janeiro, Brazil, 2013. [Google Scholar]
- Assaf, S.; Srour, I. Using a data driven neural network approach to forecast building occupant complaints. Build. Environ. 2021, 200, 107972. [Google Scholar] [CrossRef]
- Chen, W.; Chen, K.; Cheng, J.C.P.; Wang, Q.; Gan, V.J.L. BIM-based framework for automatic scheduling of facility maintenance work orders. Automat. Constr. 2018, 91, 15–30. [Google Scholar] [CrossRef]
- Bortolini, R.; Forcada, N. Analysis of building maintenance requests using a text mining approach: Building services evaluation. Build. Res. Inf. 2019, 48, 207–217. [Google Scholar] [CrossRef]
- Cupertino, D.; Brandstetter, M.C.G.O. Proposal for a post-work management tool based on the records of technical assistance requests. Amb. Constr. 2015, 15, 243–265. [Google Scholar] [CrossRef] [Green Version]
- Peng, Y.; Lin, J.R.; Zhang, J.P.; Hu, Z.Z. A hybrid data mining approach on BIM-based building operation and maintenance. Build. Environ. 2017, 126, 483–495. [Google Scholar] [CrossRef]
- Milion, R.N.; Alves, T.D.C.; Paliari, J.C. Impacts of residential construction defects on customer satisfaction. Int. J. Build. Pathol. 2017, 35, 218–232. [Google Scholar] [CrossRef]
- Ahmed, V.; Aziz, Z.; Tezel, A.; Riaz, Z. Challenges and drivers for data mining in the AEC sector. Eng. Constr. Archit. Manag. 2018, 25, 1436–1453. [Google Scholar] [CrossRef]
- Petrova, E.; Pauwels, P.; Svidt, K.; Jensen, R.L. Towards data-driven sustainable design: Decision support based on knowledge discovery in disparate building data. Archit. Eng. Des. Manag. 2019, 15, 334–356. [Google Scholar] [CrossRef] [Green Version]
- Liddy, E.D. Natural Language Processing for Information Retrieval. In Encyclopedia of Library and Information Sciences, 4th ed.; Taylor & Francis: New York, NY, USA, 2018. [Google Scholar]
- Mcarthur, J.J.; Shahbazy, R.F.; Raghubar, C.; Bortoluzzi, B. Machine learning and BIM visualisation for maintenance issue classification and enhanced data collection. Adv. Eng. Inf. 2018, 38, 101–112. [Google Scholar] [CrossRef]
- Gunay, H.B.; Shen, W.; Yang, C. Text-mining building maintenance work orders for component fault frequency. Build. Res. Inf. 2019, 47, 518–533. [Google Scholar] [CrossRef]
- Mo, Y.; Zhao, D.; Du, J.; Syal, M.; Aziz, A.; Li, H. Automated staff assignment for building maintenance using natural language processing. Automat. Constr. 2020, 113, 103150. [Google Scholar] [CrossRef]
- ISO (International Organization for Standardization). ISO 9000: Quality Management Systems—Fundamentals and Vocabulary; ISO: Geneva, Switzerland, 2005. [Google Scholar]
- Atkinson, G. A century of defects. Building 1987, 252, 54–55. [Google Scholar]
- Marinho, J.L.A. Building Pathology: Occurrences in Buildings and Historic Heritage, 2nd ed.; Leud: Guarulhos, Brazil, 2022; p. 296. [Google Scholar]
- Forcada, N.; Macarulla, M.; Gangolells, M.; Casals, M. Handover defects: Comparison of construction and post-handover housing defects. Build. Res. Inf. 2016, 44, 279–288. [Google Scholar] [CrossRef] [Green Version]
- Connor, J.T.O.; Koo, H.J. Proactive Design Quality Assessment Tool for Building Projects. J. Constr. Eng. Manag. 2021, 147, 04020174. [Google Scholar] [CrossRef]
- Gonzalez-Caceres, A.; Bobadilla, A.; Karlshøj, J. Implementing post-occupancy evaluation in social housing complemented with BIM: A case study in Chile. Build Environ. 2019, 158, 260–280. [Google Scholar] [CrossRef]
- Schultz, C.S.; Jørgensen, K.; Bonke, S.; Rasmussen, G.M.G. Building defects in Danish construction: Project characteristics influencing the occurrence of defects at handover. Archit. Eng. Des. Manag. 2015, 11, 423–439. [Google Scholar] [CrossRef]
- Josephson, P.-E.; Hammarlund, Y. The causes and costs of defects in construction. Automat. Constr. 1999, 8, 681–687. [Google Scholar] [CrossRef]
- Grussing, M.N.; Liu, L.Y. Knowledge-based optimization of building maintenance, repair, and renovation activities to improve facility life cycle investments. J. Perform. Constr. Facil. 2014, 28, 539–548. [Google Scholar] [CrossRef]
- Aljassmi, H.; Han, S.; Davis, S. Analysis of the Complex Mechanisms of Defect Generation in Construction Projects. J. Constr. Eng. Manag. 2015, 138, 51–60. [Google Scholar] [CrossRef]
- Silge, J.; Robinson, D. Text Mining with R: A Tidy Approach; O’Reilly Media: Sebastopol, CA, USA, 2017. [Google Scholar]
- Lane, H.; Howard, C.; Hapke, H.M. Natural Language Processing in Action Understanding, Analyzing, and Generating Text with Python; Manning: New York, NY, USA, 2019. [Google Scholar]
- Fan, C.L. Defect risk assessment using a hybrid machine learning method. J. Constr. Eng. Manag. 2020, 146, 04020102. [Google Scholar] [CrossRef]
- Géron, A. Hands-on Machine Learning with Scikit-Learn, Keras, and Tensor Flow: Concepts, Tools, and Techniques to Build Intelligent Systems; O’Reilly Media: Sebastopol, CA, USA, 2019. [Google Scholar]
- James, G.; Witten, D.; Hastie, T.; Tibshirani, R. An Introduction to Statistical Learning: With Applications in R, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2021. [Google Scholar]
- Zhou, Z. Machine Learning; Springer Nature: Berlin/Heidelberg, Germany, 2021. [Google Scholar]
- Breiman, L. Random Forests. In Machine Learning; Springer: Berlin/Heidelberg, Germany, 2001. [Google Scholar] [CrossRef]
- Friedman, J.H. Stochastic gradient boosting. Comput. Stat. Data 2002, 38, 367–378. [Google Scholar] [CrossRef]
- Bengio, Y.; Ducharme, R.; Vincent, P.; Jauvin, C. A neural probabilistic language model. J. Mach. Learn. Res. 2003, 3, 1137–1155. [Google Scholar]
- Sadiq, S.; Umer, M.; Ullah, S.; Mirjalili, S.; Rupapara, V.; Nappi, M. Discrepancy detection between actual user reviews and numeric ratings of Google App store using deep learning. Expert Syst. Appl. 2021, 181, 115111. [Google Scholar] [CrossRef]
- Tian, Z.; Wang, Y.; Song, Y.; Lee, D.; Zhao, Y.; Li, D.; Zhang, N.L. Empathetic and Emotionally Positive Conversation Systems with an Emotion-specific Query-Response Memory. In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Abu Dhabi, United Arab Emirates, 7–11 December 2022; Association for Computational Linguistics: Stroudsburg, PA, USA, 2022; pp. 6364–6376. [Google Scholar]
- Salama, D.M.; El-Gohary, N.M. Semantic Text Classification for Supporting Automated Compliance Checking in Construction. J. Comput. Civ. Eng. 2016, 30, 04014106. [Google Scholar] [CrossRef]
- Moon, S.; Lee, G.; Chi, S.; Oh, H. Automated Construction Specification Review with Named Entity Recognition Using Natural Language Processing. J. Constr. Eng. Manag. 2021, 147, 04020147. [Google Scholar] [CrossRef]
- Zhang, F.; Fleyeh, H.; Wang, X.; Lu, M. Construction site accident analysis using text mining and natural language processing techniques. Autom. Constr. 2019, 99, 238–248. [Google Scholar] [CrossRef]
- Jordan, M.I.; Mitchell, T.M. Machine learning: Trends, perspectives, and prospects. J. Sci. 2015, 349, 255–260. [Google Scholar] [CrossRef]
- Adam, M.; Wessel, M.; Benlian, A. AI-based chatbots in customer service and their effects on user compliance. Electron. Mark. 2021, 31, 427–445. [Google Scholar] [CrossRef] [Green Version]
- Schmidt, A. Interactive Human Centered Artificial Intelligence: A Definition and Research Challenges. In Proceedings of the ACM International Conference Proceeding Series, Salerno, Italy, 28 September 2020. [Google Scholar] [CrossRef]
- Peffers, K.; Tuunanen, T.; Rothenberger, M.A.; Chatterjee, S. A design science research methodology for information systems research. J. Manag. Inf. Syst. 2007, 24, 45–77. [Google Scholar] [CrossRef]
- Van Aken, J.; Chandrasekaran, A.; Halman, J. Conducting and publishing design science research: Inaugural essay of the design science department of the Journal of Operations Management. J. Oper. Manag. 2016, 47, 1–8. [Google Scholar] [CrossRef]
- March, S.T.; Smith, G.F. Design and natural science research on information technology. Decis. Support Syst. 1995, 5, 251–266. [Google Scholar] [CrossRef]
- Kasanen, E.; Lukka, K.; Siitonen, A. The constructive approach in management accounting research. J. Manag. Account. Res. 1993, 5, 243–264. [Google Scholar] [CrossRef]
- ISO (International Organization for Standardization). ISO 19208: Framework for Specifying Performance in Buildings; ISO: Geneva, Switzerland, 2016. [Google Scholar]
- Berr, L.R. Evaluation Method of Construction Quality in Social Housing Units in the Use Step—Technical Analysis and Perception by Users. Ph.D. Thesis, Federal University of Rio Grande do Sul, Porto Alegre, Brazil, 2016. [Google Scholar]
- R Core Team. R: A Language and Environment for Statistical Computing [R Software]; R Foundation for Statistical Computing; R Core Team: Vienna, Austria, 2021. [Google Scholar]
- Bouchet-Valat, M.; SnowballC: Snowball Stemmers Based on the C ‘libstemmer’ UTF-8 Library. R Package Version 0.7.0. 2020. Available online: https://CRAN.R-project.org/package=SnowballC (accessed on 2 March 2021).
- Benoit, K.; Watanabe, K.; Wang, H.; Nulty, P.; Obeng, A.; Müller, S.; Matsuo, A. Quanteda: An R package for the quantitative analysis of textual data. J. Open Source Softw. 2018, 3, 774. [Google Scholar] [CrossRef] [Green Version]
- Kuhn, M.; Caret: Classification and Regression Training. R package version 6.0-86. 2020. Available online: https://CRAN.R-project.org/package=caret (accessed on 18 January 2021).
- Gabrielatos, C. Keyness analysis: Nature, metrics and techniques. In Corpus Approaches to Discourse: A Critical Review; Routledge: Oxford, UK, 2018. [Google Scholar]
- Kaur, P.; Gosain, A. Comparing the Behaviour of Oversampling and Undersampling Approach of Class Imbalance Learning by Combining Class Imbalance Problem with Noise; Springer: Berlin/Heidelberg, Germany, 2018. [Google Scholar]
- Bortolini, R.; Forcada, N. Building Inspection System for Evaluating the Technical Performance of Existing Buildings. J. Perform. Constr. Facil. 2019, 32, 1–14. [Google Scholar] [CrossRef]
Company Position | Educational Background | Experience in the Construction Industry (Years) | Experience in the Warranty Service Department (Years) |
---|---|---|---|
Technician | Building Technology | 6 | 4 |
Head of Department | Civil Engineering | 4 | 2 |
Technician | Building Technology | 8 | 1 |
Technician | Building Technology | 6 | 6 |
Maintenance Manager | Civil Engineering | 17 | 2 |
Constructs | Criteria | Guiding Questions |
---|---|---|
Applicability | Ease of use | Is the model clear and easy to understand by users? Which skill level do users need to operate the model? |
Possibility of using the model in other contexts | How can different companies use the model? | |
Utility | Improvements in data collection | To what extent does the model increase the reliability and completeness of the data? |
Process automation | How much effort is necessary to collect and process complaint data? | |
Contributions to quality management | Which type of information can be generated to provide feedback? |
Model | Runtime in the Training | Accuracy |
---|---|---|
Naïve Bayes | 0.04 s | 77.71% |
SVM | 5.24 m | 77.95% |
RF | 47.32 m | 81.68% |
Gradient Boosting | 4.59 h | 82.89% |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Bazzan, J.; Echeveste, M.E.; Formoso, C.T.; Altenbernd, B.; Barbian, M.H. An Information Management Model for Addressing Residents’ Complaints through Artificial Intelligence Techniques. Buildings 2023, 13, 737. https://doi.org/10.3390/buildings13030737
Bazzan J, Echeveste ME, Formoso CT, Altenbernd B, Barbian MH. An Information Management Model for Addressing Residents’ Complaints through Artificial Intelligence Techniques. Buildings. 2023; 13(3):737. https://doi.org/10.3390/buildings13030737
Chicago/Turabian StyleBazzan, Jordana, Márcia Elisa Echeveste, Carlos Torres Formoso, Bernardo Altenbernd, and Márcia Helena Barbian. 2023. "An Information Management Model for Addressing Residents’ Complaints through Artificial Intelligence Techniques" Buildings 13, no. 3: 737. https://doi.org/10.3390/buildings13030737
APA StyleBazzan, J., Echeveste, M. E., Formoso, C. T., Altenbernd, B., & Barbian, M. H. (2023). An Information Management Model for Addressing Residents’ Complaints through Artificial Intelligence Techniques. Buildings, 13(3), 737. https://doi.org/10.3390/buildings13030737