Usability Testing of a Social Media Chatbot for Increasing Physical Activity Behavior
Abstract
:1. Introduction
2. Materials and Methods
2.1. System Development and Requirement Analysis
2.2. Behavior Change Techniques and Functionalities
2.3. Preliminary Study on MYA’s Usability
2.4. Ethics
3. Results
3.1. Participant Characteristics
3.2. Average Ranking of Chatbot Usability Questionnaire
3.3. Usability Study Results (According to CUQ Calculator)
3.4. Additional Feedback from Study Participants
- Identified issues: “The only thing was that the app got stuck at times, and it wasn’t clear how to proceed or if this behavior was intended”
- Preferred chatbot features: “The random challenge is my favorite feature because it really distinguishes this bot from fitness trackers, and motivates me to do some activity”
- Suggestions for improvements: “A weekly activity challenge would be interesting, like a schedule with the desired level”, and “More inputs so that it can talk about everyday subjects like weather and answer some questions”.
4. Discussion
4.1. Social Media Chatbot Features
4.2. Can a Social Media Chatbot Help Increase Physical Activity?
4.3. Study Limitations
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Appendix A
1. What is your gender? ☐ Female ☐ Male ☐ Diverse ☐ Prefer not to say | |||||
2. What is your age group? ☐ 18–29 years ☐ 30–49 years ☐ 50 – 69 years ☐ ≥70 years | |||||
3. How often do you do physical activity? ☐ Less than 30 min per day ☐ More than 30 min per day ☐ Less than 30 min per week ☐ More than 30 min per week ☐ Monthly ☐ Never ☐ Other: 4. How did you interact with MYA? ☐ Telegram desktop app ☐ Telegram mobile app (Android) ☐ Telegram mobile app (iPhone) ☐ Other: | |||||
5. Do you think MYA could help you in increasing your physical activity/change your current activity behavior? ☐ Yes ☐ No ☐ Maybe ☐ Other: 6. For approximately how long did you interact with MYA? ☐ Less than 5 min ☐ 5–15 min ☐ 15–30 min ☐ 30–60 min ☐ More than 60 min ☐ Other: 7. Chatbot Usability Questionnaire (Holmes et al. 2019) | |||||
1 -Strongly Disagree | 2 -Disagree | 3 -Neutral | 4 -Agree | 5 -Strongly Agree | |
Q1 The chatbot’s personality was realistic and engaging | ☐ | ☐ | ☐ | ☐ | ☐ |
Q2 The chatbot seemed too robotic | ☐ | ☐ | ☐ | ☐ | ☐ |
Q3 The chatbot was welcoming during initial setup | ☐ | ☐ | ☐ | ☐ | ☐ |
Q4 The chatbot seemed very unfriendly | ☐ | ☐ | ☐ | ☐ | ☐ |
Q5 The chatbot explained its scope and purpose well | ☐ | ☐ | ☐ | ☐ | ☐ |
Q6 The chatbot gave no indication as to its purpose | ☐ | ☐ | ☐ | ☐ | ☐ |
Q7 The chatbot was easy to navigate | ☐ | ☐ | ☐ | ☐ | ☐ |
Q8 It would be easy to get confused when using the chatbot | ☐ | ☐ | ☐ | ☐ | ☐ |
Q9 The chatbot understood me well | ☐ | ☐ | ☐ | ☐ | ☐ |
Q10 The chatbot failed to recognise a lot of my input | ☐ | ☐ | ☐ | ☐ | ☐ |
Q11 Chatbot responses were useful, appropriate, and informative | ☐ | ☐ | ☐ | ☐ | ☐ |
Q12 Chatbot responses were not relevant | ☐ | ☐ | ☐ | ☐ | ☐ |
Q13 The chatbot coped well with any errors or mistakes | ☐ | ☐ | ☐ | ☐ | ☐ |
Q14 The chatbot seemed unable to handle any errors | ☐ | ☐ | ☐ | ☐ | ☐ |
Q15 The chatbot was very easy to use | ☐ | ☐ | ☐ | ☐ | ☐ |
Q16 The chatbot was very complex | ☐ | ☐ | ☐ | ☐ | ☐ |
8. Any other comments (including suggestions for improvement)? |
Theme (Subtheme) | Examples of Statements |
---|---|
Identified Issues | |
Interaction Difficulties | Many reverse-coded questions—A bit difficult to answer:) |
If I type in ‘challenge’ in a layer where it fits, but apparently not to the Chatbot, it is overwhelmed | |
Incomplete app design | It’s very inaccurate when it comes to counting steps |
The chatbot makes an “unfinished” impression, e.g., the menu below is not always visible and sometimes it is shown with icons and sometimes with / text | |
Spelling errors | Some spelling mistakes (e.g., writte (write), smartes (smartest), reapeat (repeat), etc. |
A lot of spelling errors (reapeat instead of repeat, smartes instead of smartest, etc) | |
Unresponsive/frozen app | The only thing was that the app got stuck at times, and it wasn’t clear how to proceed or if this behavior was intended |
If you want to create a new goal and have him help you with it, he hangs himself up | |
Preferred Chatbot features | |
Challenge feature | The random challenge is my favorite feature because it really distinguishes this bot from fitness trackers, and motivates me to do some activity |
Goal feature | I also like the functionality for checking the user-defined goals |
Suggestions for Improvement | |
Challenge related suggestions | |
Avoid repeating challenge | When the user does not accept the proposed challenge and asks for a different one, the chatbot should avoid suggesting the same one again |
Personalize challenges | It may be nice for the user to be able to personalize the types of challenges (e.g., in the one-time welcome phase, ask the user to select the types of exercise he/she is never going to accept, that can be excluded from the suggestions) |
Weekly activity challenge | A weekly activity challenge would be interesting, like a schedule with the desired level. |
Interaction related suggestions | |
More facts and input | The idea of a chatbot is cool, but it would need to be connected to services and give more inputs. For example, find activities near you that you can do and suggest them |
The chat is too fixed. Would have been much better if it could take varied answers. | |
More facts should be linked to the chatbot. | |
More inputs so that that it can talk about everyday subjects like weather n answer some questions | |
It could be useful if it answered at least a generic sentence, or if it prompted the initial menu again. | |
More empathy and motivation | Finally, she congratulates that I did 2294 steps even though I did not set a goal and she does not motivate me to set one. I would prefer if she could be more empathic and encourage me to set a goal. |
Options always available | The second time I tried the chatbot was a bit weird. The chatbot was in a kind of “stand by” mode, in order to discuss goals/steps/etc... again, you need to remember to type “/menu”. Perhaps the different options should be shown all the time. |
Entries should be checked for their meaningfulness. If menu suggestions are made, then these should also work |
References
- Ding, D.; Lawson, K.D.; Kolbe-Alexander, T.L.; Finkelstein, E.A.; Katzmarzyk, P.T.; van Mechelen, W.; Pratt, M. The economic burden of physical inactivity: A global analysis of major non-communicable diseases. Lancet 2016, 388, 1311–1324. [Google Scholar] [CrossRef]
- World Health Organization. Global Action Plan on Physical Activity 2018–2030. More Active People for a Healthier World. 2018. Available online: https://apps.who.int/iris/bitstream/handle/10665/272722/9789241514187-eng.pdf (accessed on 4 August 2021).
- Guillon, M.; Rochaix, L.; Dupont, J.-C.K. Cost-effectiveness of Interventions Based on Physical Activity in the Treatment of Chronic Conditions: A systematic literature review. Int. J. Technol. Assess Health Care 2018, 34, 481–497. [Google Scholar] [CrossRef] [PubMed]
- Luo, T.C.; Aguilera, A.; Lyles, C.R.; Figueroa, C.A. Promoting Physical Activity Through Conversational Agents: Mixed Methods Systematic Review. J. Med. Internet Res. 2021, 23, e25486. [Google Scholar] [CrossRef] [PubMed]
- Kongstad, M.B.; Valentiner, L.S.; Ried-Larsen, M.; Walker, K.C.; Juhl, C.B.; Langberg, H. Effectiveness of remote feedback on physical activity in persons with type 2 diabetes: A systematic review and meta-analysis of randomized controlled trials. J. Telemed Telecare 2019, 25, 26–34. [Google Scholar] [CrossRef] [PubMed]
- Kramer, L.L.; Ter Stal, S.; Mulder, B.; de Vet, E.; Van Velsen, L. Developing Embodied Conversational Agents for Coaching People in a Healthy Lifestyle: Scoping Review. J. Med. Internet Res. 2020, 22, e14058. [Google Scholar] [CrossRef] [PubMed]
- Laranjo, L.; Ding, D.; Heleno, B.; Kocaballi, B.; Quiroz, J.C.; Tong, H.L.; Chahwan, B.; Neves, A.L.; Gabarron, E.; Dao, K.P.; et al. Do smartphone applications and activity trackers increase physical activity in adults? Systematic review, meta-analysis and metaregression. Br. J. Sports Med. 2021, 55, 422–432. [Google Scholar] [CrossRef] [PubMed]
- Tsoli, S.; Sutton, S.; Kassavou, A. Interactive voice response interventions targeting behaviour change: A systematic literature review with meta-analysis and meta-regression. BMJ Open 2018, 8, e018974. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Petersen, J.M.; Prichard, I.; Kemps, E. A Comparison of Physical Activity Mobile Apps With and Without Existing Web-Based Social Networking Platforms: Systematic Review. J. Med. Internet Res. 2019, 21, e12687. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Gabarron, E.; Larbi, D.; Denecke, K.; Årsand, E. What Do We Know About the Use of Chatbots for Public Health? Stud Health Technol. Inf. 2020, 270, 796–800. [Google Scholar] [CrossRef]
- Kramer, J.-N.; Tinschert, P.; Scholz, U.; Fleisch, E.; Kowatsch, T. A Cluster-Randomized Trial on Small Incentives to Promote Physical Activity. Am. J. Prev. Med. 2019, 56, e45–e54. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Larbi, D.; Gabarron, E.; Denecke, K. Social Media Chatbot for Increasing Physical Activity: Usability Study. Stud. Health Technol. Inf. 2021, 285, 227–232. [Google Scholar] [CrossRef]
- Michie, S.; Richardson, M.; Johnston, M.; Abraham, C.; Francis, J.; Hardeman, W.; Eccles, M.P.; Cane, J.; Wood, C.E. The Behavior Change Technique Taxonomy (v1) of 93 Hierarchically Clustered Techniques: Building an International Consensus for the Reporting of Behavior Change Interventions. Ann. Behav. Med. 2013, 46, 81–95. [Google Scholar] [CrossRef] [PubMed]
- Flow XO LLC. Create a Chatbot with Zero Coding Skills Required. 2020. Available online: https://flowxo.com (accessed on 15 January 2021).
- Holmes, S.; Moorhead, A.; Bond, R.; Zheng, H.; Coates, V.; McTear, M. Usability testing of a healthcare chatbot: Can we use conventional methods to assess conversational user interfaces? In Proceedings of the 31st European Conference on Cognitive Ergonomics; ECCE: Belfast, UK, 2019; pp. 207–214. [Google Scholar]
- Research & Impact. Ulster University. Available online: https://www.ulster.ac.uk/research/topic/computer-science/artificial-intelligence/projects/cuq (accessed on 15 January 2022).
- De Cicco, R.; Iacobucci, S.; Aquino, A.; Romana Alparone, F.; Palumbo, R. Understanding Users’ Acceptance of Chatbots: An Extended TAM Approach. In Chatbot Research and Design, CONVERSATIONS 2021, Lecture Notes in Computer Science; Følstad, A., Araujo, T., Papadopoulos, S., Law, E.L.-C., Luger, E., Goodwin, M., Brandtzaeg, P.B., Eds.; Springer: Cham, Switzerland, 2022; Volume 13171, pp. 3–22. [Google Scholar]
- Nadarzynski, T.; Miles, O.; Cowie, A.; Ridge, D. Acceptability of artificial intelligence (AI)-led chatbot services in healthcare: A mixed-methods study. Digit Health 2019, 5, 2055207619871808. [Google Scholar] [CrossRef] [PubMed]
- Piao, M.; Kim, J.; Ryu, H.; Lee, H. Development and Usability Evaluation of a Healthy Lifestyle Coaching Chatbot Using a Habit Formation Model. Healthc Inform Res. 2020, 26, 255–264. [Google Scholar] [CrossRef] [PubMed]
- Greenberg, A. Fleeing WhatsApp for Better Privacy? Don’t Turn to Telegram. 2021. Available online: https://www.wired.com/story/telegram-encryption-whatsapp-settings/ (accessed on 2 March 2022).
- Zhang, J.; Oh, Y.J.; Lange, P.; Yu, Z.; Fukuoka, Y. Artificial Intelligence Chatbot Behavior Change Model for Designing Artificial Intelligence Chatbots to Promote Physical Activity and a Healthy Diet: Viewpoint. J. Med. Internet Res. 2020, 22, e22845. [Google Scholar] [CrossRef] [PubMed]
- Kramer, J.-N.; Künzler, F.; Mishra, V.; Smith, S.N.; Kotz, D.; Scholz, U.; Fleisch, E.; Kowatsch, T. Which Components of a Smartphone Walking App Help Users to Reach Personalized Step Goals? Results From an Optimization Trial. Ann. Behav. Med. 2020, 54, 518–528. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Kocielnik, R.; Xiao, L.; Avrahami, D.; Hsieh, G. Reflection Companion: A Conversational System for Engaging Users in Reflection on Physical Activity. Proc. ACM Interact. Mob. Wearable Ubiquitous Technol. 2018, 2, 1–26. [Google Scholar] [CrossRef]
- Piao, M.; Ryu, H.; Lee, H.; Kim, J. Use of the Healthy Lifestyle Coaching Chatbot App to Promote Stair-Climbing Habits Among Office Workers: Exploratory Randomized Controlled Trial. JMIR Mhealth Uhealth 2020, 8, e15085. [Google Scholar] [CrossRef] [PubMed]
- Maher, C.A.; Davis, C.R.; Curtis, R.G.; Short, C.E.; Murphy, K.J. A Physical Activity and Diet Program Delivered by Artificially Intelligent Virtual Health Coach: Proof-of-Concept Study. JMIR Mhealth Uhealth 2020, 8, e17558. [Google Scholar] [CrossRef] [PubMed]
Conversation Flow | Description |
---|---|
First encounter | Started only the first time MYA is used. Collects basic information on the user and explains the usage of the chatbot. A daily step goal is specified. |
Further encounter | Greeting for any other than the first encounter. MYA asks the user about his well-being and tries to encourage the user. |
Menu | Offers access to the four functions: goals, challenges, steps, and facts. |
Goals | Allows the user to specify a goal for long-time encouragement. |
Challenges | Out of a set of user-tailored challenges, one is selected. |
Steps today | Checks the number of steps (simulated step count). This function compares the set goal with current number of steps. If the step goal is not achieved, MYA encourages the user to take more steps. |
Facts | Presentation of a randomly selected fact on health and activity behavior. |
Chatting | Allows out-of-topic chatting with the bot. Current version of MYA is not designed to start out-of-topic discussions. |
Help | Provides help on the various functions. |
Age Group | Gender | ||
---|---|---|---|
Female | Male | Total | |
18–29 years | 3 (10%) | 6 (20%) | 9 (30%) |
30–49 years | 12 (40%) | 6 (20%) | 18 (60%) |
50–69 years | 0 | 3 (10%) | 3 (10%) |
Total | 15 (50%) | 15 (50%) | 30 (100%) |
Participant Characteristic | Mean CUQ Score | Median CUQ | Lowest Score | Highest Score |
---|---|---|---|---|
Gender | ||||
Female | 59.9 ± 18.06 | 60.9 | 29.7 | 92.2 |
Male | 54.9 ± 15.5 | 56.3 | 29.7 | 75.0 |
Age Group | ||||
18 and 29 years | 59.2 ± 20.7 | 68.8 | 29.7 | 92.2 |
30 and 49 years | 59.3 ± 14.6 | 62.5 | 29.7 | 87.5 |
50–69 years | 40.6 ± 8.1 | 45.3 | 31.3 | 45.3 |
MYA’s ability to increase physical activity behavior | ||||
Maybe | 57.9 ± 16.2 | 60.2 | 29.7 | 92.2 |
No | 49.1 ± 15.5 | 43.8 | 31.3 | 71.9 |
Yes | 64.5 ± 17.7 | 68.8 | 29.7 | 87.5 |
Mode of Interaction | ||||
Telegram desktop app | 52.6 ± 21.1 | 43.8 | 29.7 | 92.2 |
Telegram mobile app | 59.5 ± 14.6 | 64.1 | 29.7 | 87.5 |
Android phone | 61.5 ± 14.8 | 64.1 | 29.7 | 87.5 |
iPhone | 56.9 ± 15.7 | 60.9 | 31.3 | 73.4 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Larbi, D.; Denecke, K.; Gabarron, E. Usability Testing of a Social Media Chatbot for Increasing Physical Activity Behavior. J. Pers. Med. 2022, 12, 828. https://doi.org/10.3390/jpm12050828
Larbi D, Denecke K, Gabarron E. Usability Testing of a Social Media Chatbot for Increasing Physical Activity Behavior. Journal of Personalized Medicine. 2022; 12(5):828. https://doi.org/10.3390/jpm12050828
Chicago/Turabian StyleLarbi, Dillys, Kerstin Denecke, and Elia Gabarron. 2022. "Usability Testing of a Social Media Chatbot for Increasing Physical Activity Behavior" Journal of Personalized Medicine 12, no. 5: 828. https://doi.org/10.3390/jpm12050828
APA StyleLarbi, D., Denecke, K., & Gabarron, E. (2022). Usability Testing of a Social Media Chatbot for Increasing Physical Activity Behavior. Journal of Personalized Medicine, 12(5), 828. https://doi.org/10.3390/jpm12050828