Decoding Subjective Understanding: Using Biometric Signals to Classify Phases of Understanding
Abstract
:1. Introduction
2. Materials and Methods
2.1. Procedure
2.2. Participants
2.3. Phases of Understanding
2.4. Action Units
2.5. Data Analysis
2.5.1. Interrater Reliability
2.5.2. Machine Learning
- Splitting data: Stratified 80–20 test–train splits were conducted to address moderate class imbalance across phases.
- Feature scaling: Feature scaling was implemented for logistic regression, lasso, and elastic net, as these algorithms are sensitive to the range of feature values. AUs were standardized across training and test sets.
- Cross-validation setup: 10-fold stratified cross-validation was used for robust model evaluation. Cross-validation was intentionally omitted for logistic regression, allowing the model to serve as a benchmark for traditional inferential statistical modeling.
- Selecting evaluation metrics: Models were evaluated using precision, recall, F1 score, and AUC. Specifically, precision measures the proportion of true positives among all instances predicted as positive, indicating the accuracy of positive predictions. Recall measures the proportion of true positives correctly identified out of all actual positives. The F1 score is the harmonic mean of precision and recall, balancing accuracy and completeness. AUC (Area Under the Curve) evaluates the model’s performance across all classification thresholds, reflecting its ability to distinguish between classes. During cross-validation, metrics for each phase were averaged across the 10 folds to yield a consolidated measure of performance. Weighted averages were calculated for each metric, based on the number of observations per phase. All metrics were calculated using the One-vs-Rest (OvR) approach.
- Training baseline models: Baseline GBM, RF, decision tree, lasso, and elastic net models were trained using default package settings. Algorithms demonstrating promise were selected for optimization.
- Hyperparameter tuning: Models were optimized through hyperparameter tuning. Tuning focused on hyperparameters and grid searches considered most relevant to improving model performance, while accounting for efficiency and computational resources required.
- Test-set evaluation: Finally, optimized models and the logistic regression model were evaluated on test sets.
3. Results
3.1. Inter-Rater Reliability
3.2. Descriptive Statistics
3.3. Machine Learning Analysis
4. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Appendix A
- Riddle 1: I start with M, end with X and have a never-ending amount of letters. What am I? Answer: Mailbox.
- Riddle 2: What 5 letter word becomes shorter when you add two letters to it? Answer: Short.
- Riddle 3: What building has the most stories? Answer: Library.
- Riddle 4: I have two coins that equal 30 cents and one is not a nickel. What two coins do I have? Answer: A quarter and nickel.
- Riddle 5: I saw a boat full of people, yet there wasn’t a single person on board. How is this possible? Answer: Everyone on the boat was in a relationship.
- Riddle 6: Three different doctors said that Paul is their brother, yet Paul claims he has no brothers. Who is lying? Answer: No one.
- Riddle 7: Why is it against the law for a man living in Delhi to be buried in Mumbai? Answer: Because he is alive.
- Riddle 8: Before Mt. Everest was discovered, what was the highest mountain in the world? Answer: Mt. Everest.
- Riddle 9: A big brown cow is lying down in the middle of a country road. The streetlights are not on, the moon is not out, and the skies are heavily clouded. A truck is driving towards the cow at full speed, its headlights off. Yet the driver sees the cow from afar easily, and avoids hitting it, without even having to brake hard. How is that possible? Answer: It was daytime
- Riddle 10: Which month has 28 days? Answer: Every month has at least 28 days.
- Riddle 11: In a lake, there is a patch of lily pads. Every day, the patch doubles in size. If it takes 48 days to cover the entire lake, how long does it take for the patch to cover half of the lake? Answer: 47 days.
- Riddle 12: You are a cyclist in a cross-country race. Just before the crossing finish line, you overtake the person in second place. In what place did you finish? Answer: Second.
- Riddle 13: A bat and a ball cost $1.10 in total. The bat costs a dollar more than the ball. How much does the ball cost? Answer: 5 cents.
- Riddle 14: You’re in a dark room with a candle, a wood stove, and a gas lamp. You only have one match, what do you light first? Answer: The match.
- Riddle 15: How much dirt is there in a hole that is 3 feet deep, and 6 inches in diameter? Answer: None
Appendix B
- Participant 1: Gasps, smiles, laughs, nods head, and verbally indicates understanding (e.g., “that makes sense”)
- Participant 2: Nods head
- Emergent understanding is more likely to be present for participant 1
- Participant 1: Nods head enthusiastically and smiles widely
- Participant 2: Nods head slightly and smiles faintly
- Emergent understanding is more likely to be present for participant 1
- Nodding
- Smiling
- Eyebrow(s) raise
- Eyebrows furrow
- Eyes closing
- Eyes widening
- Eyes rolling
- Face scrunching
- Face relaxing
- Head movement
- E.g., Looking upward
- E.g., Tilt to the side
- Surprise
- Interest
- Disgust
- Laughing
- Chuckling
- Sighing
- Gasping
- Kissing teeth
- Providing explanation for the correct answer
- Researcher: “The correct answer is mailbox”
- Participant: “Right, because mailbox starts with m, ends with x, and letters in this case refers to the mail”
- Researcher: “The correct answer is that you would finish in second place because you were in 3rd place just before crossing the finish line”
- Participant: “That makes sense, I am just swapping places with the person in 2nd”
- Participant: “This was a tricky riddle, I did not expect that to be the answer”
- Participant: “My answer was stupid”
- “Ohh”
- “Ahh”
- “Mhmm”
- “That makes sense”
- “I see”
- Hand gesture
- The following body movements:
- Leaning or sitting back in chair
- Slumping over
- Absence of behavioural indicators of understanding:
- Fading of behavioural indicators of understanding:
- Attention shifts from the riddle:E.g., Participant moves on to the next item or page in the surveyNote: Absence or fading of indicators + shift in attention is the strongest indicator.Note: Participant’s may shift their attention from the riddle (e.g., move on to the next item), but behavioural indicators of understanding (e.g., smiling) are present. In these cases, rely on intuition to determine if the behavioural indicator(s) is associated with the understanding that occurred. If it is, identify when the indicator is absent or fades to determine when understanding has ended.
References
- Roumell, E.A.; Walker, J.; Salajan, F.D. Lifelong Learning and Education Policy in North America. In Third International Handbook of Lifelong Learning; Springer: Berlin/Heidelberg, Germany, 2023; pp. 633–654. [Google Scholar]
- Duke, N.K.; Cartwright, K.B. The Science of Reading Progresses: Communicating Advances beyond the Simple View of Reading. Read. Res. Q. 2021, 56, S25–S44. [Google Scholar] [CrossRef]
- Pearson, P.D.; Palincsar, A.S.; Biancarosa, G.; Berman, A.I. Reaping the Rewards of the Reading for Understanding Initiative. Natl. Acad. Educ. 2020. [Google Scholar] [CrossRef]
- Olson, D.R. Making Sense: What It Means to Understand; Cambridge University Press: Cambridge, UK, 2022. [Google Scholar]
- Olson, D.R. Ascribing Understanding to Ourselves and Others. Am. Psychol. 2023. [Google Scholar] [CrossRef] [PubMed]
- Bruner, J.S. The Process of Education. Phys. Teach. 1965, 3, 369–370. [Google Scholar] [CrossRef]
- Damasio, A.R. Descartes’ Error. Emotion, Reason and the Human Brain; Grosset/Putnam 1994: New York, NY, USA, 1994. [Google Scholar]
- Gardner, H.E. Frames of Mind: The Theory of Multiple Intelligences; Basic Books: New York, NY, USA, 2011. [Google Scholar]
- Goleman, D. Emotional Intelligence: Why It Can Matter More than IQ; Bloomsbury Publishing: London, UK, 2020. [Google Scholar]
- Piaget, J. The Construction of Reality in the Child; Routledge: London, UK, 2013. [Google Scholar]
- Vygotsky, L.S. The Collected Works of LS Vygotsky: The Fundamentals of Defectology; Springer Science & Business Media: Berlin, Germany, 1987; Volume 2. [Google Scholar]
- Russell, J.A. A Circumplex Model of Affect. J. Pers. Soc. Psychol. 1980, 39, 1161–1178. [Google Scholar] [CrossRef]
- James, W. The Principles of Psychology. Henry Holt 1890. [Google Scholar] [CrossRef]
- Damasio, A. Feeling & Knowing: Making Minds Conscious; Pantheon Books: New York, NY, USA, 2021. [Google Scholar]
- Ekman, P.; Rosenberg, E.L. What the Face Reveals: Basic and Applied Studies of Spontaneous Expression Using the Facial Action Coding System (FACS); Oxford University Press: New York, NY, USA, 1997. [Google Scholar]
- Barrett, L.F. How Emotions Are Made: The Secret Life of the Brain; Houghton Mifflin Harcourt: Boston, MA, USA, 2017. [Google Scholar]
- Lazarus, R.S. Emotion and Adaptation; Oxford University Press: New York, NY, USA, 1991; Volume 557. [Google Scholar]
- Frijda, N.H. The Emotions; Cambridge University Press: Cambridge, UK, 1986. [Google Scholar]
- Frijda, N.H. The Laws of Emotion; Taylor and Francis: Abingdon, UK, 2007. [Google Scholar] [CrossRef]
- Woodruff, E. AI Detection of Human Understanding in a Gen-AI Tutor. AI 2024, 5, 898–921. [Google Scholar] [CrossRef]
- Damasio, A. Self Comes to Mind: Constructing the Conscious Brain; Pantheon/Random House: New York, NY, USA, 2010. [Google Scholar]
- Kauffman, S.A. The Origins of Order: Self-Organization and Selection in Evolution; Oxford University Press: New York, NY, USA, 1993. [Google Scholar]
- Kauffman, S.A. At Home in the Universe: The Search for Laws of Self-Organization and Complexity; Oxford University Press: New York, NY, USA, 1995. [Google Scholar]
- Stöckli, S.; Schulte-Mecklenbeck, M.; Borer, S.; Samson, A.C. Facial Expression Analysis with AFFDEX and FACET: A Validation Study. Behav. Res. Methods 2018, 50, 1446–1460. [Google Scholar] [CrossRef]
- Fu, G.; Zhou, X.; Wu, S.J.; Nikoo, H.; Panesar, D.; Zheng, P.P.; Oatley, K.; Lee, K. Discrete Emotions Discovered by Contactless Measurement of Facial Blood Flows. Cogn. Emot. 2022, 36, 1429–1439. [Google Scholar] [CrossRef]
- Biggs, J.B.; Collis, K.F. Evaluating the Quality of Learning: The SOLO Taxonomy (Structure of the Observed Learning Outcome); Academic Press: Cambridge, MA, USA, 1982. [Google Scholar]
- Bartlett, F.C. Remembering: A Study in Experimental and Social Psychology; Cambridge University Press: Cambridge, UK, 1932. [Google Scholar]
- Johnson-Laird, P.N. Mental Models: Towards a Cognitive Science of Language, Inference, and Consciousness; Harvard University Press: Cambridge, MA, USA, 1983. [Google Scholar]
- Piaget, J. The Origins of Intelligence in Children; International Universities Press: Madison, CT, USA, 1952. [Google Scholar]
- Vosniadou, S.; Brewer, W.F. Mental Models of the Earth: A Study of Conceptual Change in Childhood. Cognit. Psychol. 1992, 24, 535–585. [Google Scholar] [CrossRef]
- Weinstein, C.E.; Mayer, R.E. The Teaching of Learning Strategies. In Handbook of Research on Teaching; Wittrock, M.C., Ed.; Macmillan: New York, NY, USA, 1986; pp. 315–327. [Google Scholar]
- Barnett, S.M.; Ceci, S.J. When and Where Do We Apply What We Learn? A Taxonomy for Far Transfer. Psychol. Bull. 2002, 128, 612–637. [Google Scholar] [CrossRef] [PubMed]
- Bar-Hillel, M.; Noah, T.; Frederick, S. Learning Psychology from Riddles: The Case of Stumpers. Judgm. Decis. Mak. 2018, 13, 112–122. [Google Scholar] [CrossRef]
- Toplak, M.E.; West, R.F.; Stanovich, K.E. Assessing Miserly Information Processing: An Expansion of the Cognitive Reflection Test. Think. Reason. 2014, 20, 147–168. [Google Scholar] [CrossRef]
- Otamendi, F.J.; Sutil Martín, D.L. The Emotional Effectiveness of Advertisement. Front. Psychol. 2020, 11, 2088. [Google Scholar] [CrossRef]
- Gwet, K.L. Handbook of Inter-Rater Reliability: The Definitive Guide to Measuring the Extent of Agreement among Raters; Advanced Analytics, LLC: Fort Wayne, IN, USA, 2014. [Google Scholar]
- Wongpakaran, N.; Wongpakaran, T.; Wedding, D.; Gwet, K.L. A Comparison of Cohen’s Kappa and Gwet’s AC1 When Calculating Inter-Rater Reliability Coefficients: A Study Conducted with Personality Disorder Samples. BMC Med. Res. Methodol. 2013, 13, 61. [Google Scholar] [CrossRef] [PubMed]
- Dewey, J. Experience and Education. In The Educational Forum; Taylor & Francis: Abingdon, UK, 1986; Volume 50, pp. 241–252. [Google Scholar]
- Vygotsky, L.S.; Cole, M. Mind in Society: Development of Higher Psychological Processes; Harvard University Press: Boston, MA, USA, 1978. [Google Scholar]
- Tamir, D.I.; Thornton, M.A. Modeling the Predictive Social Mind. Trends Cogn. Sci. 2018, 22, 201–212. [Google Scholar] [CrossRef]
- Clark, A. Whatever next? Predictive Brains, Situated Agents, and the Future of Cognitive Science. Behav. Brain Sci. 2013, 36, 181–204. [Google Scholar] [CrossRef]
- Kasneci, E.; Seßler, K.; Küchemann, S.; Bannert, M.; Dementieva, D.; Fischer, F.; Gasser, U.; Groh, G.; Günnemann, S.; Hüllermeier, E.; et al. ChatGPT for Good? On Opportunities and Challenges of Large Language Models for Education. Learn. Individ. Differ. 2023, 103, 102274. [Google Scholar] [CrossRef]
- Ryumina, E.; Dresvyanskiy, D.; Karpov, A. In Search of a Robust Facial Expressions Recognition Model: A Large-Scale Visual Cross-Corpus Study. Neurocomputing 2022, 514, 435–450. [Google Scholar] [CrossRef]
- Janiesch, C.; Zschech, P.; Heinrich, K. Machine Learning and Deep Learning. Electron. Mark. 2021, 31, 685–695. [Google Scholar] [CrossRef]
- Lu, C.; Zong, Y.; Zheng, W.; Li, Y.; Tang, C.; Schuller, B.W. Domain Invariant Feature Learning for Speaker-Independent Speech Emotion Recognition. IEEEACM Trans. Audio Speech Lang. Process. 2022, 30, 2217–2230. [Google Scholar] [CrossRef]
- Fairclough, S.H.; Venables, L. Prediction of Subjective States from Psychophysiology: A Multivariate Approach. Biol. Psychol. 2006, 71, 100–110. [Google Scholar] [CrossRef] [PubMed]
- Zhao, G.; Zhang, H.; Liu, Z.; Zhang, S.; Zhang, Y. Multiscale Convolutional Neural Networks for Affect Recognition Using EEG and Peripheral Physiological Signals. arXiv 2019, arXiv:1911.12918. [Google Scholar]
Phase of Understanding | Observations |
---|---|
Nascent Understanding | 249 |
Misunderstanding | 171 |
Confusion | 200 |
Emergent Understanding | 287 |
Deep Understanding | 338 |
Total Observations | 1245 |
Phase | Riddle | Answer | Think Aloud | Justification | Level of Certainty |
---|---|---|---|---|---|
Nascent | Which month has 28 days? | July | Don’t know | Not sure | 1 (Not certain) |
Misunderstanding | I have two coins that equal thirty cents and one is not a nickel. What two coins do I have? | Quarter and nickel | X + Y = 30 cents... not a nickel…dime x2 = 20... when they say one is not a nickel does this mean the other could be??? | I know it says one is not a nickel, but maybe that means the first one is not a nickel (so maybe a quarter) and then the second one is a nickel... | 2 (Somewhat certain) |
Confusion | You’re in a dark room with a candle, a wood stove, and a gas lamp. You only have one match, what do you light first? | Not sure why I need to light anything | |||
Emergent | I have two coins that equal thirty cents and one is not a nickel. What two coins do I have? | Quarter, nickel | A + B = 30. Coins 25, 15, 10, 1. Ok this isn’t a coins problem. Aha the other is a nickel. | 25 + 5, it said 1 is not a nickel. | 4 (Very certain) |
Deep | You are a cyclist in a cross-country race. Just before crossing the finish line, you overtake the person in second place. In what place did you finish? | Second | Easy—if you pass the person—u take over that person’s place. | 4 (Very certain) |
Phase of Understanding | Training Sets | Test Sets |
---|---|---|
Nascent Understanding | 200 | 49 |
Misunderstanding | 137 | 34 |
Confusion | 160 | 40 |
Emergent Understanding | 230 | 57 |
Deep Understanding | 271 | 67 |
Total Observations | 998 | 247 |
Metric | Nascent (GBM/RF/Lasso/Net/Tree) | Misunderstanding (GBM/RF/Lasso/Net/Tree) | Confusion (GBM/RF/Lasso/Net/Tree) | Emergent (GBM/RF/Lasso/Net/Tree) | Deep (GBM/RF/Lasso/Net/Tree) |
---|---|---|---|---|---|
Precision | 0.84/0.85/0.83/0.83/0.35 | 0.87/0.88/0.86/0.86/0.19 | 0.93/0.94/0.9/0.9/0.6 | 0.97/0.97/0.95/0.95/0.7 | 0.92/0.84/0.93/0.93/0.5 |
Recall | 0.93/0.88/0.93/0.93/0.28 | 0.98/0.94/0.99/0.99/0.06 | 0.95/0.94/0.97/0.97/0.44 | 0.95/0.94/0.92/0.92/0.8 | 0.67/0.76/0.59/0.58/0.69 |
F1 Score | 0.88/0.86/0.88/0.88/0.3 | 0.92/0.9/0.92/0.92/0.2 | 0.94/0.94/0.93/0.93/0.5 | 0.96/0.95/0.94/0.94/0.74 | 0.78/0.8/0.72/0.72/0.54 |
AUC | 0.75/0.72/0.73/0.73/0.66 | 0.7/0.66/0.68/0.68/0.65 | 0.92/0.91/0.83/0.83/0.74 | 0.97/0.97/0.94/0.94/0.88 | 0.8/0.77/0.77/0.77/0.74 |
Metric | Nascent (GBM/RF/Lasso) | Misunderstanding (GBM/RF/Lasso) | Confusion (GBM/RF/Lasso) | Emergent (GBM/RF/Lasso) | Deep (GBM/RF/Lasso) |
---|---|---|---|---|---|
Precision | 0.84/0.84/0.83 | 0.87/0.87/0.86 | 0.93/0.95/0.90 | 0.97/0.97/0.95 | 0.92/0.89/0.93 |
Recall | 0.93/0.93/0.93 | 0.98/0.98/0.99 | 0.94/0.93/0.97 | 0.95/0.94/0.92 | 0.69/0.71/0.59 |
F1 Score | 0.89/0.89/0.88 | 0.92/0.92/0.92 | 0.94/0.94/0.93 | 0.96/0.95/0.94 | 0.79/0.79/0.72 |
AUC | 0.77/0.72/0.73 | 0.68/0.63/0.68 | 0.93/0.92/0.83 | 0.97/0.96/0.94 | 0.79/0.78/0.77 |
Metric | GBM | RF | Lasso | Logistic |
---|---|---|---|---|
Precision | 0.91 | 0.91 | 0.90 | 0.90 |
Recall | 0.87 | 0.88 | 0.85 | 0.86 |
F1 Score | 0.88 | 0.89 | 0.86 | 0.87 |
AUC | 0.84 | 0.82 | 0.80 | 0.79 |
Action Unit | Importance | Nascent | Misunderstanding | Confusion | Emergent | Deep |
---|---|---|---|---|---|---|
Brow Furrow | 38.80 | 0.098 | 0.000 | 0.550 | −0.976 | −0.290 |
Brow Raise | 0.00 | −0.017 | 0.072 | −0.035 | 0.213 | 0.000 |
Cheek Raise | 5.79 | 0.000 | 0.000 | 0.000 | 0.300 | 0.000 |
Chin Raise | 5.01 | 0.000 | 0.000 | 0.221 | 0.000 | −0.227 |
Dimpler | 4.25 | −0.001 | 0.000 | 0.216 | 0.000 | 0.000 |
Eye Closure | 21.75 | 0.044 | 0.081 | −0.328 | −0.201 | 0.000 |
Eye Widen | 7.23 | 0.000 | 0.000 | −0.028 | 0.000 | 0.105 |
Inner Brow Raise | 3.00 | 0.000 | 0.295 | 0.248 | 0.000 | 0.000 |
Jaw Drop | 7.02 | −0.088 | −0.094 | 0.148 | 0.444 | 0.000 |
Lid Tighten | 12.16 | 0.000 | 0.000 | 1.137 | 0.667 | 0.000 |
Lip Corner Depressor | 5.34 | 0.000 | 0.000 | 0.004 | −0.103 | 0.000 |
Lip Press | 6.10 | 0.074 | 0.000 | 0.000 | 0.000 | −0.119 |
Lip Pucker | 1.56 | 0.003 | −0.022 | 0.000 | −0.016 | 0.000 |
Lip Stretch | 0.98 | −0.005 | 0.000 | 0.142 | 0.000 | 0.000 |
Lip Suck | 1.14 | 0.000 | 0.134 | 0.000 | −0.022 | 0.000 |
Mouth Open | 17.82 | 0.000 | −0.228 | 0.121 | 0.710 | −0.160 |
Neutral | 100.00 | 0.537 | 0.303 | 0.000 | −3.735 | −0.447 |
Nose Wrinkle | 0.24 | 0.202 | 0.000 | 0.000 | 0.000 | 0.000 |
Smile | 7.97 | 0.377 | 0.000 | 0.000 | 0.000 | 0.000 |
Smirk | 4.40 | 0.190 | 0.004 | 0.000 | 0.000 | −0.154 |
Upper Lip Raise | 1.05 | 0.220 | 0.000 | 0.000 | 0.000 | 0.000 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Lazic, M.; Woodruff, E.; Jun, J. Decoding Subjective Understanding: Using Biometric Signals to Classify Phases of Understanding. AI 2025, 6, 18. https://doi.org/10.3390/ai6010018
Lazic M, Woodruff E, Jun J. Decoding Subjective Understanding: Using Biometric Signals to Classify Phases of Understanding. AI. 2025; 6(1):18. https://doi.org/10.3390/ai6010018
Chicago/Turabian StyleLazic, Milan, Earl Woodruff, and Jenny Jun. 2025. "Decoding Subjective Understanding: Using Biometric Signals to Classify Phases of Understanding" AI 6, no. 1: 18. https://doi.org/10.3390/ai6010018
APA StyleLazic, M., Woodruff, E., & Jun, J. (2025). Decoding Subjective Understanding: Using Biometric Signals to Classify Phases of Understanding. AI, 6(1), 18. https://doi.org/10.3390/ai6010018