Impacts of Learning Orientation on the Modeling of Programming Using Feature Selection and XGBOOST: A Gender-Focused Analysis
Abstract
:1. Introduction
2. Research Background
2.1. Learning Orientation and Programming
2.2. Gender and Programming
3. Method
3.1. SW Competency Assessment Tool (CAT)
3.2. Learning Orientation Test Tools
3.3. Data Collection and Sampling
3.4. Analysis Methods
3.4.1. Feature Selection
- Mallow’s technique: is the standardized sum of squared error (SSE) estimator of the data. is the number of independent variables selected for model prediction; n is the number of samples; and is the residual sum of squares for all features. The lower the is, the better the model; the formula is presented below.
- AIC technique: AIC is calculated using Equation (2) for the number of features p selected for model prediction. As p increases, compared to , the penalty becomes bigger, and when the sample n is different, it becomes inaccurate. The lower the AIC, the better the model.
- BIC technique: The BIC for p independent variables is shown in Equation (3). It is similar to AIC, but by modifying the last term, the disadvantage of AIC, which becomes inaccurate when sample n becomes large, is complemented. As with AIC, it is better when the BIC becomes lower.
- Adjusted technique: As the number of features increases, the number of samples also increases. At this point, the explanatory power unconditionally increases. Adjusted is the applied penalty, as the number of samples (n) is reflected, as shown in Equation (4). It has a more meaningful value by receiving a penalty according to the number of samples and becomes a better model as adjusted becomes higher.
3.4.2. XGBOOST
4. Results
4.1. Average Analysis Result of Learning Orientation by Factor
4.2. Mean Analysis Result by the Factor of SW Competency and Learning Orientation
4.3. Feature Selection Results Influencing Modeling
4.4. XGBOOST Result Applying Feature Selection
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References and Note
- Alejos, A.V.; Sanchez, M.G.; Milagros, M.P.; Falcone, F.; Sanchís, P.; López-Martín, A. The influence of gender in the adoption of engineering studies. In Proceedings of the 2014 XI Tecnologias Aplicadas a la Ensenanza de la Electronica (Technologies Applied to Electronics Teaching) (TAEE), Bilbao, Spain, 11–13 June 2014; pp. 1–4. [Google Scholar]
- Glass, R.L. The gender gap: Is it a computing problem or simply a computer science problem? IEEE Softw. 2012, 29, 87–88. [Google Scholar] [CrossRef]
- Kim, Y.; Kim, J. Development and application of software education program of app Inventor utilization for improvement of elementary school girls’ computational thinking. J. Korean Assoc. Inf. Educ. 2015, 19, 31–44. [Google Scholar]
- Lee, J.; Jung, Y.; Park, H. Gender differences in computational thinking, creativity, and academic interest on elementary SW education. J. Korean Assoc. Inf. Educ. 2017, 21, 381–391. [Google Scholar]
- Jang, H.; Kim, J.; Lee, W. Analysis of the recognition of Information subjects related to future profession of high school girls. J. Korean Assoc. Comput. Educ. 2018, 21, 21–29. [Google Scholar]
- Lee, Y.W.; Jin, S.U. Development of vocational cards for female college students’ career exploration activity. Korean J. Woman Psychol. 2011, 16, 135–157. [Google Scholar]
- OECD. Organisation for Economic Co-Operation and Development 2018.
- Yang, H.J.; Kim, J.M.; Lee, W.G. Analysis of algorithm and programming level according to gender of high school students. J. Learn.-Cent. Curric. Instr. 2020, 20, 185–202. [Google Scholar] [CrossRef]
- Lee, H.K. Learning Psychology; Tamgudang: Yongsan-gu, Seoul, 1984. [Google Scholar]
- Kim, H.S. Gender difference of social studies’ academic achievement of Korea school students: On the result of 2009 NAEA. J. Korean Assoc. Geogr. Environ. Educ. 2011, 19, 67–87. [Google Scholar]
- Kang, S.H. Predictors of academic achievement and dropout thinking among university students. J. Educ. Eval. 2010, 23, 29–53. [Google Scholar]
- Lee, B.J.; Song, M.Y. Gender differences in mathematics-related attitudes in National Assessment of Educational Achievement. J. Korean Sch. Math. Soc. 2011, 14, 65–84. [Google Scholar]
- Hilde, G.C.; Tveranger, F. Programming in secondary schools in Norway: A wasted opportunity for inclusion. In Proceedings of the 4th Conference on Gender & IT (GenderIT’18), Heilbronn, Germany, 14–15 May 2018; Association for Computing Machinery: New York, NY, USA, 2018; pp. 175–182. [Google Scholar] [CrossRef] [Green Version]
- Stein, J.; Lajmi, L. The Impact of Self-efficacy and Gender on Computer Performance. In HCI International 2019–Late Breaking Posters. HCII 2019; Stephanidis, C., Antona, M., Eds.; Communications in Computer and Information Science; Springer: Cham, Switzerland, 2019; Volume 1088. [Google Scholar] [CrossRef]
- Yu, B.G.; Kim, J.M.; Lee, W.G. Analysis of the impact of learner characteristics on the achievement of programming. J. Korean Assoc. Comput. Educ. 2014, 17, 15–24. [Google Scholar]
- Papastergiou, M. Digital game-based learning in high school computer science education: Impact on educational effectiveness and student motivation. Comput. Educ. 2009, 52, 1–12. [Google Scholar] [CrossRef]
- Atmatzidou, S.; Demetriadis, S. Advancing students’ computational thinking skills through educational robotics: A study on age and gender relevant differences. Robot. Auton. Syst. 2016, 75, 661–670. [Google Scholar] [CrossRef]
- Rusk, N.; Resnick, M.; Berg, R.; Pezalla-Granlund, M. New pathways into robotics: Strategies for broadening participation. J. Sci. Educ. Technol. 2008, 17, 59–69. [Google Scholar] [CrossRef] [Green Version]
- Baser, M. Attitude, gender and achievement in computer programming. Middle East J. Sci. Res. 2013, 14, 248–255. [Google Scholar] [CrossRef]
- Cai, Z.; Fan, X.; Du, J. Gender and attitudes toward technology use: A meta-analysis. Comput. Educ. 2017, 105, 1–13. [Google Scholar] [CrossRef]
- Noh, J.; Lee, J. Effects of robotics programming on the computational thinking and creativity of elementary school students. Educ. Technol. Res. Dev. 2020, 68, 463–484. [Google Scholar] [CrossRef]
- Kim, S.; Lee, Y. Computational thinking of middle school students in Korea. J. Korea Soc. Comput. Inf. 2020, 25, 229–241. [Google Scholar]
- Akinola, S.O. Computer programming skill and gender difference: An empirical study. Am. J. Sci. Ind. Res. 2015, 7, 1–9. [Google Scholar]
- Zhong, B.; Wang, Q.; Chen, J. The impact of social factors on pair programming in a primary school. Comput. Hum. Behav. 2016, 64, 423–431. [Google Scholar] [CrossRef]
- Keefe, J.W. Assessment of Learning Style Variables: The NASSP Task Force Model. Theory Pract. 1985, 24, 138–144. [Google Scholar] [CrossRef]
- Kolb, D.A. Experiential Learning: Experience as the Source of Learning and Development; FT Press: Upper Saddle River, NJ, USA, 2014. [Google Scholar]
- Smith, P.L.; Ragan, T.J. Instructional Design; John Wiley & Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
- Gregorc, A.F.; Ward, H.B. A new definition for individual. Nassp Bull. 1977, 61, 20–26. [Google Scholar] [CrossRef]
- Tseng, J.C.; Chu, H.C.; Hwang, G.J.; Tsai, C.C. Development of an adaptive learning system with two sources of personalization information. Comput. Educ. 2008, 51, 776–786. [Google Scholar] [CrossRef]
- Bishop-Clark, C.; Wheeler, D.D. The Myers-Briggs personality type and its relationship to computer programming. J. Res. Comput. Educ. 1994, 26, 358–370. [Google Scholar]
- Bishop-Clark, C. Cognitive style and its effect on the stages of programming. J. Res. Comput. Educ. 1995, 27, 373–386. [Google Scholar] [CrossRef]
- Stevens, D. Cognitive processes and success of students in instructional computer courses. AEDS J. 1983, 16, 228–233. [Google Scholar] [CrossRef]
- Thomas, L.; Ratcliffe, M.; Woodbury, J.; Jarman, E. Learning styles and performance in the introductory programming sequence. In Proceedings of the 33rd SIGCSE Technical Symposium on Computer Science Education, Cincinnati, KY, USA, 27 February–3 March 2002. [Google Scholar]
- Davidson, G.V.; Savenye, W.C.; Orr, K.B. How do learning styles relate to performance in a computer applications course? J. Res. Comput. Educ. 1992, 24, 348–358. [Google Scholar] [CrossRef]
- Drysdale, M.T.B.; Ross, J.L.; Schulz, R.A. Cognitive learning styles and academic performance in 19 first-year university courses: Successful students versus students at risk. J. Educ. Stud. Placed Risk 2001, 6, 271–289. [Google Scholar] [CrossRef]
- Ross, J.L.; Drysdale, M.T.; Schulz, R.A. Cognitive learning styles and academic performance in two postsecondary computer application courses. J. Res. Comput. Educ. 2001, 33, 400–412. [Google Scholar] [CrossRef]
- Lee, S.; Kim, J.; Lee, W. Analysis of factors affecting achievement in maker programming education in the age of wireless communication. Wirel. Pers. Commun. 2017, 93, 187–209. [Google Scholar] [CrossRef] [Green Version]
- You, I.; Kim, T. The effects of MINDSTORMS programming instruction on the creativity. J. Korean Assoc. Comput. Educ. 2006, 9, 49–59. [Google Scholar]
- Fuchs, L. Closing the gender gap: Girls and computers. In Proceedings of the Florida Instructional Computing Conference, (ERIC ED 271 103), Orlando, FL, USA, 21–24 January 1986. [Google Scholar]
- Muira, I. Understanding gender differences in middle school computer interest and use. In Proceedings of the Annual Meeting of the American Educational Research Association, San Francisco, CA, USA, 16–20 April 1986. [Google Scholar]
- Reed, W.; Overbaugh, R. The effects of prior experience and instructional format on teacher education students’ computer anxiety and performance. Comput. Sch. 1993, 9, 75–89. [Google Scholar] [CrossRef]
- Farina, F. Predictors of computer anxiety. Comput. Hum. Behav. 1991, 7, 269–291. [Google Scholar] [CrossRef]
- Okebukola, P. The gender factor in computer anxiety and interest among some Australian high school students. Educ. Res. 1993, 35, 181–199. [Google Scholar] [CrossRef]
- Park, S.; Park, J. Sexual difference analysis on application level of computer programming language. J. Gend. Stud. 1998, 2, 47–68. [Google Scholar]
- Song, J.; Paik, S.; Lee, T. The effect of robot programming learning considered gender differences on female middle school student’s flow level and problem solving ability. J. Korean Assoc. Comput. Educ. 2009, 12, 45–55. [Google Scholar]
- Lau, W.W.F.; Yuen, A.H.K. Exploring the effects of gender and learning styles on computer programming performance: Implications for programming pedagogy. Br. J. Educ. Technol. 2009, 40, 696–712. [Google Scholar] [CrossRef]
- Kim, Y.-J.; Lee, S.-A.; Kim, D.-H.; Chae, J.-W.; Ham, H.-S.; Cho, H.C.; Cho, H.-C. Machine learning based gastric cancer computer-aided diagnosis system using feature selection. Trans. Korean Inst. Electr. Eng. 2020, 69, 170–176. [Google Scholar]
- Kolb, D.A. Learning Styles and Disciplinary Differences; Jossey Bass: San Francisco, CA, USA, 1981; pp. 233–255. [Google Scholar]
- Gregorc, A.F. An Adults Guide to Style; Gregorc Associates, Inc.: Columbia, CT, USA, 1982. [Google Scholar]
- Solomon, B.; Felder, R. Index of Learning Styles; North Carolina State University: Raleigh, NC, USA, 1999. [Google Scholar]
- Knapp, T.R. Coefficient alpha: Conceptualizations and anomalies. Res. Nurs. Health 1991, 14, 457–480. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the KDD’16, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Rogati, M.; Yang, Y. High-performing feature selection for text classification. In Proceedings of the Eleventh International Conference on Information and Knowledge Management, McLean, VA, USA, 4–9 November 2002. [Google Scholar]
- Kim, H.; Kim, J.; Han, J. Exploring cryptocurrency influence factors Using Feature Selection Algorithm. Korea Internet Electron. Commer. Assoc. 2019, 19, 185–197. [Google Scholar]
- Lee, S. Communication and the contents of Korean language education. Korean Lang. Educ. Res. 1994, 4, 67–83. [Google Scholar]
- Ying, K.M.; Pezzullo, L.G.; Ahmed, M.; Crompton, K.; Blanchard, J.; Boyer, K.E. In Their Own Words: Gender Differences in Student Perceptions of Pair Programming. In Proceedings of the 50th ACM Technical Symposium on Computer Science Education (SIGCSE), Minneapolis, MN, USA, 27 February–2 March 2019; pp. 1053–1059. [Google Scholar]
Step | Div. | Description |
---|---|---|
1 | SW CAT Analysis | Derive SW competency and set the tool development directions
|
2 | First Expert Review | Two professors (one in computer science and one in computer education) Review the derived SW competency Review tool development direction |
3 | SW CAT Development | Develop questions considering the first expert review |
4 | Second Expert Review | Two professors (one in computer science and one in computer education) Review the developed items |
5 | Revision and Supplementation of Questions | Revise and supplement the questions considering the second expert review |
6 | Conduct a Pilot Test | Test conducted on 72 students from two elementary schools |
7 | Third Expert Review | Teachers, by school level (four elementary school teachers, two middle school teachers, and four high school teachers) Review the composition, expression, and difficulty of the questions |
8 | Pilot Test Result Analysis | Group score analysis Question analysis |
9 | Revision and Supplementation of Questions | Revision/supplementation of questions based on pilot test results and content review by field teachers |
10 | Development Completed | SW CAT development completed |
SW Competency | Question No. | Difficulty (0.62) | Discrimination (0.462) |
---|---|---|---|
1. Analysis | 1.1 Data Collection | 0.35 | 0.404 |
1.2 Data Analysis | 0.34 | 0.412 | |
1.3 Data Collection | 0.84 | 0.389 | |
1.4 Data representation | 0.78 | 0.270 | |
1.5 Data representation | 0.55 | 0.421 | |
1.6 Data Analysis | 0.73 | 0.483 | |
2. Modeling | 2.1 Problem Decomposition | 0.36 | 0.274 |
2.2 Algorithms | 0.70 | 0.588 | |
2.3 Algorithms | 0.73 | 0.549 | |
2.4 Abstraction | 0.52 | 0.420 | |
2.5 Problem Decomposition | 0.51 | 0.510 | |
2.6 Algorithms | 0.57 | 0.481 | |
3. Implementation | 3.1 Automation | 0.15 | 0.250 |
3.2 Automation | 0.13 | 0.252 | |
3.3 Testing | 0.48 | 0.477 | |
3.4 Testing | 0.36 | 0.517 | |
4. Generalization | 4.4 Application and generalization | 0.53 | 0.484 |
Factor | Elements | Cronbach’s α |
---|---|---|
Research method | LearningType1. When I find something I don’t understand, I try to research it in several ways, including looking up a dictionary or doing an internet search. | 0.773 |
LearningType2. I try to find out something I don’t know by reading books (except textbooks, reference books, comic books, or magazines) | ||
LearningType3. When I research, I try to collect as much as data I can to find out what I am looking for. | ||
Analysis method | LearningType4. When it is hard to understand by reading sentences, I try to express information through figures or tables. | 0.750 |
LearningType5. I compare collected data to find the commonalities and differences. | ||
LearningType6. I think differently from others or try to think in my own way. | ||
Thought arrangement and decision-making methods | LearningType7. I think carefully about whether the story I heard or the data I collected is true or not. | 0.862 |
LearningType8. When there are different opinions, I listen to both sides carefully and decide by myself which one is right. | ||
LearningType9. I create a new thing or add my own thoughts based on my research. | ||
LearningType10. When I talk about my thoughts or opinions in front of my friends, I try to organize what I want to say. | ||
LearningType11. When I find a problem, I try to think of a solution first and then make a suggestion. | ||
LearningType12. I try to work with my friend to study or teach each other. | ||
Reflection method | LearningType13. I try to evaluate what I liked or did not like after learning or experiencing. | - |
Div. | Feature Selection | XGBOOST | |
---|---|---|---|
No. of Data | Male: 756 Female: 688 | Training Data Male: 604 Female: 552 | Test Data Male: 152 Female: 136 |
Data Characteristics | Dependent variable: Modeling Independent variable: 3 SW competencies excluding modeling, 13 learning types | Dependent variable: Modeling Independent variable: High-influence characteristics extracted from the feature selection technique |
Factor | Male Students | Female Students | ||||
---|---|---|---|---|---|---|
High-Achieving Group | Low-Achieving Group | t-Value | High-Achieving Group | Low-Achieving Group | t-Value | |
Mean (Std.) | Mean (Std.) | Mean (Std.) | Mean (Std.) | |||
Research Method | 3.69 (0.88) | 3.31 (0.84) | 5.657 *** | 3.80 (0.79) | 3.41 (0.93) | 5.903 *** |
Analysis Method | 3.31 (0.89) | 3.15 (0.85) | 2.294 * | 3.35 (0.80) | 3.16 (0.88) | 2.830 ** |
Thought Organization and Decision-Making Method | 3.67 (0.73) | 3.40 (0.79) | 4.720 *** | 3.78 (0.69) | 3.38 (0.79) | 6.634 *** |
Reflection Method | 3.29 (1.11) | 3.27 (1.10) | 0.182 | 3.44 (0.97) | 3.26 (1.03) | 2.294 * |
Div. | Male Students | Female Students | |||
---|---|---|---|---|---|
High-Achieving Group | Low-Achieving Group | High-Achieving Group | Low-Achieving Group | ||
Mean (Std.) | Mean (Std.) | Mean (Std.) | Mean (Std.) | ||
SW Competency | Analysis | 73.31 (18.98) | 41.60 (20.17) | 72.14 (18.60) | 42.02 (21.07) |
Modeling | 88.80 (7.83) | 11.00 (7.91) | 88.44 (7.69) | 11.61 (7.68) | |
Implementation | 43.96 (27.75) | 17.08 (20.53) | 37.89 (27.88) | 15.49 (19.87) | |
Generalization | 72.23 (44.83) | 35.91 (48.07) | 69.38 (46.14) | 28.63 (45.30) | |
Learning Type | 1 | 3.88 (1.02) | 3.5 (1.08) | 4.06 (0.88) | 3.67 (1.09) |
2 | 3.38 (1.11) | 3.05 (1.03) | 3.47 (1.04) | 3.21 (1.09) | |
3 | 3.81 (1.05) | 3.38 (1.02) | 3.87 (0.92) | 3.34 (1.03) | |
4 | 3.18 (1.14) | 3.1 (1.06) | 3.3 (1.04) | 3.15 (1.04) | |
5 | 3.13 (1.11) | 3 (1.05) | 3.2 (0.94) | 3.03 (1.08) | |
6 | 3.62 (1.01) | 3.36 (1.05) | 3.55 (0.98) | 3.29 (1.06) | |
7 | 3.67 (0.97) | 3.41 (1.05) | 3.7 (0.9) | 3.41 (0.96) | |
8 | 3.76 (0.96) | 3.46 (1.06) | 3.82 (0.87) | 3.46 (0.98) | |
9 | 3.62 (0.98) | 3.4 (1) | 3.69 (0.91) | 3.32 (0.98) | |
10 | 3.63 (0.94) | 3.37 (1.02) | 3.85 (0.9) | 3.32 (1.06) | |
11 | 3.63 (0.95) | 3.32 (0.96) | 3.72 (0.91) | 3.29 (1.01) | |
12 | 3.7 (1) | 3.42 (1.02) | 3.93 (0.89) | 3.49 (1.01) | |
13 | 3.29 (1.11) | 3.27 (1.1) | 3.44 (0.97) | 3.26 (1.03) |
Div. | Male Students | Female Students | ||
---|---|---|---|---|
No. of Features | Order of Influence | No. of Features | Order of Influence | |
6 | Analysis Implementation Generalization LearningType3 LearningType4 LearningType11 | 9 | Analysis Generalization Implementation LearningType11 LearningType3 LearningType2 LearningType13 LearningType12 LearningType5 | |
AIC | ||||
BIC | 3 | Analysis Implementation Generalization | 4 | Analysis Generalization Implementation LearningType11 |
10 | Analysis Implementation Generalization LearningType3 LearningType4 LearningType11 LearningType1 LearningType13 LearningType8 LearningType9 | 11 | Analysis Generalization Implementation LearningType11 LearningType3 LearningType2 LearningType13 LearningType12 LearningType5 LearningType10 LearningType9 |
Gender | Div. (No. of Features) | Accuracy (%) |
---|---|---|
Male | Not Applied Feature Selection (16) | 76.97 |
(6) and AIC (6) | 80.26 | |
BIC (3) | 76.97 | |
(10) | 78.29 | |
Female | Not Applied Feature Selection (16) | 81.16 |
(6) and AIC (9) | 81.88 | |
BIC (4) | 79.71 | |
(11) | 81.56 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Woo, H.; Kim, J.-M. Impacts of Learning Orientation on the Modeling of Programming Using Feature Selection and XGBOOST: A Gender-Focused Analysis. Appl. Sci. 2022, 12, 4922. https://doi.org/10.3390/app12104922
Woo H, Kim J-M. Impacts of Learning Orientation on the Modeling of Programming Using Feature Selection and XGBOOST: A Gender-Focused Analysis. Applied Sciences. 2022; 12(10):4922. https://doi.org/10.3390/app12104922
Chicago/Turabian StyleWoo, HoSung, and Ja-Mee Kim. 2022. "Impacts of Learning Orientation on the Modeling of Programming Using Feature Selection and XGBOOST: A Gender-Focused Analysis" Applied Sciences 12, no. 10: 4922. https://doi.org/10.3390/app12104922
APA StyleWoo, H., & Kim, J.-M. (2022). Impacts of Learning Orientation on the Modeling of Programming Using Feature Selection and XGBOOST: A Gender-Focused Analysis. Applied Sciences, 12(10), 4922. https://doi.org/10.3390/app12104922