Next Article in Journal
Affective Experiences of U.S. School Personnel in the Sociopolitical Context of 2021: Reflecting on the Past to Shape the Future
Next Article in Special Issue
Trends and Gender Disparities in Grades and Grade Anomalies among Bioscience and Health-Related Major Students before, during, and after COVID-19 Remote Instruction
Previous Article in Journal
Examining TPACK Enactment in Elementary Mathematics with Various Learning Technologies
Previous Article in Special Issue
Empowering Female High School Students for STEM Futures: Career Exploration and Leadership Development at Scientella
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Bioscience Students in Physics Courses with Higher Test Anxiety Have Lower Grades on High-Stakes Assessments and Women Report More Test Anxiety than Men

by
Alysa Malespina
,
Fargol Seifollahi
* and
Chandralekha Singh
Department of Physics & Astronomy, University of Pittsburgh, Pittsburgh, PA 15260, USA
*
Author to whom correspondence should be addressed.
Educ. Sci. 2024, 14(10), 1092; https://doi.org/10.3390/educsci14101092
Submission received: 5 August 2024 / Revised: 1 October 2024 / Accepted: 4 October 2024 / Published: 8 October 2024
(This article belongs to the Special Issue Gender and STEM Education)

Abstract

:
Test anxiety is beginning to be recognized as a significant factor affecting student performance in science, technology, engineering, and mathematics (STEM) courses, potentially contributing to gender inequity within these fields. Additionally, the management of test anxiety can improve self-efficacy, which is a construct that has been well studied in the physics context. In this study, we investigated the relationship between self-efficacy, test anxiety, and gender differences in performance in a two-semester-long introductory physics course sequence for bioscience students in which women outnumber men. Using validated survey data and grade information from students in a two-semester introductory physics course sequence, we compared the predictive power of self-efficacy and test anxiety on female and male students’ performance on both low- and high-stakes assessments. We found that there were gender differences disadvantaging women in self-efficacy and test anxiety in both Physics 1 and Physics 2, as well as gender differences in high-stakes outcomes in Physics 1. There were no gender differences in low-stakes assessment scores. We also found that self-efficacy and test anxiety predicted high-stakes (but not low-stakes) assessment outcomes in both Physics 1 and Physics 2. Comparison of these findings with prior studies involving physical science and engineering students shows that although women outnumber men in physics courses for bioscience students and the career goals of bioscience students are very different from the earlier researched group, most of the negative trends hold even for this new population. Thus, these findings, in a new context involving bioscience students in physics courses, are very important because they reinforce the systemic nature of women being affected more adversely by anxiety in high-stakes assessments, which is a threat to creating equitable and inclusive learning environments. An important implication is that course instructors should carefully consider how high-stakes and low-stakes assessments are used to determine grades and how to create an overall equitable, inclusive, and low-anxiety learning environment.

1. Introduction and Theoretical Framework

Students’ science, technology, engineering, and mathematics (STEM)-related motivational beliefs have implications for their performance in individual courses as well as their long-term outcomes [1,2,3,4,5]. In particular, students’ motivational beliefs are correlated with their goals and tend to predict student learning outcomes [1,2,3,4,5,6,7]. There also tend to be differences between men’s and women’s motivational beliefs regarding physics, which have been linked to performance differences in physics courses [4,8,9,10,11]. Here, we focus on two motivational factors: test anxiety and self-efficacy. Test anxiety can affect students’ test performance and is more likely to affect women [12]. Self-efficacy in a given domain is a student’s belief in their ability to succeed at an activity or subject or complete a task [13].
Test anxiety can impact students’ cognition, physical body, and behavior [12,14]. When they experience test anxiety, students’ cognitive resources are not entirely devoted to the assessment, but can be taken up by worry and intrusive thoughts of failure [12]. Additionally, test anxiety can affect how students feel during an assessment. For example, they may experience a fast heartbeat or “butterflies in their stomach”. The behavioral aspect of test anxiety manifests in avoidance techniques, such as procrastination or interacting only with surface-level feedback after the exam (e.g., not examining mistakes closely to make a plan for future improvement) [12,15]. Other studies have found that test anxiety negatively affects student performance, especially on high-stakes assessments such as exams that are heavily weighted [16,17,18]. In addition, women are more likely to report test anxiety than men [12,13], so understanding test anxiety and how to minimize its effect on student success is vital in creating positive and equitable learning environments.
Self-efficacy [13,19] has been linked to positive learning outcomes for physics students [4,5,7,11]. Self-efficacy of students in a particular domain can be enhanced through several mechanisms. One way is by overcoming difficulties, such as succeeding in a challenging homework assignment [19]. Self-efficacy can also be formed through social means, for example, through observation of role models succeeding in the domain of interest or through receiving encouragement such that it allows for students to measure their success through personal improvement [19]. The final mechanism is regulation of emotional states, such as management of anxiety [19]. Women may have fewer role models due to under-representation of women in physics [20,21], and they are less likely to receive encouragement that they can succeed in physics from their instructors and peers [22,23]. Because high self-efficacy allows students to develop coping mechanisms that could reduce test anxiety, we hypothesize that students with high self-efficacy are also likely to have low test anxiety [13].
In this research, we aim to investigate whether test anxiety and/or self-efficacy predict low- and high-stakes assessment outcomes for women and men in a novel context: a two-semester physics course sequence for bioscience students in which women outnumber men. Here, low-stakes assessments are those that individually make up a small portion of a student’s grade, such as homework. High-stakes assessments are individual assessments that make up a large portion of a student’s grade such as traditional exams [16,24,25]. For example, while weekly homework assignments that add up to 10% of the total grade (with each of them counting toward less than one percent of the grade) are considered as low-stakes assessments, a heavily weighted final exam that individually accounts for 25% of the total grade is considered as a high-stakes assessment. Prior research in a physics course for physical science and engineering students and students in other STEM courses has shown that gender gaps are more prevalent in high-stakes than low-stakes assessments [16,24,25]. This, combined with gender differences in physics self-efficacy and test anxiety among physical science and engineering students [18], led us to hypothesize that physics test anxiety and self-efficacy may predict high-stakes but not low-stakes physics assessment performance and may more adversely affect women even in physics courses in which they outnumber men. Prior research has found that there is a relationship between test anxiety and high-stakes assessment grades in biology classrooms [16]. Additionally, past research shows a relationship between self-efficacy, test anxiety, and high-stakes physics grades as well as gender differences disadvantaging women for students enrolled in introductory physics courses for engineering and physical science majors in which women are under-represented [18]. Past research shows that in general, bioscience students differ from physical science and engineering majors in both interests and motivations [26]. Therefore, it is very important to investigate how systemic these inequitable trends are across different contexts, e.g., for an entirely different student population within a given STEM discipline such as physics.
It is important to investigate whether the numerical under-representation of women in some physics courses is the main factor contributing to gender differences. For example, in physics, there are certain stereotypes about who can excel in the discipline and that one needs to be a genius to do well, which may disproportionately affect women and lead to greater anxiety than men even in physics courses in which they outnumber men. Thus, here we focus on these issues related to physics test anxiety and self-efficacy in high-stakes and low-stakes assessment for students who have never been the focus of this type of investigation in the past: female and male students in introductory physics for bioscience and health science-related majors where women are in the majority. Comparison of our findings, discussed in detail in the Results Section, with prior studies involving physical science and engineering students shows that although women outnumber men in physics courses for bioscience students and the career goals of bioscience students are very different from the earlier researched group, the trends hold even for this new population [18,27,28]. Thus, the findings discussed here in the context of physics courses, involving bioscience students and other students with interest in health-related majors, are very important because they emphasize the deep-rooted nature of women being affected more adversely by anxiety in high-stakes assessments, which is a major impediment to creating equitable and inclusive learning environments.
Students pursuing bioscience and health science-related majors are generally required to take at least one physics course for their major (and many of them are required to take two physics courses). Women are not under-represented in these physics courses for bioscience and health science-related majors, but there may still be a gender gap in the motivational belief scores of students in the course. In particular, prior research has found that even in physics courses in which women are not under-represented, men tend to have higher grades and physics-specific motivational beliefs than women [10,29,30,31,32,33,34,35]. For example, women tend to have lower physics self-efficacy than men with the same grades in courses for engineering and physical science students as well as courses for students with interest in bioscience and health science-related professions [9,36].
Our goals for this research were to investigate the relationship between test anxiety and assessment outcomes in introductory physics courses, with a focus on gender differences in each construct, in this novel context of students in bioscience and health science-related majors. We hypothesized that test anxiety would predict high-stakes but not low-stakes assessment outcomes among students. We also hypothesized that despite women outnumbering men in these physics courses, women would still be more likely to experience higher levels of test anxiety partly due to the stereotype that physics requires innate genius and because women tend to stay away from disciplines linked to exceptional innate ability [37]. We included self-efficacy in our investigation because the relationship between performance and self-efficacy is well documented in physics [11,38] and because management of anxiety is explicitly mentioned as a mechanism to enhance self-efficacy [19].

Research Questions

With this background and goals in mind, we aim to answer the following research questions in the novel context of a two-semester physics course sequence for bioscience students and students interested in health professions:
RQ1.
Are there gender differences in students’ prior preparation, self-efficacy, or test anxiety?
RQ2.
Are there gender differences in students’ low- or high-stakes assessment scores?
RQ3.
Are there gender differences in terms of low- or high-stakes assessment scores for students with the same self-efficacy and/or test anxiety?

2. Methodology

2.1. Participants and Procedures

This study took place at a large research university in the United States. Participants were students enrolled in a Physics 1 or 2 course for bioscience and other health-related majors. Intended majors for students in our sample, aside from biological sciences, include but are not limited to microbiology, neuroscience, molecular biology, chemistry with bioscience option, etc., where the common theme across these students is their interest in health-related professions. Previous research has shown that for those aiming for health-related careers and enrolled in science courses, there are various pathways that can lead to or away from their initially intended majors [39]. Therefore, the many minor distinctions within these majors make it challenging to analyze them separately with sufficient statistical power. We will use the term “bioscience majors” throughout for all students since they comprise the majority of students in both Physics 1 and Physics 2 and both these courses are mandatory for them. Students can only advance to Physics 2 if they have at least a C grade in Physics 1, equivalent to a 2.0 out of a 4.0 point grade scale at this institution, with 3.0 corresponding to a B grade and 4.0 corresponding to an A grade. Some of the low-enrollment health-related majors only require Physics 1, so students may not take Physics 2 unless they want to take the MCAT exam for medical school admission.
The Physics 1 course primarily covered mechanics, though both thermodynamics and waves were also included. The Physics 2 course covered electricity and magnetism, geometrical optics, and physical optics. The courses included 2 weekly sessions of 75 min traditional lecture-based instruction led by the course instructors, along with smaller-sized 50 min recitation sessions taught by teaching assistants in which students worked collaboratively on solving physics problems. These courses did not include a laboratory component, although many of the students taking them end up taking laboratory courses as an elective later. The Physics 1 and Physics 2 student samples included sections taught by the same instructor, although the students were not necessarily the same across the two courses. This helped us ensure that no instructor-level effects went unaccounted for in our models. For both courses, midterm and final exams comprised 40% and 25% of the final course grade, respectively. We considered the midterm and final exams as high-stakes assessment as they were each highly weighted and altogether made up approximately two-thirds of the final course grade. Another 10% of the course grade was determined by homework problems which were assigned weekly for students to complete at their own time, and the remaining 25% were participation grades based on completeness for students responding to clicker questions in the lectures or their work in groups on problems in the recitation sessions. Generally, we consider homework and participation as low-stakes assessments as each of them individually take up a small portion of the total grade. For the analysis in this paper, however, we only used students’ homework grades as the low-stakes outcome, as those were the only low-stakes assessments that were graded for correctness and not completeness. Throughout this paper, we use the terms “test” and “exam” interchangeably. All the midterm exams and final exams for the Physics 1 and Physics 2 courses were supervised, timed, and in-person, and while we mainly associate the exams being high-stakes with the grading weight, we recognize that these elements can also contribute to the anxiety levels in students. Midterm and final exams were interspersed uniformly throughout the semester, with midterm exams held around the one-, two-, and three-month marks after the start of classes. All exams were mainly multiple choice, with 1–3 open-ended questions.
Students were given extra credit as an incentive for taking the survey. The surveys were given during the first and last week of classes in the mandatory teaching assistant-led recitations. We call the first and final data sets “pre” and “post”, respectively. In Physics 1, 204 students took the pre-test and 210 took the post-test. In Physics 2, 185 students took the pre-test and 89 took the post-test. The post-tests were taken in the final week of the classes, before students took their final exam.
For analysis, we included only students who successfully passed an attention check on the survey (a question that requested the students to select option “C”). Additionally, we included as many students as possible in each part of the analysis. For example, in a model that uses the average of one construct as well as students’ standardized test scores, we would exclude students who were missing SAT and ACT scores or were missing either pre- or post-survey results. One Physics 2 class section was not able to complete the post-survey and were missing post-test anxiety and self-efficacy data. This resulted in a smaller sample size for post-test anxiety and self-efficacy, but students in this section had statistically indistinguishable prior preparation, pre-motivational factors, and assessment outcomes from other students in the sample, so they were included in analysis where possible.
This research was carried out in accordance with the principles outlined in this institution’s Institutional Review Board ethical policy, and de-identified demographic data were provided through university records. For some variables, such as high school GPA, this approach allows us to rely on records that may be more accurate than students’ own recollection. However, it limits other measures such as student gender, for which students could only report either “male” or “female”. We acknowledge the potential limitations and harms that this method of data collection may cause [40]. This institution recently began to implement more inclusive gender reporting methods for students, which are planned to be used once student samples are large enough to be meaningful for quantitative analysis. Demographic data indicated our Physics 1 sample was 67% women and our Physics 2 sample was 56% women. Students in Physics 1 identified with the following races/ethnicities: 66% White, 17% Asian, 7% African American/Black, 6% multiracial, 3% Hispanic/Latinx, and 1% unspecified. Students in Physics 2 identified with the following races/ethnicities: 63% White, 21% Asian, 6% African American/Black, 6% multiracial, 3% Hispanic/Latinx, and 1% unspecified.

2.2. Measures

2.2.1. Self-Efficacy and Test Anxiety

All test anxiety and self-efficacy survey items can be found in Table 1. The test anxiety survey questions were adapted from the previously validated Motivated Strategies for Learning Questionnaire [41,42]. To ensure we were measuring domain-specific constructs, we explicitly mentioned physics in the survey items, as seen in Table 1. For example, “I have an uneasy, upset feeling when I take an exam” became “I have an uneasy, upset feeling when I take a physics test”. Self-efficacy survey questions were constructed from other surveys and were previously validated [43]. Test anxiety items were on a five-point Likert scale (1—Not at all true, 2—A little true, 3—Somewhat true, 4—Mostly true, 5—Completely true), and self-efficacy items were on a four-point Likert scale (1—NO!, 2—no, 3—yes, 4—YES!). All responses were placed on a 0-1 scale to account for multiple Likert scales. Higher scores on the test anxiety items indicate higher test anxiety levels; therefore, an ideal course outcome would be high self-efficacy and low test anxiety scores on the surveys. We note that since students take the pre-surveys in the first week of their classes, their responses are based on anticipation for the course and reflect any prior physics course experience they have had. We further validated the survey through twenty one-hour student interviews to ensure that students interpreted questions as intended. These validations were incorporated early in the design of the survey before we started using the survey at this institution and included students all the way from introductory to upper-level physics courses.
Additionally, we performed confirmatory factor analysis using the student sample in this study as a check for continued validity. For both the pre- and post-surveys, the Comparative Fit Index (CFI) and Tucker Lewis Index (TLI) were ≥0.90 [44], and the Root Mean Square Error of Approximation (RMSEA) and Standardized Root Mean Square Residual (SRMR) were both ≤0.08 [45], which can be seen in Table A1 in the Appendix. Cronbach’s α was also consistently above 0.7, indicating good internal consistency across our measures [46]. Standardized factor loadings were all above 0.5 [44]. The square of the standardized factor loadings gives the percentage of variance of each observed variable that is explained by the latent variable, meaning that at least 25% of the variance in each of the survey items is explained by the respective construct.

2.2.2. Prior Academic Preparation

High school Grade Point Average (HS GPA) was reported using the weighted 0–5 scale, which is based on the standard 0 (failing)–4 (A) scale with adjustments for honors, Advanced Placement, and International Baccalaureate courses (these programs may offer a bonus as a reward for taking advanced courses, which can allow a GPA higher than 4.0). High school GPA is taken as a measure of general academic skills and generally is a strong predictor of early undergraduate course performance [47].
Students’ Scholastic Achievement Test Math (SAT Math) scores are on a scale of 200–800 and are used as a predictor of performance on high-stakes assessments involving mathematical problem solving (e.g., physics exams) [47,48,49]. If a student took the American College Testing (ACT) examination, we converted ACT to SAT scores [50]. If a student took a test more than once, the school provided the highest section-level score for the SAT and the highest composite score for the ACT. If a student took both ACT and SAT tests, we used their SAT score.

2.2.3. Assessment Scores

Homework and exam grades were provided by the instructor and were de-identified by an honest broker before being included in analysis. If grades were not on a 0–100 scale, they were rescaled. For example, if homework was graded on a 10-point scale, all scores were multiplied by 10 for analysis.

2.3. Analysis

First, we report means and standard deviations of each variable separately for men and women. Next, to determine if there were sex differences in the means of self-efficacy, test anxiety, prior preparation, or assessment scores, we performed unpaired t-tests to measure the statistical significance of the differences [46] and Cohen’s d to measure the size of the difference [51]. Cohen’s d is calculated using:
d = μ 1 μ 2 ( σ 1 2 + σ 2 2 ) / 2 ,
where μ 1 and μ 2 are the mean values of each group and σ 1 and σ 2 are the standard deviations of each group [51]. Group one was women and group two was men. Cohen’s d is considered small if d 0.2 , medium if d 0.5 , and large if d 0.8 [51]. We performed this analysis separately for Physics 1 and Physics 2 courses.
To explore the predictive relationships between test anxiety and assessment outcomes, we used multiple regression analysis. For each regression model, we report the standardized β coefficients, sample size, and adjusted R-squared. Standardized coefficients were used because they are in units of standard deviation and allow for direct comparison of effects [52]. We initially used gender, SAT Math scores, and HS GPA as predictors for low- and high-stakes assessment scores. Here, low-stakes assessment scores are the students’ average homework grades. High-stakes assessment scores are weighted so that 75% of the category is midterm exam grades and 25% is the final exam grade. This weighting was performed because the instructor gave three midterm exams and one final exam.
After establishing baseline models, we introduced pre- or average test anxiety and self-efficacy as predictors. Average test anxiety/self-efficacy is the mean of pre- and post-scores and was used as a proxy for students’ test anxiety/self-efficacy while they were taking the course. For both courses, we introduced two models using pre-self-efficacy and pre-test anxiety scores and four models using the average scores. The first two models predicted high-stakes assessment scores using both or neither of the constructs, while the third and fourth models used either self-efficacy or test anxiety in addition to the baseline predictors. During regression analysis, we used combined assessment categories (e.g., low- and high-stakes assessments), but results were similar when the categories were separated. For example, the regression models predicting high-stakes assessment scores were similar to both the models predicting midterm exam grades and those predicting final exam grades.

3. Results and Discussion

3.1. RQ1. Are There Gender Differences in Students’ Prior Preparation, Self-Efficacy, or Test Anxiety?

For both Physics 1 and 2, men on average had higher SAT Math scores than women, although the difference was not statistically significant for Physics 1. Also, women generally had higher high school GPAs than men. All these differences are small to medium ( d 0.2 to d 0.5 ). Students were generally well prepared for introductory physics: their average SAT Math scores were well above the 2019 national average of 528 [53], and average high school GPA was around or above 4.0 on a 5.0 point scale. High school GPA and SAT Math scores are both shown to be predictors of undergraduate STEM performance [47]. However, we cannot assume that men’s higher SAT scores directly translate to physics performance, as students in this sample show the reverse pattern in calculus 1, with women having a statistically significantly higher grade than men in this course [38].
In general, men reported higher self-efficacy and lower test anxiety than women in both Physics 1 and Physics 2, for both the pre- and post-surveys. However, the magnitude of differences differed by construct and course. In Physics 1, gender differences were large ( d 0.8 ) for test anxiety and self-efficacy on both pre- and post-surveys. From pre to post, the gender differences remained the same for test anxiety, while they decreased slightly for self-efficacy. In Physics 2, gender differences were larger for test anxiety than for self-efficacy on both pre- and post-surveys, with overall higher self-efficacy and lower test anxiety for men. Moreover, while test anxiety gender gaps decreased from pre to post, there was an increased gender gap for self-efficacy. While men’s average self-efficacy score did not change throughout the Physics 2 course, women’s self-efficacy score decreased, which contributed to the larger gender gap.
In terms of broader implications, our results in this novel context of bioscience students in physics courses (in which women outnumber men) are similar to findings for students taking calculus-based introductory physics: there are gender differences in both self-efficacy and test anxiety favoring men [18], and the constructs change over time, showing that they are malleable and potentially able to be influenced [28].
We also note differences in motivational factors from Physics 1 to Physics 2. Overall, students’ self-efficacy decreased from Physics 1 pre to post, which can be seen in Figure 1. For men, the average self-efficacy score remained fairly stable from the end of Physics 1 to the end of Physics 2. Overall, there was an increase in self-efficacy and decrease in test anxiety scores for students in our sample going from Physics 1 post to Physics 2 pre. We hypothesize that one of the reasons for this shift may be attributed to students receiving final grades which exceeded their initial expectations based on the average final exams in Table 2. At the start of Physics 2, women reported self-efficacy that was similar to what they reported at the start of Physics 1. However, from pre to post, their self-efficacy in Physics 2 decreased again, which can also be seen in Figure 1. On the other hand, Figure 2 shows that both men and women exhibit a similar trend of increasing test anxiety over the duration of taking Physics 1 and Physics 2 courses, with women having higher levels of anxiety overall. A student affected by test anxiety is likely to experience limits on the cognitive resources they can devote to the assessment [12], so we hypothesize that increased test anxiety may prevent students from accurately representing their knowledge on high-stakes assessments. In traditional exam-reliant courses, this is particularly concerning.

3.2. RQ2. Are There Gender Differences in Students’ Low- and High-Stakes Assessment Scores?

Homework constitutes the “low-stakes” assessment category. Table 2 and Table 3 show that female students had higher average homework scores than male students in Physics 1 (Table 2) and Physics 2 (Table 3). However, both effect sizes were small ( d 0.2 ), and gender differences were not statistically significant. Midterm and final exams constitute “high-stakes” assessments. In Physics 1, on average, men had higher exam scores. The gender differences were statistically significant with a medium effect size for the midterm exams and marginally significant for the final exam. On the other hand, there were no statistically significant gender differences in exam scores for Physics 2.
The courses had grades based primarily on midterm and final exam scores. This raises concerns for the Physics 1 course gender differences in exam scores. Research suggests that women are more likely than men to leave STEM fields due to concerns about grades (even if they have a reasonably high average) [54,55], so women’s lower high-stakes exam scores may contribute to their decision to leave these majors that require introductory physics. This, combined with data that show that gender gaps exist in very few other courses for bioscience students at this institution [56], suggests that some of these women may leave bioscience-related STEM fields due to their worse average performance in physics than men.
Physics 2 shows no such gender disparity in exam scores, though the course is graded in a similar way to Physics 1. Although this study is quantitative in nature and does not focus on the mechanisms for this difference, we hypothesize some potential reasons for this difference between Physics 1 and Physics 2. One difference may be that although the percentages of female and male students taking any type of high school physics course are roughly the same and the majority of these courses are heavily focused on Physics 1 content (e.g., mechanics), men are more likely to take AP or honors versions of these high school courses. These AP or honors versions of the high school courses may give men an advantage over women in the college Physics 1 course discussed here and may lead to the gender difference observed in our investigation if the instructor does not provide opportunities to bridge the gender gap due to disparities in high school courses taken [57,58]. On the other hand, a majority of these students do not take high school physics courses with materials commensurate with college Physics 2 (e.g., electricity and magnetism) discussed here. Thus, men are unlikely to have an advantage coming in compared to women. Other potential reasons are that only those who pass Physics 1 with at least a C grade can enroll in Physics 2 and that while Physics 2 is mandatory for all bioscience majors, it is not mandatory for some health-related majors (although these students from other health-related majors comprise a smaller portion of the students in Physics 1 compared to bioscience majors).

3.3. RQ3. Are There Gender Differences in Terms of Low- or High-Stakes Assessment Scores for Students with the Same Self-Efficacy and/or Test Anxiety?

The results of models predicting low-stakes assessments for Physics 1 and 2 can be found in Table A2 in the Appendix. For both Physics 1 and Physics 2, low-stakes assessment scores are not predicted by pre-self-efficacy, average self-efficacy, pre-test anxiety, or average test anxiety. Also, the models that include self-efficacy and test anxiety do not explain any more variance than the models that exclude them. The only significant predictor of low-stakes assessments is high school GPA, and gender is not a significant predictor of the outcome. This confirms our hypothesis that test anxiety will not predict low-stakes assessment scores.
Table 4 and Table 5 show the results of our regression models predicting high-stakes assessment outcomes for Physics 1 and Physics 2, respectively. Each of the models uses the variables in the far left column to predict the outcome variable (e.g., Physics 1 and Physics 2 high-stakes assessment scores). Any blank spaces in the table indicate that the predictor in the corresponding row was not used in that model. We have used test anxiety and self-efficacy scores from either the pre-survey or the average score using both pre- and post-survey as predictors. The adjusted R-squared for each model has been reported as a measure of how well the model explains the variance in the dependent variable while taking the number of predictors into account. The strength of each predictor, controlling for all other predictors in the model, is given by the standardized regression coefficient [52]. More specifically, for each change of one standard deviation in the predictor variable, the model predicts there will be a change of beta standard deviations in the outcome variable, controlling for all other predictor variables [52]. The tables aim to show the predictive relationship between student characteristics (sex, high school GPA, SAT/ACT math scores, self-efficacy, and test anxiety) and their Physics 1 high-stakes exam scores. The coefficients, as described earlier, represent how strongly each characteristic is related to the Physics 1 exam scores, where a positive number indicates a positive relationship and a negative number indicates a negative relationship. For example, we can see that in Average Model 1, a one-standard-deviation increase in test anxiety decreases the Physics 1 high-stakes assessment score by 0.26 standard deviations, when keeping all other variables constant.
For Physics 1, both high school GPA and SAT/ACT Math scores consistently have strong positive correlations with Physics 1 exam scores across all models. This means that students with higher high school GPAs and higher standardized math scores tend to do better on the high-stakes exams in this course. However, high-stakes assessment scores are not predicted by pre-test anxiety or pre-self-efficacy, which can be seen in Table 4. Also, looking at pre-survey scores, Model 1 in Table 4 (which includes pre-self-efficacy and test anxiety as predictors) does not explain much more of the variance than Model 2 (which does not include pre-self-efficacy and test anxiety as predictors). Thus, prior self-efficacy or test-anxiety scores do not appear to greatly contribute to performance differences among students. This is good because instructors can intervene by creating an equitable and inclusive learning environment to improve students’ self-efficacy and reduce their test anxiety.
However, in Physics 1, high-stakes assessment scores were predicted by average test anxiety but not self-efficacy, which can be seen in Table 4. Average Model 1 in Table 4 shows that average test anxiety predicts high-stakes assessment outcomes. Importantly, Average Model 1 and Average Model 3 explain more of the variance in the dependent variable (high-stakes assessment scores) compared to the other models. Additionally, there are no statistically significant gender differences in any of the average models that include test anxiety or self-efficacy as a predictor of high-stakes grades, but there are gender differences in Average Model 2. This means that gender differences in test anxiety and self-efficacy may account for some of the gender differences disadvantaging women we see in high-stakes assessments.
For Physics 2, high-stakes assessment scores were similarly not predicted by pre-self-efficacy or pre-test anxiety, and there were no gender differences for students with the same prior preparation scores (high school GPA and SAT/ACT math scores). We can see again that the students’ prior self-efficacy and test anxiety scores do not seem to contribute to their exam performance, which gives instructors the chance to cultivate a more inclusive and fair learning environment. In the models using the average score, controlling both for average self-efficacy and test anxiety, neither of them are statistically significant predictors of the outcome, but test anxiety and self-efficacy each become statistically significant predictors of the high-stakes assessment score in Model 3 and Model 4. This implies that for students in Physics 2, self-efficacy and test anxiety are inversely correlated, meaning that students with high test anxiety tend to have lower self-efficacy and vice versa, which can potentially explain how including both variables obscures their relationship with the outcome. Additionally, gender does not predict high-stakes assessment outcomes in Physics 2. The models can be seen in Table 5.
Broadly, we find that self-efficacy positively predicts high-stakes assessment scores, while test anxiety negatively predicts scores. We also find (see Section 3.1 RQ1) that both self-efficacy and test anxiety measures became worse over time for all students and that women reported lower self-efficacy and higher test anxiety than men even in this novel context of physics courses in which they outnumber men. Thus, it is important for instructors to take steps to reduce student test anxiety and increase student self-efficacy. This is important in encouraging the success of all students, but particularly women who appear to be more affected by low self-efficacy and high test anxiety, especially in Physics 1.

4. Conclusions, Limitations, and Future Directions

We have added to the research in this field by exploring gender differences in and the predictive power of self-efficacy and test anxiety for low- and high-stakes assessments for physics courses in which women are in the majority. While past research has identified gender differences in these factors and their relationship with low- and high-stakes assessment outcomes in courses where women are in the minority [18], our results here demonstrate that the inequitable patterns hold even when women outnumber men. Using validated survey data and grade information from students in a two-semester introductory Physics course sequence for bioscience majors and other majors with students interested in health-related professions, we compared the predictive power of self-efficacy and test anxiety on female and male students’ performance on both low- and high-stakes assessments. We found that there are gender differences disadvantaging women in self-efficacy, test anxiety, and high-stakes (but not low-stakes) assessment outcomes in Physics 1. We also found that self-efficacy and/or test anxiety predicted only high-stakes assessment outcomes in both Physics 1 and Physics 2. Comparison of these findings in a novel context with prior studies involving physical science and engineering students shows that although women outnumber men in physics courses for bioscience students and the career goals of bioscience students are very different from those of physical science and engineering majors, most of the adverse trends are similar even for this new population. Therefore, these findings highlight the systemic nature of women being more adversely affected by anxiety in high-stakes assessments and the need for creating more equitable learning environments.
Instructors can help decrease test anxiety and its adverse impact on student performance directly by lowering the emphasis on high-stakes assessments and increasing the emphasis on low-stakes assessments in their courses. The careful choice of assessment tools can help create a more equitable classroom environment by minimizing fears that come with test anxiety related to high-stakes assessments (e.g., receiving a low course grade due to one bad exam score which counts for a significant portion of the course grade). Moreover, frequent low-stakes assessments can also give students many attempts to practice test taking and encourage spaced practice, which is more effective for the retention of knowledge and skill development than “cramming” before an exam [59]. We recognize that instructors may want to include opportunities to assess students’ cumulative learning in a course. In particular, since physics is a hierarchical discipline, it may be useful to give students incentives and support to organize their knowledge hierarchically so that they focus on discerning the connections between the concepts in different chapters. However, these types of cumulative assessments can be made lower-stakes by offering more of them in the course and making each count towards less of a student’s grade [60]. Additionally, implementing a range of formative assessments (such as clicker questions, homework, tutorials, projects, and other types of assessments), each of which do not count for a very large portion of a students’ course grade, can help students develop a wider variety of skills without increasing their anxiety. In particular, providing students with these types of supports via different types of frequent formative assessments can help students develop the desired knowledge structure and skills including the ability to communicate science better based upon the goals of the course in a low-anxiety and equitable learning environment.
In addition to recommending the instructors to incorporate more frequent low-stakes assessments to reduce student anxiety and boost self-efficacy, rather than a few heavily weighted high-stakes exams, we also suggest the implementation of research-based interventions and activities designed to promote equity in college courses. An important implication is that the course instructors should carefully contemplate how to foster equitable, inclusive, and low-anxiety learning environments and examine the role of high-stakes and low-stakes assessments in grading policies. The course instructors have a central role to play in creating such an environment in which student anxiety is reduced during learning and assessment and self-efficacy is improved [61]. For example, prior research suggests that when instructors emphasize at the beginning of their courses through short activities that most students struggle in learning to solve challenging physics problems, that struggle is the stepping stone to learning physics, and that students should embrace them, while talking about their own struggles when they were in similar courses, the gender gap in performance is eliminated [62]. When instructors implement these types of activities, they are also encouraged to reflect on their own mindset about whether all students can excel in their courses, which is important because instructors with growth mindset about their students’ potential have significantly smaller performance differences between traditionally marginalized and dominant demographic groups in STEM courses, and students from traditionally marginalized groups report greater motivation in those courses [63].
One limitation of this investigation with regard to RQ3 is that our findings are correlational in nature and a correlation does not imply causation. Another limitation related to generalizability is that the research was carried out at a large research university in the US and results may not apply to other types of US institutions, e.g., two-year or four-year liberal arts colleges or higher education institutions in other countries. Moreover, the institution where this study was carried out is a predominantly White institution, and these findings may not apply to minority-serving institutions. Another limitation relates to the fact that the investigation only focused on gender differences and did not focus on other facets of students’ identities, e.g., based upon their race/ethnicity, first-generation college status, low-income status, etc., and the intersectionality of different identities and how they impact test anxiety and self-efficacy.
Future studies could focus on investigating similar issues at other types of institutions as well as for different student demographic groups and intersecting identities. It would also be valuable to conduct individual interviews or focus group discussions with students from various demographic groups because qualitative investigations can complement quantitative study discussed here and shed light on the mechanisms, e.g., for the observed gender differences.

Author Contributions

A.M. and F.S. contributed to analysis and interpretation of data, as well as writing and revision of the manuscript. C.S. contributed to the conception and design of research, acquisition and interpretation of data, and revision of the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research is supported by National Science Foundation grant DUE-1524575.

Institutional Review Board Statement

This research was carried out in accordance with the principles outlined in the university’s Institutional Review Board ethical policy with approval code PRO15030412.

Informed Consent Statement

Consent requirement was waived due to the research being approved as exempt from informed consent by the university’s Institutional Review Board.

Data Availability Statement

The data sets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Conflicts of Interest

No potential conflicts of interest were reported by the authors.

Appendix A

Table A1. Survey fit indices for confirmatory factor analysis for both Physics 1 and Physics 2.
Table A1. Survey fit indices for confirmatory factor analysis for both Physics 1 and Physics 2.
Fit IndicesCronbach’s α
Course CFI TLI RMSEA SRMR Pre-SE Post-SE Pre-TA Post-TA
Physics 10.950.940.070.050.710.810.890.92
Physics 20.930.910.080.060.840.870.930.94
Note: Cronbach’s α for pre- and post-test anxiety (TA) and self-efficacy (SE) is included. For Physics 1, N = 230 , and for Physics 2, N = 203 .
Table A2. Physics 1 and 2 low-stakes assessment scores predicted by student sex, high school GPA (HS GPA), SAT/ACT Math scores, pre- or average self-efficacy, and pre- or average test anxiety.
Table A2. Physics 1 and 2 low-stakes assessment scores predicted by student sex, high school GPA (HS GPA), SAT/ACT Math scores, pre- or average self-efficacy, and pre- or average test anxiety.
Physics 1Physics 2
Pre Avg. Pre Avg.
Variable Model 1 Model 2 Model 3 Model 1 Model 1 Model 2 Model 3 Model 1
Sex (M = 0, F = 1)−0.01 ns−0.02 ns−0.01 ns−0.04 ns0.04 ns0.04 ns0.08 ns−0.04 ns
HS GPA0.32 ***0.32 ***0.31 ***0.31 ***0.27 ***0.27 ***0.27 ***0.32 **
SAT/ACT Math0.05 ns0.05 ns0.05 ns0.05 ns−0.06 ns−0.06 ns−0.09 ns−0.02 ns
Self-Efficacy0.05 ns −0.05 ns−0.02 ns 0.09 ns
Test Anxiety0.07 ns0.04 ns 0.07 ns0.09 ns0.10 ns 0.04 ns
N19019019017417917917978
Adjusted R 2 0.080.090.090.070.060.060.060.04
Note: Standardized regression ( β ) coefficients are provided. * = p < 0.05 , ** = p < 0.01 , *** = p < 0.001 , and ns = not statistically significant.

References

  1. Bouffard-Bouchard, T.; Parent, S.; Larivee, S. Influence of self-efficacy on self-regulation and performance among junior and senior high-school age students. Int. J. Behav. Dev. 1991, 14, 153–164. [Google Scholar] [CrossRef]
  2. Pintrich, P.R.; De Groot, E.V. Motivational and Self-Regulated Learning Components of Classroom Academic Performance. J. Educ. Psychol. 1990, 82, 33–40. [Google Scholar] [CrossRef]
  3. Zimmerman, B.J. Self-efficacy: An essential motive to learn. Contemp. Educ. Psychol. 2000, 25, 82–91. [Google Scholar] [CrossRef] [PubMed]
  4. Nissen, J.M.; Shemwell, J.T. Gender, experience, and self-efficacy in introductory physics. Phys. Rev. Phys. Educ. Res. 2016, 12, 020105. [Google Scholar] [CrossRef]
  5. Sawtelle, V.; Brewe, E.; Kramer, L.H. Exploring the relationship between self-efficacy and retention in introductory physics. J. Res. Sci. Teach. 2012, 49, 1096–1121. [Google Scholar] [CrossRef]
  6. Steele, C.M. A threat in the air: How stereotypes shape intellectual identity and performance. Am. Psychol. 1997, 52, 613. [Google Scholar] [CrossRef]
  7. Maries, A.; Singh, C. Towards meaningful diversity, equity and inclusion in physics learning environments. Nat. Phys. 2024, 20, 367–375. [Google Scholar] [CrossRef]
  8. Salehi, S.; Cotner, S.; Azarin, S.; Carlson, E.; Driessen, M.; Ferry, V.; Harcombe, W.; McGaugh, S.; Wassenberg, D.; Yonas, A.; et al. Gender Performance Gaps Across Different Assessment Methods and the Underlying Mechanisms: The Case of Incoming Preparation and Test Anxiety. Front. Educ. 2019, 4, 107. [Google Scholar] [CrossRef]
  9. Marshman, E.M.; Kalender, Z.Y.; Nokes-Malach, T.; Schunn, C.; Singh, C. Female students with A’s have similar physics self-efficacy as male students with C’s in introductory courses: A cause for alarm? Phys. Rev. Phys. Educ. Res. 2018, 14, 020123. [Google Scholar] [CrossRef]
  10. Cwik, S.; Singh, C. Gender differences in students’ self-efficacy in introductory physics courses in which women outnumber men predict their grade. Phys. Rev. Phys. Educ. Res. 2022, 18, 020142. [Google Scholar] [CrossRef]
  11. Cavallo, A.M.; Potter, W.H.; Rozman, M. Gender differences in learning constructs, shifts in learning constructs, and their relationship to course achievement in a structured inquiry, yearlong college physics course for life science majors. Sch. Sci. Math. 2004, 104, 288–300. [Google Scholar] [CrossRef]
  12. Zeidner, M. Test Anxiety: The State of the Art; Springer: New York, NY, USA, 1998. [Google Scholar]
  13. Bandura, A.; Freeman, W.; Lightsey, R. Self-efficacy: The exercise of control. J. Cogn. Psychother. 1999, 13, 158–166. [Google Scholar] [CrossRef]
  14. Maloney, E.A.; Sattizahn, J.R.; Beilock, S.L. Anxiety and cognition. Wiley Interdiscip. Rev. Cogn. Sci. 2014, 5, 403–411. [Google Scholar] [CrossRef]
  15. Solomon, D.; Battistich, V.; Kim, D.I.; Watson, M. Teacher practices associated with students’ sense of the classroom as a community. Soc. Psychol. Educ. 1996, 1, 235–267. [Google Scholar] [CrossRef]
  16. Ballen, C.J.; Salehi, S.; Cotner, S. Exams disadvantage women in introductory biology. PLoS ONE 2017, 12, e0186419. [Google Scholar] [CrossRef] [PubMed]
  17. Stang, J.B.; Altiere, E.; Ives, J.; Dubois, P.J. Exploring the contributions of self-efficacy and test anxiety to gender differences in assessments. In Proceedings of the Physics Education Research Conference 2020, PER Conference, Virtual Conference, 22–23 July 2020; pp. 497–502. [Google Scholar]
  18. Malespina, A.; Singh, C. Gender differences in test anxiety and self-efficacy: Why instructors should emphasize low-stakes formative assessments in physics courses. Eur. J. Phys. 2022, 43, 035701. [Google Scholar] [CrossRef]
  19. Bandura, A. On the Functional Properties of Perceived Self-Efficacy Revisited. J. Manag. 2012, 38, 9–44. [Google Scholar] [CrossRef]
  20. Porter, A.M.; Ivie, R. Women in Physics and Astronomy; Statistical Research Center of the American Institute of Physics: College Park, MD, USA, 2019. [Google Scholar]
  21. Hazari, Z.; Brewe, E.; Goertzen, R.M.; Hodapp, T. The importance of high school physics teachers for female students’ physics identity and persistence. Phys. Teach. 2017, 55, 96–99. [Google Scholar] [CrossRef]
  22. Li, Y.; Singh, C. Impact of perceived recognition by physics instructors on women’s self-efficacy and interest. Phys. Rev. Phys. Educ. Res. 2023, 19, 020125. [Google Scholar] [CrossRef]
  23. Sundstrom, M.; Heim, A.B.; Park, B.; Holmes, N. Introductory physics students’ recognition of strong peers: Gender and racial or ethnic bias differ by course level and context. Phys. Rev. Phys. Educ. Res. 2022, 18, 020148. [Google Scholar] [CrossRef]
  24. Cotner, S.; Leno, L.; Walker, J.; Jørgensen, C.; Vandvik, V. Gender gaps in the performance of Norwegian biology students: The roles of test anxiety and science confidence. Int. J. STEM Educ. 2020, 7, 55. [Google Scholar] [CrossRef]
  25. Pelch, M. Gendered differences in academic emotions and their implications for student success in STEM. Int. J. STEM Educ. 2018, 5, 33. [Google Scholar] [CrossRef] [PubMed]
  26. Crouch, C.H.; Geller, B. Introductory Physics for Life Sciences: Preparing and Engaging Students through Authentic Interdisciplinary Connections. In The International Handbook of Physics Education Research: Learning Physics; Taşar, M.F., Heron, P.R.L., Eds.; AIP Publishing: Melville, NY, USA, 2023. [Google Scholar] [CrossRef]
  27. Malespina, A. Investigating Gender Differences in Test Anxiety, Self-efficacy, Mindset, Grade Penalty and Grades in Physics Courses: A Quest for Equity. Ph.D. Thesis, University of Pittsburgh, Pittsburgh, PA, USA, 2023. [Google Scholar]
  28. Malespina, A.; Singh, C. Introductory physics during COVID-19 remote instruction: Gender gaps in exams are eliminated, but test anxiety and self-efficacy still predict success. Eur. J. Phys. 2024, 45. [Google Scholar] [CrossRef]
  29. Little, A.J.; Humphrey, B.; Green, A.; Nair, A.; Sawtelle, V. Exploring mindset’s applicability to students’ experiences with challenge in transformed college physics courses. Phys. Rev. Phys. Educ. Res. 2019, 15, 010127. [Google Scholar] [CrossRef]
  30. Espinosa, T.; Miller, K.; Araujo, I.; Mazur, E. Reducing the gender gap in students’ physics self-efficacy in a team-and project-based introductory physics class. Phys. Rev. Phys. Educ. Res. 2019, 15, 010132. [Google Scholar] [CrossRef]
  31. Wang, J.; Hazari, Z. Promoting high school students’ physics identity through explicit and implicit recognition. Phys. Rev. Phys. Educ. Res. 2018, 14, 020111. [Google Scholar] [CrossRef]
  32. Hazari, Z.; Cass, C. Towards meaningful physics recognition: What does this recognition actually look like? Phys. Teach. 2018, 56, 442–446. [Google Scholar] [CrossRef]
  33. Brewe, E.; Sawtelle, V.; Kramer, L.H.; O’Brien, G.E.; Rodriguez, I.; Pamelá, P. Toward equity through participation in modeling instruction in introductory university physics. Phys. Rev. Spec. Top.—Phys. Educ. Res. 2010, 6, 010106. [Google Scholar] [CrossRef]
  34. Traxler, A.; Brewe, E. Equity investigation of attitudinal shifts in introductory physics. Phys. Rev. Spec. Top.—Phys. Educ. Res. 2015, 11, 020132. [Google Scholar] [CrossRef]
  35. Van Dusen, B.; Nissen, J. Equity in college physics student learning: A critical quantitative intersectionality investigation. J. Res. Sci. Teach. 2020, 57, 33–57. [Google Scholar] [CrossRef]
  36. Cwik, S.; Singh, C. Damage caused by societal stereotypes: Women have lower physics self-efficacy controlling for grade even in courses in which they outnumber men. Phys. Rev. Phys. Educ. Res. 2021, 17, 020138. [Google Scholar] [CrossRef]
  37. Leslie, S.J.; Cimpian, A.; Meyer, M.; Freeland, E. Expectations of brilliance underlie gender distributions across academic disciplines. Science 2015, 347, 262–265. [Google Scholar] [CrossRef]
  38. Whitcomb, K.M.; Kalender, Z.Y.; Nokes-Malach, T.; Schunn, C.; Singh, C. Comparison of self-efficacy and performance of engineering undergraduate women and men. Int. J. Eng. Educ. 2020, 36, 1996–2014. [Google Scholar]
  39. Witherspoon, E.B.; Schunn, C.D. Locating and understanding the largest gender differences in pathways to science degrees. Sci. Educ. 2020, 104, 144–163. [Google Scholar] [CrossRef]
  40. Traxler, A.L.; Cid, X.C.; Blue, J.; Barthelemy, R. Enriching gender in physics education research: A binary past and a complex future. Phys. Rev. Phys. Educ. Res. 2016, 12, 020114. [Google Scholar] [CrossRef]
  41. Pintrich, P.R. A Manual for the Use of the Motivated Strategies for Learning Questionnaire; University of Michigan: Ann Arbor, MI, USA, 1991. [Google Scholar]
  42. Pintrich, P.R.; Smith, D.A.F.; Garcia, T.; McKeachie, W.J. Reliability and Predictive Validity of the Motivated Strategies for Learning Questionnaire (MSLQ). Educ. Psychol. Meas. 1993, 53, 801–813. [Google Scholar] [CrossRef]
  43. Kalender, Z.Y.; Marshman, E.; Schunn, C.D.; Nokes-Malach, T.; Singh, C. Gendered patterns in the construction of physics identity from motivational factors. Phys. Rev. Phys. Educ. Res. 2019, 15, 020119. [Google Scholar] [CrossRef]
  44. Hu, L.; Bentler, P.M. Cutoff criteria for fit indexes in covariance structure analysis: Conventional criteria versus new alternatives. Struct. Equ. Model. A Multidiscip. J. 1999, 6, 1–55. [Google Scholar] [CrossRef]
  45. Browne, M.W.; Cudeck, R. Alternative ways of assessing model fit. Sage Focus Ed. 1993, 154, 136. [Google Scholar] [CrossRef]
  46. Frey, B.B. The SAGE Encyclopedia of Educational Research, Measurement, and Evaluation; SAGE Publications, Inc.: Thousand Oaks, CA, USA, 2018. [Google Scholar] [CrossRef]
  47. Galla, B.M.; Shulman, E.P.; Plummer, B.D.; Gardner, M.; Hutt, S.J.; Goyer, J.P.; D’Mello, S.K.; Finn, A.S.; Duckworth, A.L. Why High School Grades Are Better Predictors of On-Time College Graduation Than Are Admissions Test Scores: The Roles of Self-Regulation and Cognitive Ability. Am. Educ. Res. J. 2019, 56, 2077–2115. [Google Scholar] [CrossRef]
  48. Hazari, Z.; Tai, R.H.; Sadler, P.M. Gender differences in introductory university physics performance: The influence of high school physics preparation and affective factors. Sci. Educ. 2007, 91, 847–876. [Google Scholar] [CrossRef]
  49. Vincent-Ruz, P.; Binning, K.; Schunn, C.; Grabowski, J. The effect of math SAT on women’s chemistry competency beliefs. Chem. Educ. Res. Pract. 2018, 19, 342–351. [Google Scholar] [CrossRef]
  50. The College Board. ACT. Guide to the 2018 ACT/SAT Concordance; The College Board: New York, NY, USA, 2018. [Google Scholar]
  51. Cohen, J. Statistical Power Analysis for the Behavioral Sciences; L. Erlbaum Associates: Hillsdale, NJ, USA, 1988. [Google Scholar]
  52. Cohen, J.; Cohen, P.; West, S.; Aiken, L.S. Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences; Lawrence Erlbaum: Mahwah, NJ, USA, 2003. [Google Scholar]
  53. College Entrance Examination Board. SAT Suite of Assessments Annual Report; College Entrance Examination Board: New York, NY, USA, 2020. [Google Scholar]
  54. Seymour, E.; Hunter, A.B.; Thiry, H.; Weston, T.J.; Harper, R.P.; Holland, D.G.; Koch, A.K.; Drake, B.M. Talking About Leaving Revisited: Persistence, Relocation, and Loss in Undergraduate STEM Education; Springer International Publishing AG: Cham, Switzerland, 2019. [Google Scholar]
  55. Rodriguez, I.; Potvin, G.; Kramer, L.H. How gender and reformed introductory physics impacts student success in advanced physics courses and continuation in the physics major. Phys. Rev. Phys. Educ. Res. 2016, 12, 020118. [Google Scholar] [CrossRef]
  56. Malespina, A.; Singh, C. Gender gaps in grades versus grade penalties: Why grade anomalies may be more detrimental for women aspiring for careers in biological sciences. Int. J. STEM Educ. 2023, 10, 13. [Google Scholar] [CrossRef]
  57. Lindstrøm, C.; Sharma, M.D. Self-efficacy of first year university physics students: Do gender and prior formal instruction in physics matter? Int. J. Innov. Sci. Math. Educ. 2011, 19, 1–19. [Google Scholar]
  58. Marchand, G.C.; Taasoobshirazi, G. Stereotype threat and women’s performance in physics. Int. J. Sci. Educ. 2013, 35, 3050–3061. [Google Scholar] [CrossRef]
  59. Anderson, J.R. Learning and Memory: An Integrated Approach; Wiley: New York, NY, USA, 2000. [Google Scholar]
  60. Laverty, J.T.; Bauer, W.; Kortemeyer, G.; Westfall, G. Want to Reduce Guessing and Cheating While Making Students Happier? Give More Exams! Phys. Teach. 2012, 50, 540–543. [Google Scholar] [CrossRef]
  61. Sathy, V.; Hogan, K.A. Inclusive Teaching: Strategies for Promoting Equity in the College Classroom; West Virginia University Press: Morgantown, WV, USA, 2022. [Google Scholar]
  62. Binning, K.R.; Kaufmann, N.; McGreevy, E.M.; Fotuhi, O.; Chen, S.; Marshman, E.; Kalender, Z.Y.; Limeri, L.; Betancur, L.; Singh, C. Changing social contexts to foster equity in college science courses: An ecological-belonging intervention. Psychol. Sci. 2020, 31, 1059–1070. [Google Scholar] [CrossRef]
  63. Canning, E.A.; Muenks, K.; Green, D.J.; Murphy, M.C. STEM faculty who believe ability is fixed have larger racial achievement gaps and inspire less student motivation in their classes. Sci. Adv. 2019, 5, eaau4734. [Google Scholar] [CrossRef]
Figure 1. Average self-efficacy scores of men and women from the start of Physics 1 to the end of Physics 2. Error bars represent standard error, and self-efficacy score is on a 0–1 scale.
Figure 1. Average self-efficacy scores of men and women from the start of Physics 1 to the end of Physics 2. Error bars represent standard error, and self-efficacy score is on a 0–1 scale.
Education 14 01092 g001
Figure 2. Average test anxiety scores of men and women from the start of Physics 1 to the end of Physics 2. Error bars represent standard error, and test anxiety is on a 0–1 scale.
Figure 2. Average test anxiety scores of men and women from the start of Physics 1 to the end of Physics 2. Error bars represent standard error, and test anxiety is on a 0–1 scale.
Education 14 01092 g002
Table 1. Items included in student survey and factor analysis.
Table 1. Items included in student survey and factor analysis.
Factor Loading
Physics 1Physics 2
Construct Name/Item TextPrePostPrePost
Self-Efficacy
1.I am able to help my classmates with physics in the laboratory or in recitation0.610.560.710.74
2.I understand concepts I have studied in physics0.600.730.760.80
3.If I study, I will do well on a physics test0.640.840.800.82
4.If I encounter a setback in a physics exam, I can overcome it0.640.790.740.83
Test Anxiety
5.I am so nervous during a physics test that I cannot remember what I have learned0.800.830.910.89
6.I have an uneasy, upset feeling when I take a physics test0.900.920.910.93
7.I worry a great deal about physics tests0.800.890.830.82
8.When I take a physics test, I think about how poorly I am doing0.760.810.870.92
Note: The same items were given to students for the pre- and post-survey. For Physics 1, N = 230 , and for Physics 2, N = 203 .
Table 2. Physics 1 prior preparation, self-efficacy, test anxiety, and assessment scores.
Table 2. Physics 1 prior preparation, self-efficacy, test anxiety, and assessment scores.
FemaleMaleComparison
VariableNMeanSDNMeanSDtpd
HS GPA1534.140.40773.960.642.540.0110.36
SAT Math152666647567770−1.230.220−0.17
Self-Efficacy Pre1350.600.16670.710.13−5.02<0.001−0.75
Self-Efficacy Post1410.500.20680.630.19−4.60<0.001−0.68
Test Anxiety Pre1290.480.26660.280.215.47<0.0010.83
Test Anxiety Post1420.630.27680.400.275.58<0.0010.82
Homework15388117787130.910.3640.13
Midterm Exams1536614777113−2.900.004−0.40
Final Exam1536213776614−1.950.052−0.27
Note: Sample size, mean, and standard deviation (SD) of prior preparation, motivational factors, and assessment outcomes of students. Results and significance of unpaired t-tests are provided. Cohen’s d effect sizes are also given; a negative d indicates that female students had lower scores than male students.
Table 3. Physics 2 prior preparation, self-efficacy, test anxiety, and assessment scores.
Table 3. Physics 2 prior preparation, self-efficacy, test anxiety, and assessment scores.
FemaleMaleComparison
Variable N Mean SD N Mean SD t p d
HS GPA1174.180.41864.060.411.990.0480.28
SAT Math117669718569066−2.040.042−0.29
Self-Efficacy Pre1070.590.19770.650.16−2.280.023−0.34
Self-Efficacy Post480.510.20380.650.14−3.52<0.001−0.76
Test Anxiety Pre1050.560.26750.340.245.93<0.0010.90
Test Anxiety Post480.600.28380.380.263.63<0.0010.79
Homework11793158689201.330.1850.19
Midterm Exams1176512866613−0.640.524−0.09
Final Exam1175515865517−0.240.811−0.03
Note: Sample size, mean, and standard deviation (SD) of prior preparation, motivational factors, and assessment outcomes of students. Results and significance of unpaired t-tests are provided. Cohen’s d effect sizes are also given; a negative d indicates that female students had lower scores than male students.
Table 4. Physics 1 high-stakes assessment scores predicted by student sex, high school GPA (HS GPA), SAT/ACT Math scores, pre/average self-efficacy, and pre/average test anxiety.
Table 4. Physics 1 high-stakes assessment scores predicted by student sex, high school GPA (HS GPA), SAT/ACT Math scores, pre/average self-efficacy, and pre/average test anxiety.
PreAvg.
Variable Model 1 Model 2 Model 1 Model 2 Model 3 Model 4
Sex−0.16 *−0.21 ***−0.06 ns−0.20 ***−0.07 ns−0.10 ns
HS GPA0.30 ***0.32 ***0.23 ***0.29 ***0.24 ***0.25 ***
SAT/ACT Math0.41 ***0.41 ***0.37 ***0.42 ***0.38 ***0.40 ***
Self-Efficacy0.06 ns 0.12 ns 0.26 ***
Test Anxiety−0.08 ns −0.26 *** −0.32 ***
Adjusted R 2 0.380.370.450.370.450.42
N190190174174174174
Note: Standardized regression ( β ) coefficients are provided. * = p < 0.05 , *** = p < 0.001 , and ns = not statistically significant.
Table 5. Physics 2 high-stakes assessment scores predicted by student sex, high school GPA (HS GPA), SAT/ACT Math scores, pre/average self-efficacy, and pre/average test anxiety.
Table 5. Physics 2 high-stakes assessment scores predicted by student sex, high school GPA (HS GPA), SAT/ACT Math scores, pre/average self-efficacy, and pre/average test anxiety.
PreAvg.
Variable Model 1 Model 2 Model 1 Model 2 Model 3 Model 4
Sex0.02 ns−0.02 ns0.09 ns−0.03 ns0.08 ns0.05 ns
HS GPA0.25 ***0.25 ***0.09 ns0.09 ns0.09 ns0.09 ns
SAT/ACT Math0.40 ***0.43 ***0.46 ***0.52 ***0.47 ***0.46 ***
Self-Efficacy0.02 ns 0.08 ns 0.24 *
Test Anxiety−0.11 ns −0.23 ns −0.28 **
Adjusted R20.330.320.350.290.350.33
N17917978787878
Note: Standardized regression ( β ) coefficients are provided. * = p < 0.05 , ** = p < 0.01 , *** = p < 0.001 , and ns = not statistically significant.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Malespina, A.; Seifollahi, F.; Singh, C. Bioscience Students in Physics Courses with Higher Test Anxiety Have Lower Grades on High-Stakes Assessments and Women Report More Test Anxiety than Men. Educ. Sci. 2024, 14, 1092. https://doi.org/10.3390/educsci14101092

AMA Style

Malespina A, Seifollahi F, Singh C. Bioscience Students in Physics Courses with Higher Test Anxiety Have Lower Grades on High-Stakes Assessments and Women Report More Test Anxiety than Men. Education Sciences. 2024; 14(10):1092. https://doi.org/10.3390/educsci14101092

Chicago/Turabian Style

Malespina, Alysa, Fargol Seifollahi, and Chandralekha Singh. 2024. "Bioscience Students in Physics Courses with Higher Test Anxiety Have Lower Grades on High-Stakes Assessments and Women Report More Test Anxiety than Men" Education Sciences 14, no. 10: 1092. https://doi.org/10.3390/educsci14101092

APA Style

Malespina, A., Seifollahi, F., & Singh, C. (2024). Bioscience Students in Physics Courses with Higher Test Anxiety Have Lower Grades on High-Stakes Assessments and Women Report More Test Anxiety than Men. Education Sciences, 14(10), 1092. https://doi.org/10.3390/educsci14101092

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop