3.2. The Tasks and Materials
In our experiments, the students were asked independently to read and evaluate the selected materials and to submit their assessment. At the same time, their eye movement behaviors were recorded through eye trackers.
The experiment materials were from a previous work [
25], where 20 students participated in re-editing a section of the textbook in an online web environment. They were provided with a textbook (a pdf file) and were asked to search for relative multimedia as content to complete their editing using Microsoft Sway. Each student was required to complete a slide deck that was restricted in the range of 8 to 12 pages. We successfully collected the work of 20 students, that is, 20 online slide decks. These works were page-turned, which allowed us to study the eye movement behavior page by page. Among them, we further selected six slide decks as experimental materials. To identify the quality of the work, two experts rated each work based on assessment scale that is the same as the student assessment scale defined in
Section 3.5.
Finally, six different Microsoft Sway slide decks were selected as learning content for the students to read in this experiment. Two of them had high evaluation scores, two had medium evaluation scores, and two had low evaluation scores. We scored each of them with the measurement and divided them into three groups by their score. The scores were regarded as a metric to measure the degree of interest. The two slide decks with high evaluation scores were named #1 and #2. The two slide decks with medium evaluation scores were named #3 and #4. The two slide decks with low evaluation scores were named #5 and #6.
The six slide decks have their respective characteristics. The average page number of the six selected slide decks is 10. To understand the features of each work according to the eye movement behavior, we organized the composition of each slide deck by page. Each page was presented as a visual stimulus. To identify the page composition, we divided the pages into eight categories according to the visually similar features and similar characteristics regarding the statics, including “Upper left title”, “Centered title”, “Image with upper left title”, “Image with centered title”, “Image with instructions”, “Scattered text”, “Centralized text”, and “Little centralized text”, as shown in
Table 2.
Figure 1 shows the examples of each category.
3.3. Procedure
Before the experiment, we prepared the eye tracker and eye movement recording system. The Tobii eye tracker 4C [
26] was selected as the device to process the experiment as it is quite simple to set up and can access to the recording system. An open-source eye analytic software called Chrome Plus Record (CPR) [
27] with Tobii’s analytical use license, was used in this work for eye movement data recording, as it was more convenient and more suitable to fulfill our research purposes compared with other eye tracking analytic applications.
The eye tracker was placed in front of the user under the screen with an attached magnet, as shown in
Figure 2. After the installation, we need one more step to finish the setting: i.e., calibration. The calibration is based on the nine-point calibration function implemented in CPR system with the Tobii development SDK, as shown in
Figure 3, where all calibration points are visualized with a circle. In the calibration process, subject needed to look at the calibration point on the screen by following an instruction of staring at specific points, and the eye tracker started reading the subject’s gaze data. By checking if the subject’s gaze matched the location of an alternately moving and stopping display point on the monitor, the CPR system will automatically determine if the calibration passed or not. We utilized a 23 inch LCD with 1920 × 1080 resolutions as the experimental device. The distance between the subject’s eye and the screen was not strictly prescribed as the 4C eye tracker can capture the human face and trace the eye ball model.
Before the experiment, the subjects were clearly informed that the experiment would store their eye tracking data for further analysis in this work. To start the experiment, we first asked the subject to sit in front of the prepared screen with the eye tracker placed in front of them. The calibration process was conducted for each subject to adjust the eye tracker. Only once the subject passed the calibration, was he or she allowed to go on with the next step. Secondly, after calibration, we asked the subject to read the Microsoft Sway slide decks in order, while we recorded the eye movement behaviors with the CPR system. To prevent the influence of the reading order on the evaluation scores, the reading order of the slide decks for each participant were randomly arranged. Thirdly, after reading each slide deck, the participant was asked to evaluate each work by filling in the assessment questionnaire that is defined in
Section 3.4. The steps mentioned above, including calibration and reading and filling in the questionnaire were repeated until all of the six slide decks were successfully evaluated. In general, it took 10 minutes for each participant to evaluate one slide deck. Finally, the experimental instructor checked the completion of the entire process. If the data were correctly recorded into the database, we exported them as a csv file for further analysis.
3.5. Measured Values
We mainly focused on the information provided by GazeFixation.csv, URLEvent.csv, and Rawdata.csv, which were generated by the CPR system. The GazeFixation file recorded the information regarding the fixations, including the fixation ID, subject name, start time of the fixation in milliseconds, duration in milliseconds, the position information in pixels, and the scrollTop property information in pixels. The threshold for a meaningful fixation in this work was defined as 250 ms. The URLEvent file shows the information of the webpages visited by the subjects, including the URL ID, subject name, webpage URL, keyword, and visiting time. Combining GazeFixation with the webpage viewing information provided by the URLEvent file, we were able to analyze the eye behaviors for each slide deck. Furthermore, after separating the slide decks into several single pages, we were able to analyze the eye movement deeply on each certain page. The Rawdata file recorded the raw mouse and eye movement data, including the event ID, the subject name, the event time, the mouse position information, the gaze point position information, and the scrollTop information. It provided the information of single movements of the subject’s eye and mouse, allowing us to study any single event during the reading process.
We used the following common eye tracking metrics to represent a slide deck or a page: duration, fixation duration (FD), fixation number (FN), and fixation average duration (FAD). The duration referred to the total duration of viewing on this slide deck. FD referred to the total duration of fixations on this slide deck. A longer duration or a longer fixation duration indicated that the participant considered that information to be important and was more focused on it [
29]. FN referred to the total fixation count in the slide deck. A higher fixation count indicated more devoted attention was given to this stimulus [
30]. FAD was the mean value of the durations of fixations on this slide deck. A longer FAD indicated more overall effort spent by a participant during the experiment [
31].
In addition, we proposed several new fixation-based metrics to measure the depth of student’s processing in the content, including the fixation time percentage (FTP), mean time to fixation (MTTF), and spatial diversity of fixations (SDOF). In the experiment, we researched whether the proposed metrics were stable or not in measuring the subjects’ interest while viewing the slide decks. FTP was defined as the percentage of total fixation duration takes of total viewing duration on the page/slide deck, can be obtained by using the following equation:
FTP is a metric that focuses on the proportion of all fixation duration in the total viewing duration of a stimulus. It is different from the metrics, such as the Ratio of Fixation Time and ROAFT [
32], which focus on the ratio of fixation duration of an AOI in all fixation duration of the stimulus. FTP is supposed to explain the subject’s attention while viewing the slide decks. In general, a larger fixation time percentage means that the participant pays more attention and is more concerned with the content of the slide deck. MTTF is the average time until the next fixation is generated, which is calculated as the following equation:
While studying deeply page by page, we find that the characteristics of each page, such as the density and layout of the visual elements on the screen, have a strong connection with the MTTF. For example, a page with a small font size and high density of text leads to a small MTTF. On the other hand, a page that is text-only with a large font size and low density, leads to a large MTTF. Therefore, in the experiment we also attempted to determine whether MTTF was a representative metric or not to measure the subjects’ interest while viewing slide decks. The SDOF is the average distance (pixel) to the mean point of all fixation points. It was utilized to evaluate the balance of visual elements on the screen, usually referred to as the mean point. The SDOF is described with the following equation:
We bring two definitions of the mean point here. One is the average position of all fixation points, the other is the average point weighted by the duration of fixation. In general, these two mean fixation points were located at almost the same location. However, in some special cases, the two mean fixation points did not fall in the same position, such as a small-type page where participants gave an extremely short viewing duration. We speculated that during the viewing process, the subjects made fixations on searching for the “next page” button that is on the bottom right corner of the screen. Due to the lack of content, the page was not attractive enough for the subjects. The existence of the difference between the two versions of mean point demonstrated the inconsistency of the fixation durations. Being aware of that, we conducted a filter to clear the unnecessary fixation points in the research. We will discuss the filtering method in
Section 3.6. As the two different definitions usually cause the same results, we simply use “average fixation point” here. In the experiment, we also attempted to determine whether the SDOF could represent the degree of dispersion of the fixations, or whether a lower SDOF indicated a higher attraction of the page.