Factors affecting browsing duration on a health discussion forum: analysis of eye-tracking data
Wenjing Pian, Christopher S.G. Khoo, Gang Li and Jianxing Chi
Introduction.User-contributed information on health-related social media sites contains useful user-experience information that complements information on authoritative health information websites. Though more people are searching and browsing social media sites for health information, there are few studies of user browsing behaviour on these sites, and the factors that affect the behaviour. This study investigated the factors affecting the users’ skimming and examining durations when browsing a health discussion forum.
Methods. An eye-tracker system was used to record users’ eye fixations and eye movements when browsing a health discussion forum, HealthBoards.com.
Analysis. Stepwise multiple linear regression was used to develop models to predict users’ skimming and examining durations.
Results. The type of health information need context (i.e. browsing for own health issue, browsing for someone else’s health issue, and browsing with no particular health issue in mind), the perceived urgency of the health information need, and the length of the health information text were found to be significant predictors of the examining and skimming durations. Demographic factors of gender and age were not significant.
Conclusion Future studies of online health information seeking should take these three factors into consideration, and distinguish between browsing two types of screens—the summary screen of document surrogates and the detailed content screen.
People are increasingly searching for and browsing health information on websites and social media platforms (Moreland, French and Cumming, 2015; Fox and Duggan, 2013; Song et al., 2015). User-contributed health information on social media sites offer user-experience information that complements information on authoritative health information sites (Chew and Khoo, 2016).
Different social media platforms have different functional characteristics, which may result in different information browsing behaviours. There have been several studies of health information behaviour on Facebook public groups (e.g., Scott and Hand, 2016). However, there have been few studies of information seeking and browsing behaviour on health discussion forums. It is important to understand how people browse health topics in these forums as they have large numbers of registered users. The health discussion forum used in this study, HealthBoards.com (http://www.healthboards.com/boards/index.php), has 1.1 million registered users.
A typical health discussion forum has a two-stage interface structure. Stage 1 displays a summary page of post surrogates (mainly post titles) on a particular topic or retrieved by a search query. The user browses the page of post surrogates and selects (clicks on) the post surrogate that seems relevant or useful. The system then displays a Stage 2 page of the detailed post content, together with comments from other users. Within each stage, two kinds of browsing behaviour can be distinguished: skimming quickly over the text, and examining the text closely. These are reflected in two kinds of eye movements: saccade or quick eye movement, and eye fixation. The two-stage structure of user interaction with the discussion forum combined with the two types of browsing behaviour (skimming and examining) is used to structure the analyses in this study.
This study investigated the factors affecting users’ skimming and examining durations when browsing the summary pages of post surrogates and detailed post-content pages on a popular health discussion forum. In particular the following factors were investigated:
- The user’s health information need context: whether the user is seeking information for his or her own health issue, seeking information for someone else’s (i.e. family member of friend’s) health issue, or browsing without any particular health issue in mind;
- Perceived urgency of the health information need;
- Length of the information content (number of words)
- Demographic factors of gender and age.
Previous studies have identified several factors that affect user online seeking behaviour (Teo, 2001; Bidmon and Terlutter, 2015; Scott and Hand, 2016; Pian, Khoo and Chang, 2016), including gender, age, types of health information need context, and urgency of health information need. However, the relative importance of these factors for different types of browsing behaviour, types of health information site, and types of display screens is not known.
The study was carried on the campus of the Nanyang Technological University, Singapore. Eighty people were recruited in the study. Seven participants failed the eye-tracker calibration test due to their thick glasses. Of the 73 participants, 60 were university students, faculty members, and administrative staff; the remaining 13 were Singapore residents. None of the participants was a health professional, and none had a critical or terminal health issue (e.g., cancer or HIV). They were competent in English and instructions, survey questions and interview were all carried out in English. The study was approved by the university’s Institutional Review Board.
The participants were asked to browse a health discussion forum (HealthBoards.com) to find relevant or useful information related to a health issue of their choice. If a participant could not think of a health issue, they were assigned to the category of browsing with no particular issue in mind. Their browsing behaviours were recorded using a Tobii T-series eye tracker system, and the eye fixation and eye movement data were exported to a Microsoft Excel file for processing and analysis. The participants were interviewed to collect demographic information, a health issue to seek information for in the session, and the perceived urgency of the health information need.
Stepwise multiple linear regression was used to develop predictive models for the durations of examining and skimming, based on the independent variables of type of health information need context, level of urgency of information need, gender, age, and length of the health information text. The unit of analysis in this study was the individual Web pages, which may be a summary screen displaying a list of post surrogates or a detailed post content screen (including user comments). As each research participant browsed several summary and detailed post pages, there were totally 814 data units in the analyses.
Two dummy variables were created to represent the three types of health information need context (browsing for self: 1, 0; browsing for others: 0, 1; and browsing with no particular issue: 0, 0). A 7-point Likert scale was used to represent the perceived urgency of information need from extremely urgent (7) to not urgent at all (1). The length of the information text on the Web page was measured as the number of words and divided by 100, to obtain values in single digits.
As each participant browsed several summary pages and detailed post pages, the duration of examining for each page was calculated by accumulating individual fixation durations on the page (Scott and Hand, 2016; Wang, Tsai and Tsai, 2016; Desjarlais, 2013), and the duration of skimming was calculated by accumulating the time gaps between fixations (Pian, Khoo and Chang, 2016; Zhang, 2012).
The regression models developed for predicting the examining and skimming durations were found to be good in accounting for the variances in the dependent variables (see Table 1). The model fitness adjusted R squares range between 0.39 and 0.52.
For the stage of browsing the detailed post screen, three factors—type of information need context, urgency of information need and length of information text—were found to be significant predictors of the examining duration (α=0.05). In general, the examining duration was highest for users seeking for their own health information need, lower when seeking for other’s health information need, and lowest when browsing with no particular health issue in mind. Post hoc comparisons using Tukey’s HSD found significant differences among these three types of health information need context.
In addition, the interaction variables, Seeking_for_self*Length_of_information_text and Seeking_for_others*Length_of_information_text were significant. This indicates that longer post content pages increase the examining durations of users searching for self or others, more than for users browsing with no particular issue in mind. Table 2 shows the average examining durations for the different combinations of information need context and length of information text.
As for the skimming duration, again all the factors were significant at the 0.05 level, but the coefficients for Seeking_for_self and Seeking_for_others were negative, indicating that users with no particular health issue in mind had a longer skimming duration.
The length of information text was found to affect the examining duration significantly. The demographic factors of gender and age were not found to be significant in predicting browsing duration.
For the stage of browsing the summary screen of post surrogates, type of health information need context and urgency of information need were significant predictors of examining and skimming durations, but not the information length. Interestingly, for the examining duration, Seeking_for_self had a negative coefficient, indicating that users browsing for own health issue spent less time examining post surrogates than users browsing on others’ behalf or with no particular issue in mind. Users browsing for other’s health issue had the highest examining duration. Gender and age were again not significant. As for the skimming duration, users searching with a conscious health issue (either for self or for others) had a shorter skimming duration than users browsing with no issue in mind.
|Examining duration for detailed post content||Skimming duration for detailed post content||Examining duration for post surrogates|
|Variable||Model adj R2: 0.388 |
|Model adj R2: 0.434|
|Model adj R2:0.524
|Seeking for self||24.001||.027||-5.817||.002||-1.007||.000|
|Seeking for others||12.432||.000||-7.061||.041||2.294||.012|
|Urgency of health information need||1.005||.021||.502||.047||.584||.014|
|Length of information text||.391||.031||.053||.120||.141||.562|
|Seeking for self|
* Length of information text
|Seeking for others * Length of information text||1.546||.019|
|Average examining duration (in seconds)||Type of information need context|
|Seeking for own health issue||Seeking for other’s health issue||No particular health issue in mind|
|Length of health information text (words)||Long (>500)||65.2||54.3||27.0|
This study found that three factors—type of health information need context (i.e. seeking for self, seeking for others, and seeking with no particular health issue in mind), the perceived urgency of the health information need, and the length of the health information text—are significant predictors of the durations of examining and skimming behaviour on a health discussion forum. The type of health information need context has the strongest influence on examining and skimming durations.
Users browsing for own health issue tend to spend less time browsing the summary screen of post surrogates (relative to the other two groups of users). They are more decisive in making predictive evaluations about the likely relevance of a post, based on the post surrogate. They spend much more time reading the post content closely, and less time skimming.
Users browsing for other’s health issue tend to spend the most time examining the summary screen of post surrogates, trying to decide which post is likely to be useful. They spend more time examining the post content than users with no particular issue in mind, but less time than users browsing for own health issue. Users browsing with no particular health issue in mind engage in a lot of skimming. They tend to spend the most time skimming the post surrogates and post content, relative to the other two groups of users.
Longer post content increase the examining duration significantly, but not the skimming duration. The length of the post content interacts with the type of information need context: longer post content has a bigger impact on the examining duration for users seeking for self and others than for users browsing with no issue in mind. However, the length of the summary page of post surrogates does not have a significant impact on examining and skimming duration, probably because the number of post surrogates displayed was fixed for each webpage.
As expected, perceived urgency of health information need increases examining and skimming duration for both the summary screen of post surrogates and detailed post screen. Gender and age were not significant. Previous studies have found demographic factors to have a relationship with people’s online health information behaviour (e.g., Bidmon and Terlutter, 2015; Cullen, 2002; Teo, 2001). In the current study, browsing durations were recorded exactly using an eye-tracker machine, whereas previous studies used self-reported data from questionnaires or interviews (e.g., Teo, 2001; Fox and Duggan, 2013). Also, previous studies made use of cumulative time spent over a period of time, such as one week or one month. This study focused on the average browsing durations for individual webpages.
The results of this study indicate that future studies of health information seeking should take into consideration users’ health information need context (whether seeking health information for self, for others, or with no particular issue in mind), and distinguish between browsing detailed health information content or summary screens of search results.
This research was supported in part by National Natural Science Foundation of China (No. 71790612) and Social Science Foundation of Fujian Province of China (No. FJ2017C011). The authors would like to thank the anonymous reviewers for their comments and suggestions and the editors for their efforts to publish this paper.
About the authors
Wenjing Pian (first author and corresponding author) is an Assistant Professor at School of Economics and Management at Fuzhou University (address: No.2 Xueyuan Road, University Town, Minhou County, Fuzhou City, Fujian Province, China, 350116) and Distinguished Scholar at Mengchao Hepatobiliary Hospital of Fujian Medical University (address: No.312 Xihong Road, Gulou District, Fuzhou City, Fujian Province, China, 350025). He received his PhD degree at Nanyang Technological University, Singapore. His research focuses on consumer health information seeking, health/medical informatics, and machine learning in clinical decisions. He can be reached at firstname.lastname@example.org .
Christopher S.G. Khoo is an Associate Professor at Wee Kim Wee School of Communication and Information at Nanyang Technological University (address: WKW SCI building, Nanyang Link, Singapore, 637718). He obtained his PhD at Syracuse University in 1997, his MSc in library and information science at the University of Illinois, Urbana-Champaign in 1987 and a BA from Harvard University. His main research interests are in knowledge organization, ontologies, automatic sentiment categorization, human categorization behaviour, natural language processing, information extraction, multidocument summarization, and clinical decision support systems. He can be reached at ASSGKHOO@ntu.edu.sg.
Gang Li is a Professor and the Director in the Centre for Studies of Information Resources at Wuhan University (address: 299 Bayi Street, Wuchang District, Wuhan City, Hubei Province, China, 430072), awarded Changjiang Professor in 2016. He is now researching on big data integration and analysis focusing on multiple aspects of urban information, such as geographical, cultural and scientific data. He has completed and is working on multiple related projects. He can be reached at email@example.com.
Jianxing Chi (corresponding author) is a Lecturer at College of Communication at Fujian Normal University (address: Fujian Normal University Qishan Campus, University Town, Fuzhou City, 350117 ). She received her Master’s degree from London School of Economics. She does research in quantitative and qualitative social research and health communication and social media. Her research interest relates to health communication, new media and communication technology. She can be reached at firstname.lastname@example.org.
- Bidmon, S. & Terlutter, R. (2015). Gender differences in searching for health information on the internet and the virtual patient-physician relationship in Germany: exploratory results on how men and women differ and why. Journal of Medical Internet Research, 17(6). Retrieved from http://www.jmir.org/2015/6/e156/ (Archived by WebCite® at http://www.webcitation.org/70AlFw65X)
- Chew, S.H. & Khoo, C.S.G. (2016). Comparison of drug information on consumer drug review sites versus authoritative health information websites. Journal of the American Society for Information Science and Technology, 67(2), 333-349.
- Cullen, R.J. (2002). In search of evidence: family practitioners' use of the Internet for clinical information. Journal-Medical Library Association, 90(4), 370-379.
- Desjarlais, M. (2013). Internet exploration behaviours and recovery from unsuccessful actions differ between learners with high and low levels of attention. Computers in Human Behaviour, 29(3), 694-705.
- Fox, S. & Duggan, M. (2013). Health online 2013. Pew Research Center. Retrieved from http://www.pewinternet.org/2013/01/15/health-online-2013/ (Archived by WebCite® at http://www.webcitation.org/70AliuZux)
- Moreland, J., French, T.L. & Cumming, G.P. (2015). The prevalence of online health information seeking among patients in Scotland: a cross-sectional exploratory study. JMIR Research Protocols, 4(3). Retrieved from http://www.researchprotocols.org/2015/3/e85/ (Archived by WebCite® at http://www.webcitation.org/70AmKDgST)
- Pian, W., Khoo, C.S. & Chang, Y-K. (2016). The criteria people use in relevance decisions on health information: an analysis of user eye movements when browsing a health discussion forum. Journal of Medical Internet Research, 18(6), 1-18. Retrieved from http://www.jmir.org/2016/6/e136/ (Archived by WebCite®at http://www.webcitation.org/70AlZW6Td)
- Scott, G.G. & Hand, C.J. (2016). Motivation determines Facebook viewing strategy: an eye movement analysis. Computers in Human Behavior, 56, 267-280.
- Song, H., Omori, K., Kim, J., Tenzek, K.E., Hawkins, J.M., Lin, W.Y., . . . Jung, J.Y. (2016). Trusting social media as a source of health information: online surveys comparing the United States, Korea, and Hong Kong. Journal of Medical Internet Research, 18(3). Retrieved from http://www.jmir.org/2016/3/e25/ (Archived by WebCite® at http://www.webcitation.org/70Am4qqF5
- Teo, T.S.H. (2001). Demographic and motivation variables associated with Internet usage activities. Internet Reserch: Electronic Networking Applications and Policy, 11(2), 125-137.
- Wang, C.Y., Tsai, M.J. & Tsai, C.C. (2016). Multimedia recipe reading: predicting learning outcomes and diagnosing cooking interest using eye-tracking measures. Computers in Human Behaviour, 62, 9-18.
- Zhang, Y. (2012). Consumer health information searching process in real life settings. Proceedings of the Annual Meeting of the American Society for Information Science and Technology, 49(1), 1-10.