header
published quarterly by the university of borås, sweden

vol. 25 no. 2, June, 2020



Exploring temporal variations of depression-related health information behaviour in a discussion forum: the case of Suomi24


Jonas Tana, Emil Eirola, and Kristina Eriksson-Backa


Introduction. This study presents temporal variations of depression-related health information behaviour in a discussion forum. The dimension of time has previously been overlooked within health information behaviour research, and information science in general. Adding this dimension, we propose, can reveal interesting insights.
Method. The open data of the discussion forum of Suomi24, the largest and most popular discussion forum in Finland, was utilized to analyse the time stamps of all messages in the depression subcategory.
Analysis. All 116,781 messages within the depression sub-category, ranging from 1.1.2001 to 14.5.2015, were processed in JSON format, and analysed using Python. A quantitative analysis and comparison to all other messages, as well as messages in the health category was conducted.
Results. Messages posted in the depression subcategory of the Suomi24 discussion forum follow clear and recurring temporal patterns. Significant peaks are visible on different timescales, during the night time, on Monday and Thursday, as well as in spring and autumn. These results are compared to previous research on the dimension of time in relation to depression as well as information science.
Conclusion. Adding the dimension of time can reveal interesting new insights on health information behaviour, especially for stigmatised health issues like depression. Knowledge of the temporal nature of depression-related behaviour can aid early intervention, which can be beneficial for positive health outcomes.

Introduction

Modern healthcare is reliant on self-care and the role of the individual in the management of illness and health problems. Research suggests that between seventy and ninety per cent of all health problems are solved or managed without the involvement of healthcare professionals (Oliphant, 2010; Siepmann, 2008). A majority of people also try to treat or diagnose themselves before seeking medical advice from professionals (Oliphant, 2010; Siepmann, 2008). As more and more people take greater responsibility for maintaining their health, laypersons are becoming expert patients with their own health projects. Health information behaviour, or how people seek, obtain, evaluate, categorise and use information related to their health is essential in maintaining good health (Ek, 2013).

The Internet has become a convenient source, providing an endless amount of information on health issues to an increasing number of consumers (Hausner et al., 2008; Kim and Oh, 2011; Lee et al., 2014; Wikgren, 2002). In Britain, 69% of the British public searched the Internet for health information in 2013 (Dutton et al., 2013), while 64% of Finns aged 16–89 years used the Internet to find health-related information in 2017 (Tilastokeskus, 2017). Younger Finns are more active users, whereas Finnish seniors prefer to rely on medical information and interpersonal communication with health personnel (Ek and Niemelä, 2010; Eriksson-Backa, 2008, 2013).

Social media sites in particular can offer users a platform for supported learning, networking with peers, family or friends, or sharing one’s problems, processes and outcomes with others in a global community (Pulman, 2009). One of the most important aspects of modern technology is that it allows individuals to access the information they need whenever they have questions or needs, not just during clinic hours (Chan et al., 2016; Gustafson et al., 1999). This is highly relevant as time, or the temporal aspects, variations, and traits that govern and regulate many aspects of life also affect many aspects of health (Ayers et al., 2014).

These health issues and threats, which might rise anytime during the day (diurnal), week or month (seasonal), have been suggested to trigger health information behaviour, especially information needs and information seeking (Lambert and Loiselle, 2007). When considered collectively, this health information behaviour reveals reoccurring trends and patterns governed by temporal variations, which would suggest that this behaviour can be triggered and influenced by time, or the variations in time. (Ayers et al., 2014). This dimension of health information behaviour can also provide valuable insights into patterns of symptoms and disease, especially for stigmatised or sensitive health topics, such as mental illnesses like depression.

For people living with depression, the Internet provides a platform for health information behaviour, and a considerable percentage of people interested in the issues of depression are seeking and sharing answers to information needs online (Hausner et al., 2008). Because of the complexity, stigma, and barriers to care that mental illness present, individuals have been shown more likely to seek information about these problems online (Ayers et al., 2014; Chan et al., 2016; Gustafson et al., 1999; Powell and Clarke, 2006).

Research also suggests that people prefer to interact and ask questions online, since members of online communities might be more knowledgeable about the condition, or have had a similar experience (Brady et al., 2016; Gustafson et al., 1999; Halder et al., 2017; Naslund et al., 2016; Walther and Boyd, 2002). This consists of everything from finding out about and treating symptoms of depression to peer-support in managing the illness, producing a complex information context of digital traces rich with temporal data. These digital traces have been defined as 'numerically produced correlations of disparate kinds of data that are generated by our practices in a media environment characterized by digitisation'. (Hepp et al., 2018, p. 440). These digital traces provide temporal clues of health information behaviour in relation to interests, causes, symptoms, advices, and cures for health issues (Ayers et al., 2014; Lambert and Loiselle, 2007). They can also be traced in near real-time with great accuracy in relation to temporal aspects, and can thus complement existing disease profiles, reveal novel insights, as well as help propose new hypotheses for health information behaviour (Ayers et al., 2014; Chew and Eysenbach, 2010; Eriksson-Backa et al., 2016; Signorini et al., 2011; Stoové and Pedrana 2014).

Despite a long-term interest in the nature of temporality, time as a dimension of information behaviour is still an under-explored topic in the research field (Savolainen, 2018; Solomon, 1997). Reasons for this may be the methodological barriers presented by traditional means of collecting data related to time, as time expressed in the information dynamics of social systems has been difficult to capture and comprehend, especially in the question of when an individual is engaged in information behaviour (Eysenbach, 2009; Solomon, 1997).

One solution to these barriers is the use of Web data, or digital traces. The utilisation of Web data can be especially useful for stigmatised diseases or other health phenomena where traditional data is insufficient or non-existent, and where individuals have been shown more likely to seek information about their problems online (Ayers et al., 2014; Eysenbach, 2009). For instance, depression-related health information seeking in Finland has shown clear and recurring patterns, ranging from diurnal to seasonal variations (Tana, 2018; Tana et al., 2018).

Similar results were found elsewhere, where studies on Twitter posts revealed diurnal and seasonal patterns in posts related to mood and depression (Chen et al., 2018; De Choudhury et al., 2013; Golder and Macy, 2011). Nimrod (2012) found an increase in the level of activity in the online depression communities during winter, from January to March. A deeper understanding of health information behaviour in relation to time could be used to help people faced with a health-related knowledge gap. This in turn can aid in early intervention, which can be beneficial for positive health outcomes (Lambert and Loiselle, 2007). The aim of this paper is to discern the temporal variations of depression-related discussions in the largest and most popular Finnish online discussion forum (Suomi24). In order to reach this aim, the following research question is answered:

Does depression related online health information behaviour within a large discussion forum show temporal variations on different time scales?

Depression

According to the World Health Organization (2017), depression is the leading cause of disability worldwide, and a major contributor to the overall global burden of disease. In Finland depression is a major public health issue and it is estimated that at least 5% of the adult population suffer from depression every year (Patana, 2014). Depression, which is considered both a symptom and an illness, differs from everyday mood changes and can become a serious health condition, especially when it is long-lasting with a moderate to severe intensity (World Health Organization, 2017). Depression has a broad spectrum, ranging from chronic to episodic depressions with mild to severe, as well as typical to atypical symptoms.

The WHO (Marcus et al., 2012, p. 6) defines depression as follows:

Depression is a common mental disorder, characterized by sadness, loss of interest or pleasure, feelings of guilt or low self-worth, disturbed sleep or appetite, feelings of tiredness and poor concentration. Moreover, depression often comes with symptoms of anxiety. These problems can become chronic or recurrent and lead to substantial impairments in an individual’s ability to take care of his or her everyday responsibilities.

Depression is also characterized by different temporal variations: diurnal, weekly and seasonal. Diurnal variations of depression and symptoms of depression are associated with early-morning or evening worsening, or afternoon slumps (Wirz-Justice, 2008). Seasonality has also been associated with levels of both anxiety and depression, with peaks varying from autumn to mid-winter to early spring (Christensen and Dowrick, 1983; Oyane et al., 2008; Stordal et al., 2008). However, both diurnal as well as seasonal variations have been shown to be somewhat irregular, and the presence and direction of these variations are decidedly inconclusive over time (Hasler, 2013; Powell and Clarke, 2006; Wirz-Justice, 2008). The difficulties in drawing conclusions on temporal variations of depression have been suggested to because of the limitations in methodology for tracking and monitoring depression and seasonality in populations (Ayers et al., 2013; Harmatz et al., 2000). One potential method for observing temporal variations of depression is monitoring the Internet. Social media platforms have previously been suggested to be an effective disease surveillance tool (Chew and Eysenbach, 2010; Signorini et al., 2011; Stoové and Pedrana 2014), also when it comes to depression (Chen et al., 2018; De Choudhury et al., 2013; Fatima et al., 2018; Islam et al., 2018)

Depression can be treated effectively, however, fewer than half of those affected receive treatment. One of the barriers to treatment is stigma. It has been suggested that many people are reluctant to seek treatment because they do not want to be labelled as mental patients and want to avoid the stigma connected with this (Aromaa et al., 2011). This has also been suggested as the reason why a majority of individuals either suffering from depression are more likely to seek and share answers to information needs online (Ayers et al., 2014; Chan et al., 2016; Gustafson et al., 1999; Hausner et al, 2008; Powell and Clarke, 2006). Therefore, knowledge about online health information behaviour related to depression has the potential to present new meaningful temporal insights into depression.

Aggregating and analysing digital traces of online health information behaviour can be seen as infodemiological research. Infodemiology is defined as ‘the science of distribution of and determinants of information in an electronic medium, specifically the Internet, or in a population with the ultimate aim to inform public health and public policy’ (Eysenbach 2011, p. 155). Infodemiology is rooted in the idea that there is a relationship between population health and information and communication patterns on the Internet.

This kind of research is promising, as several studies have demonstrated that valid trends and patterns reflecting the entire population, and even subsets of it, can be extracted from online data (Chew and Eysenbach, 2010; Nuti et al., 2014; Signorini et al., 2011; Stoové and Pedrana, 2014). It can also help health professionals and other officials to understand broader patterns of mental illness (Ayers et al., 2014).

Furthermore, Finland is an ideal country to study changes in temporal variations for depression-related behaviour because of its northern location that causes major variations in the number of hours of daylight and the temperature during all four seasons (cf. Yang et al., 2010). However, there are still rather homogeneous weather parameters throughout the whole country. In addition to this, Finland was an early adopter of social media in all age categories, and of information and communication technology, and has a high availability of Internet services. Previous research (Nimrod, 2013) has also identified the need for research on non-English-language online depression communities in order to examine cultural variations and their impact on the manifestation of depression. This study can contribute to this by providing data on the temporal variations of depression related online health information behaviour.

Internet discussion forums

Internet discussion forums can be labelled as social media, which is defined by Merriam-Webster (Social media, n.d.) as:

information
forms of electronic communication (such as Websites for social networking and microblogging) through which users create online communities to share information, ideas, personal messages, and other content (such as videos)'.

A similar definition is given by Carr and Heyes (2015, p. 52) who define social media as

Internet-based channels that allow users to opportunistically interact and selectively self-present, either in real-time or asynchronously, with both broad and narrow audiences who derive value from user-generated content and the perception of interaction with others.

Internet discussion forums date back to the early Internet newsgroups and bulletin boards but are in no way obsolete today. Online discussion forums, more specifically, can be defined as online settings in which people exchange information, opinions and support on different topics, ranging from general to specific. Savolainen (2011) also states that they can be conceptualised as virtual information grounds that serve the end result of information sharing. Forums that are health-related are no exception. In health-related forums people who are in a similar life situations seek and exchange factual information and provide emotional support to help each other (Savolainen, 2011).

Internet discussion forums usually consist of different categories, topics and subtopics, containing threads (a thread is a collection of messages like original posts, replies and comments to replies. These user-submitted messages contain the user's details and the date and time it was posted (Internet Forum, 2018). A posted question often results in multiple responses (Savolainen, 2011). There has been an impressive increase in usage and acceptance of both health-related social media and online forums, as a massive number of people are searching for information about their health problems and posting their symptoms online (Nasralah et al., 2018; Sinha et al., 2018). Finnish statistics show that in 2014, 24% of the population aged 16–89 had written in an online forum, and that this was slightly more common among men (28%) than women (20%).

Writing in discussion forums is clearly more common for the younger demographic, as over 40% within the age category 16–34, and 35% in the age category 35–44, had written messages in an online forum in 2014 (Tilastokeskus, 2017). In the UK this seems to be just as common, as 33% posted on a message board that year (Dutton et al., 2013). These numbers are somewhat in line with the usual categorisation of online forum visitors, usually divided into active users (visitors who post on forums) and lurkers (visitors who only read forum messages and do not post). It has been estimated that lurkers make up the majority, between 45% and 90% of forum visitors (Moore et al., 2017).

Discussion forums have been studied in various health-related contexts, ranging from chronic cough (Sinha et al., 2018) and health-related effects of electronic cigarettes (Hua et al., 2013) to the most common area, information on and peer support for mental health conditions (Ali et al., 2015; Johnsen et al., 2002; Kummervold et al., 2002; Nimrod 2013; Sinha et al., 2018). In fact, online forums have been shown to be one of the most frequently used internet data sources for researchers (Im and Chee, 2012)

Time as a concept in information science

Research in information science has throughout history had a strong emphasis on spatial issues, while time as a concept has been more or less overlooked. Within information science, the temporal context of information seeking, according to Savolainen (2006; 2018), has been approached in three different ways:

  1. time as a fundamental attribute of situation or context of information seeking,
  2. time as qualifier of access to information, and
  3. time as an indicator of the information-seeking process.

The first approaches the general time concepts surrounding life and situations therein. The second approach is treated as a constraint in the form of time pressure imposed on access to information sources. The last approach is to viewing time as a qualifier in the information seeking process (Chen and Rieh, 2009). Of these three approaches, the first one is relevant in this case (for a more comprehensive analysis on the concept of time within information science see Savolainen (2006 and 2018M.)). It touches, as Savolainen (2018) states, on Dervin’s sense-making approach, which suggests that reality is discontinuous, gap filled and changeable across time and space, and it is accessible only, and always incompletely, in context. These are specific moments in time-space, and in spatial and temporal confluences of people, settings, activities, and events. Ultimately, temporal factors become real as constitutive elements of contexts only through human practices (Savolainen 2018.). Dervin (1992) conceptualises individuals as “time-space-bound”, and individuals in their time and space need to make sense of the world.

Within sense-making, different kinds of time receive attention, and the view of time is fundamental to the theory as it allows for the possibility of conceptualising information behaviour as temporally patterned (Davies and McKenzie, 2002; Dervin, 1992). An individual identifies a need for information upon encountering a gap in his or her knowledge that prevents them from continuing the journey through time-space (Dervin, 1992). This gap in the individual’s knowledge can be filled by something that the individual calls information. However, information seeking is only one response to a gap, and sense-making can also include seeking reassurance, expressing or sharing feelings and connecting with others, in other words, information behaviour (Case, 2012). Savolainen (2018) states that time is an important contextual and situational factor of information seeking and information behaviour.

Drawing on this, time or the temporal context can also be the situational factor or context for information seeking, because when an individual seeks information is as important as what he or she seeks (Davies and McKenzie, 2002). Individuals are different at different moments in time-space, moving from past to present to future, as defined by the linear conception of time (Savolainen, 2018). Time can likewise be perceived as a cyclic phenomenon, from the perspective and recognition of repetition of behaviour, for instance behaviour related to health (Davies and McKenzie, 2002; Dervin, 1992). Savolainen (2018) points out, that if looking at a particular season or day, one could argue that they are not cyclical because they do not reoccur in a similar fashion. However, especially within health information behaviour, reoccurring trends and patterns within these temporal units (varying from hours to days to months or seasons) can occur when considered collectively, suggesting that time, or temporal traits, also influences and triggers health information seeking and needs (Ayers et al., 2014).

Time is, as previously stated, an important contextual and situational factor and regarded as highly important within information science (Savolainen, 2006; 2018). Dervin (2005) emphasizes the dimension of time-space as fundamental in research on information behaviour. Even though many information behaviour researchers have included time or temporal factors in their research, only a few have explicitly studied time (time of day, day of the week, seasons) as a context or dimension of everyday life (Chen and Rieh, 2009). All aspects of time (and space) should therefore be taken into account within research, given that information behaviour is not static or immobile. This could potentially lead to the discovery of significant contextual dimensions within health information behaviour (Chen and Rieh, 2009; Davies and McKenzie, 2002; Savolainen, 2018). By utilising digital traces to analyse temporal trends and patterns of health information seeking and needs, we begin to understand not only how, but also when, people make sense of their world or health (Solomon, 1997). And as Wilson (1999) states, one of the strengths of Dervin’s sense-making approach lies partly in its methodological consequences, as it can lead to a way of questioning that can reveal the nature of a problematic situation, in this case, the situation in time. Wilson notes that applying such questioning can lead to genuine insights that can influence information service design and delivery.

Social media provides a novel source for studying the dimension and context of time for health information behaviour. Health-related discussions in relation to time in online forums have been studied to some extent. Halder, Poddar and Kan (2017) examined temporality in relation to mental health and online forums. They studied the problem of predicting a patient’s future emotional status based on past forum posts and provided a framework meant to track a user’s linguistic changes over time. The use of negative or positive words could reveal the deterioration or improvement of emotional status over time. Paul, et al. (2016), on the other hand, studied demographic and temporal trends connected with use of drugs that could be found in an online forum. They compared these trends against data from a large epidemiological survey. Nimrod (2013) analysed some key trends in English-based online depression communities in terms of level of activity in relation to seasonal changes. To the best of our knowledge, no previous studies have examined the dimensions of time in relation to health information behaviour in online mental health discussion forums in the same detailed manner as this study.

Data and methods

Suomi 24

Suomi24 is the largest, and most popular, free and anonymous discussion forum in Finland. The site had 832,000 unique visitors per week in 2015, each of whom in average spent seven minutes on the forum in one session. The total amount of page loads a week in 2015 was over seventeen million (Aller, 2016) and during the first half of 2018, Suomi24 still had a monthly reach of over two million Internet users, making it the eighth most popular Finnish Internet site (FIAM, 2018). Overall, there are over fifty-three million (53,366,704) messages posted in Suomi24 for the time period 1st January, 2001 to the 14th. May, 2015. The forum contains several different main topics such as hobbies, society, economy and health. The category of health, with two-and-a-half million messages, includes a number of subtopics like dieting, women’s and men’s health, plastic surgery, birth control, as well as mental health. The mental health subtopic, with 589,121 messages, is divided into fourteen sub-categories, of which depression is one. All main topics, categories, subtopics and sub-categories are provided by Suomi24 and its moderators, and forum users are not able to add new or edit existing categories. The depression sub-category consists of 116,781 messages, of which 15,386 are original posts, 57,682 are replies, and 43,713 are comments to replies. In general, health and its absence is one of the most common topics in Suomi24, making it a not only a unique social media data source, but also the biggest social media data set on health in Finland (Lagus et al., 2018). The whole Suomi24 discussion forum data set can be downloaded for research purposes through FinCLARIN’s Kielipankki Korp-interface. The corpus contains all the messages in the Suomi24 discussion forum from 1st January, 2001 to the 14th. May, 2015.

The messages analysed in this study have all been posted under the sub-category ‘Depression’. The messages consist of 15,386 original posts, 57,682 replies, and 43,13 comments to replies. The data consists of a total of 116,781 messages.

The data are available in JSON format, which allows for efficient processing of large data quantities through programming. Python parsing libraries were used to extract the timestamps and category information for every post. The Python Data Analysis Library 'pandas' provides the functionality to aggregate the raw data and calculate the statistics of interest. The timestamps were converted from UTC (Universal Time Coordinated) to local time in Finland to account for daylight savings time. To evaluate the statistical significance of observed differences in the distribution of posts in different categories, we used the statistical library in SciPy to conduct a chi-squared test of independence. As this results in a multiple hypothesis scenario, the p-values were corrected for multiple comparisons by the step down method using Sidak adjustments from the StatsModels library. The standard significance level of 0.05 was used for p-values after the multiple hypothesis correction.

The data analysis in this study does not include any identifying factors nor involve any intervention in the integrity of a person. Analysis only consists of time stamps for the messages, as this metadata reveals temporal variations and patterns without conducting content analysis of messages. The data itself can be seen as public data, as it is collected from the Internet, which is claimed as a freely accessible public domain. Moreover, the Suomi24 data do not belong in the private sphere of communication, and may therefore be utilized for research purposes (Savolainen, 2011). Thus no ethical concerns were present, and an institutional board review was not required.

Results

Analysing the timestamps of messages posted in the depression sub-category of the Suomi24 discussion forum shows that these messages broadly follow an hourly distribution similar to the messages in the forum as a whole. However, when compared, there are statistically significant differences in the distribution of depression-related messages, compared to the whole forum, as well as the health category. Due to the large amount of data, even small differences of ±1% are statistically significant, especially when comparing the health category to the entire forum. The comparison between the hourly distribution of all messages, the health category and the depression sub-category can be seen in Figure 1. Messages in the depression sub-category are clearly over-represented (up to +115%) during nighttime hours (between 00.00 and 05.00), and somewhat under-represented during morning and mid-day (between 07.00 and 12.00) (-30%). The health category features the same effect as the messages in the depression sub-category to a considerably weaker extent (+20% and -5%, respectively). Compared to all messages, as well as the messages in the health category, the relative number of posts in the depression sub-category clearly shows statistically significant peaks for night-time, as indicated in Figure 2.

Figure 1. Hourly distribution of all messages (green), messages in the health (=terveys) category (blue) and messages in the depression (=masennnus) subcategory (orange)
Figure 1: Hourly distribution of all messages (green), messages in the health (=terveys) category (blue) and messages in the depression (=masennnus) subcategory (orange)

Figure 2. Relative hourly distribution of messages in the health (=terveys) category (blue) and messages in the depression (=masennus) subcategory (orange) compared to all messages.
Figure 2: Relative hourly distribution of messages in the health (=terveys) category (blue) and messages in the depression (=masennus) subcategory (orange) compared to all messages. Filled diamonds denote cases where the difference of the relative share from the overall average is statistically significant, and empty diamonds where it is not significant.

As can be seen in Figure 3, this pattern in hourly distribution of messages reiterates during every day of the week, for all messages in the health category as well as the depression sub-category. However, higher peaks are visible in the beginning of the week on Monday, and then later in the week on Thursday, a phenomenon which is especially evident for the messages in the depression sub-category. A comparison of the relative number of messages in the health category and the depression sub-category compared to all messages clearly show that depression-related messages follow recurring patterns of higher peaks and lower troughs for all weekdays, as seen in Figure 4. Figure 4 also illustrates that peaks for messages in the depression sub-category are relatively higher during night time compared to the health category, a phenomenon that is recurring for all weekdays.

Figure 3. Hourly distribution for each day of week of all messages, messages in the health category and messages in the depression subcategory.
Figure 3: Hourly distribution for each day of week of all messages (green), messages in the health (=terveys) category (blue) and messages in the depression (=masennus) subcategory (orange).

Figure 4. Relative hourly distribution for each day of week of messages
Figure 4: Relative hourly distribution for each day of week of messages in the health (=terveys) category (blue) and messages in the depression (=masennus) subcategory (orange) compared to all messages. Filled diamonds denote cases where the difference of the relative share from the overall average is statistically significant, and empty diamonds where it is not significant.

When examined on a monthly level (Figure 5), the trend of messages posted in the depression sub-category reveals a bimodal pattern that follow seasonal variations, with clear peaks in spring and autumn. As seen in Figure 5, the peaks for messages in the depression sub-category in spring and autumn are significantly higher compared to all messages, as well as the messages in the health category. The peaks are most visible in March and May, and in September, November and December. Compared to all messages and the messages in the health category, even a smaller midsummer (middle of July) peak is visible. The trough in messages in the depression sub-category in the beginning of the year, mainly January and February, clearly differs from the January peak in all messages and the messages in the health category. A closer examination of the daily amount of all messages posted in the forum, health category and depression sub-category throughout the year shows a somewhat more irregular pattern for the messages in the depression sub-category (Figure 6). It also reveals clear peak days in the depression sub-category throughout the year. The highest seasonal peaks are distributed on the following dates: the 6–7th of March, 27–29th of April, 29–30th of August, 9th of September, 22nd and 27th of October, and 2nd and 3rd of November.

Figure 5. Triangular window rolling mean of the monthly distribution over a year of all messages
Figure 5: Triangular window rolling mean of the monthly distribution over a year of all messages (green), messages in the health (=terveys) category (blue) and messages in the depression (=masennus) subcategory (orange).

Figure 6. Daily distribution over a year of all messages
Figure 6: Daily distribution over a year of all messages (green), messages in the health (=terveys) category (blue) and messages in the depression (=masennus) subcategory (orange). Filled diamonds denote cases where the difference of the relative share from the overall average is statistically significant. For clarity, this is only shown for the depression category and when it is significant.

Examining the depression-related messages on a thread level also reveals that replies to threads are over-represented compared to top-level posts by up to +70% during morning hours (between 06 and 08, with a clear peak at 06) as indicated in Figure 7. No similar effect can be seen for the forum as a whole.

Figure 7: Relative hourly distribution of replies compared to top-level posts for all messages
Figure 7: Relative hourly distribution of replies compared to top-level posts for all messages (green), messages in the health (=terveys) category (blue) and messages in the depression (=masennus) subcategory (orange). Filled diamonds denote cases where the difference of the relative share from the overall average is statistically significant, and empty diamonds where it is not significant.

Discussion

This study provides novel insights into how depression-related health information behaviour can be studied in relation to time. The principal findings in this study are that messages in the depression sub-category of the Suomi24 discussion forum follow clear and recurring temporal patterns, on a diurnal, weekly and seasonal scale. In relation to diurnal variations, messages are more prevalent during the evening and night. There is also evidence that depression-related health information behaviour increases on Sunday and Monday, and decreases towards Friday and Saturday. There is, furthermore, a clear bimodal pattern that suggests that depression-related health information behaviour follows seasonal variations, with peaks in spring and autumn. There are also clear peak dates during the year. These findings could support and complement previous findings on both diurnal, weekly and seasonal variations of depression.

The clear rise in the number of messages posted during evening and night within the depression sub-category compared to all other messages within the Suomi24 discussion forum supports the notion of the evening-worse diurnal mood variation that Rusting and Larsen (1998) have identified. This type of diurnal mood variation is associated with milder depressive symptoms and atypical depression, and may characterise chronic dysthymia with neurotic-type symptoms of depression, including responsivity of mood, initial insomnia, self-pity, hypochondriasis, hopelessness and anxiety (Hasler, 2013; Rusting and Larsen 1998). This finding is also in line with the diurnal patterns found for depression-related health information seeking in search engines in Finland (Tana et al., 2018). The prevalence of night-time peaks in depression have also been identified in Twitter posts by Chen et al. (2018) and De Choudhury, Counts and Horvitz (2013). The peaks during night time are in coherence with how the WHO (Marcus et al., 2012) defines depression, with symptoms of disturbed sleep as well as the assumed negative effect that depression has on sleep quality and the causal relationship between insomnia and depression (Tsuno et al., 2005). The higher response rate during early-morning could also be an indicator for disturber sleep patterns.

On a weekly level, higher message activity during Sunday and Monday support previous research that states that Sundays and Mondays (Sunday neurosis and Blue Monday are days of lower mood or depression symptoms (Akay and Martinsson, 2009; Brådvik, 2002; Tumen and Zeydanli, 2014). One interpretation for both the diurnal and weekly findings, supported by Brådvik and Rusting and Larsen (1998), is that structured versus unstructured time may account for both diurnal and weekly variations in mood and depression. Sundays as well as evening time, are for most people lower in activity and less constrained by external demands, compared to days and hours filled with work. This may trigger inner tension and negative thinking, and lead to mood disorders (Brådvik, 2002; Rusting and Larsen, 1998). De Choudhury, Counts and Horvitz (2013) give a similar explanation when they state that individuals become more prone to worrying thoughts at night due to loneliness, break from work, lack of energy, or other interactions between light and darkness and the nervous system. A similar temporal analysis of depression-related discussion forum messages have not, to the knowledge of the authors, been conducted either in Finland or anywhere else in the world, and therefore these results cannot be compared to previous research results.

Patterns of seasonal variations on depression-related health information behaviour are also visible in the depression sub-category of the Suomi24 discussion forum. There are clear peaks during autumn and spring compared both to all messages in general and in the health category. This matches the findings of Ayers et al. (2013), Tana (2018) and Yang et al. (2010), who also found clear seasonal patterns for seeking depression-related information with the help of search engines. Compared to Nimrod (2012), who studied twenty-five depression related online communities, the findings in this study are somewhat deviant, as Nimrod identified slightly higher activity during January–March. However, Nimrod’s data were limited to one year only. The seasonal findings in the present study also conform with previous conclusions of seasonality in diagnosed depression, hospital admissions and suicide rates in the Scandinavian countries and the Northern Hemisphere that have shown peaks during spring and autumn (Christensen and Dowrick, 1983; Hakko et al., 1998; Stordal et al., 2008). These findings are however contrary to studies that have shown recurring seasonal peaks for depression during the midwinter months (Harmatz, 2000; Oyane et al., 2008). This contradiction could be based on methodological limitations, as these studies were based on self-report questionnaires and diagnostic interviews.

One of the strengths of this study is that it covers depression-related health information behaviour on an accurate and meticulous scale in relation to time over a period of almost fifteen years. More traditional epidemiological methods lack this kind of minute data-collection, and suffer from other data-related issues, from collection to processing. Earlier research on diurnal, weekly and seasonal variations have been based on retrospective assessment and self-rating, either during a clinical interview or in response to an item in a questionnaire (Hasler, 2013). In this respect, novel use of data sources, in this case discussion forums, presents a promising complementary, rapid and cost-effective approach to population and public health research. Compared to the expensive, time-consuming and difficult conventional techniques, such as surveys, this kind of infodemiological research can provide new insights into depression research.

Within information science, this kind of temporal analysis surpasses and answers the methodological barrier and gap to study the dimension and perspective of time for health information behaviour. This perspective, as Dervin (2005, p. 28) points out, delivers on the mandate to study time, space and movement, and allows for the possibility of conceptualising information seeking and use as temporally patterned. This again, allows for capturing the important question of when an individual is engaged in health information behaviour, or when in time-space these gaps in knowledge appear. Applying this kind of time analysis enhances the methodological aspect of the sense-making approach. It also allows researchers to study how the context of time affects the individual, or if there are repetitive or cyclical patterns and trends in knowledge gaps on a collective level, as suggested in this study.

Limitations and future research

One of the limitations of this study is that it does not analyse the textual content of messages posted in Suomi24 depression subcategory. Therefore it is not possible to know exactly why an individual has posted the message or if the individual in question is suffering, or has suffered, from depression. Hence, it is needed to be noted that the variables (messages) used in this paper does not present actual incidence. However, as earlier research on depression in Suomi24 has shown, it is reasonable to assume that the reason people seek and share depression-related information in the Suomi24 forum is because of either an information need or to share similar personal experiences about the symptom in question (Savolainen, 2011). Moreover, as this study only captures messages posted by active forum users, all online forum lurkers fall outside the scope of analysis. Most visitors to the Suomi24 online discussion forum arrive from Google searches (Lagus et al., 2016), therefore online lurkers’ behaviour could very well have been captured in previous studies on health information seeking behaviour based on Google data in Finland (Tana, 2018; Tana et al., 2018).

Furthermore, discussion forums do not offer demographic data, which limits the drawing of conclusions about larger population behaviour in general. However, as the Internet is by far the most popular channel for health information behaviour, and as more and more people are inclined to seek and share information about health related issues online, changes in health status are often reflected by changes in different patterns on the internet (Ayers et al., 2016; Eysenbach, 2011). Therefore, the analysis of Web data may reflect larger trends in populations. Further research could utilise methods of artificial intelligence, such as natural language processing, sentiment analysis or machine learning, to analyse the language or the emotional tone in messages to identify, measure and detect depression or depressive symptoms as expressed in text. This could assist in constructing early detection warning systems in social media services and social networking sites. One particularly interesting phenomena that should be examined in a more meticulous manner is the higher rate of replies for depression related messages. Applying a qualitative research approach on the replies could help shed some light on this anomaly. Moreover, utilising the unique Suomi24 dataset, information behaviour and the dimension of time could be studied for a myriad of phenomena, ranging from other health related issues to more general topics, such as hobbies or the economy.

Implications

The findings in this study could imply that Finnish people could be more in need of, and receptive to, support in relation to depression or depressive symptoms during these times. This is especially evident for the diurnal variation, where the evening-worse pattern (where symptoms worsen during the evening), is associated with milder depressive symptoms and where early intervention could be beneficial for positive health outcomes. However, all findings in this study could be utilized by public health officials to facilitate aid and optimise positive health outcomes by providing resources at the best time for intervention. This is a time when the subjects are focusing their attention on the health threat and direct their efforts to becoming more engaged and providing ways of managing depression-related complications. With these findings in mind, there should be strategies to ensure that those in need of help have an online pathway to evidence-based information and aid, such as the Finnish Mental Hub. Utilising Web data for research in different health related research areas should in no way be seen as a replacement of traditional epidemiological research, and should only function as a complement in methods. A major challenge for mental health issues is how to not only assess but also treat mental illness among individuals who do not present for treatment or cannot be reached by more traditional means. In these cases, the Internet could function as a less stigmatising, and cost-reducing, tool to help screen, support and even treat those who do not bring their problems to the attention of health professionals.

Conclusions

This paper is a novel attempt at utilising a large social media data set to investigate time and temporal variations, from hourly to weekly and seasonal patterns, in relation to depression-related health information behaviour. Adding this dimension of time to health information behaviour has implications for information, computer and health sciences, truly utilising the potential of multidisciplinary research. As time as a factor and the temporal nature of information behaviour is still an under-explored topic in information-behaviour research (Savolainen, 2018; Solomon, 1997), adding the dimension of time can entail interesting new insights into the study of health information behaviour, and information science in general. This type of data analysis could be developed further with the aim to discover how time and temporal traits affect more general health contemplations. It could also serve as a means to identify and discover interesting insights that can function as a base for further hypothesis testing. The aim of this study was to draw attention to the possibility of studying time to gain insights into temporal nature of health information behaviour in relation to depression by utilising data from the largest discussion forum in Finland. Studying online discussion forums can be beneficial, as it has been shown that peer-support in combination with therapy significantly reduces depressive symptoms (Griffiths et al., 2012). Insight into all depression-related behaviour could be useful to be able to provide the best possible support at the best possible time, and therefore, a broad understanding of health related behaviour in relation to depression and depressive symptoms is crucial. Despite its limitations, this preliminary analysis showed that analysing depression-related health messages in a discussion forum is informative, and that the benefits of this kind of an approach are various.

About the authors

Jonas Tana is a postdoctoral researcher and senior lecturer in nursing at Arcada University of Applied Sciences. He holds a bachelor’s degree from Arcada University of Applied Sciences and a master’s degree from University of Helsinki. His main research interests lie in online health information behaviour and e-health. He can be contacted at jonas.tana@arcada.fi
Emil Eirola is an artificial intelligence scientist at Silo.AI and is also working as Post-doctoral Research Scientist at the Department of Business Management and Analytics at the Arcada University of Applied Sciences in Helsinki, Finland. He received his M.Sc. degree in Mathematics (2009) and D.Sc. degree in Information Science (2014) from the Aalto University School of Science in Helsinki, Finland. Primary research interests include machine learning with incomplete data, feature selection, and mixture models, with applications to data in the fields of finance, security, and healthcare. He can be contacted at emil.eirola@arcada.fi
Kristina Eriksson-Backa is a University teacher and postdoctoral researcher in Information Studies at Åbo Akademi University, Turku, Finland. She is docent (adjunct professor) at the same university. Her main research interests are information about food and health in the media, health information behaviour, especially among older adults, understanding of health information, health communication, and e-health. She can be contacted at kristina.eriksson-backa@abo.fi

References

Note: A link from the title is to an open access document. A link from the DOI is to the publisher's page for the document.


How to cite this paper

Tana, J., Eirola, E., & Eriksson-Backa, K. (2020). Exploring temporal variations of depression-related health information behaviour in a discussion forum: the case of Suomi24. Information Research, 25(2), paper 854. http://InformationR.net/ir/25-2/paper854.html (Archived by the Internet Archive at https://bit.ly/2XdWEgp)

Check for citations, using Google Scholar