Open access publishing trend analysis: statistics beyond the perception
Elisabetta Poltronieri, Elena Bravo, Moreno Curti, Maurizio Ferri and Cristina Mancini
Introduction
Open access principles
Over the last decade, the open access option for strengthening the spread of scholarly literature is becoming progressively attractive for both publishers and authors in almost every field of science. Because of its innovative features and the evolution of the Internet in reshaping the publishing business, the open access model has become less alternative as regards market strategies of commercial publishers and more integrated with their policies of economic growth (Vitiello, 2013).
The latest 2014 issue of the quarterly-updated series illustrating key indicators of open access trend shows an increasing annual rates of both open access peer-reviewed scholarly journals and open access repositories (Morrison, 2014).
The basic and successful aspect of the open access model resides in the concept of immediate, and public availability online of peer-reviewed scholarly research, free of most copyright and licensing restrictions. There are two ways for authors to adhere to open access paradigm: the gold and the green road. The gold road is when authors publish in open access journals that make research free online for all to read. The green road is when authors make their papers openly available by self-archiving them in digital repositories compliant with the standards of web interoperability, developed by the Open Archives Initiative for long-term accessibility and preservation.
Rapid publication of research results, transparency in the peer-review process, openness engendering better reproducibility of papers, as showcases for potential collaborators and funders, are opportunities and benefits of open access that are not yet fully grasped by researchers.
One of the major concerns is the sustainability of the open access in comparison with the traditional publishing model and it is related to the costs of publishing (article publication charges). This implies that ‘many authors wonder about the advantages and disadvantages of open access journals for both themselves and their institutions’ (Broome, 2014, p. 69) because the desirable setting for the scholarly community should be based on the principles of achieving savings and increasing both efficiency and effectiveness of scientific communication.
As claimed by the recognised leading voice of open access movement, Peter Suber (2013), the open access paradigm is based primarily, on the assumption that research findings, resulting from public-funded studies, should be freely available online. Free access to findings of publicly-funded research has therefore become a moral imperative within the scientific community worldwide, able to stimulate the policy actions of governments, funding agencies, academic and research institutions to maximise the distribution of research papers.
Open-access policies, guidelines and legislation
As far as the biomedical field is concerned, a milestone initiative to improve open science was, in 2008, the National Institutes of Health Public Access Policy (US. Dept. of Health & Human Services, 2008) which mandates the deposit of all Institutes-funded, final peer-reviewed articles into the digital archive PubMed Central for immediate and permanent access.
In line with this perspective the Wellcome Trust released its Position statement in support of open and unrestricted access to published research (Wellcome Trust, n.d.) for its grant-funded research. The same key principle was reaffirmed by the recommendations of the Finch Report (Working Group…, 2012) accepted by the UK Government.
Implementing open access to research results has been fundamental also for the European Commission who in December 2013 released the Guidelines on open access to scientific publications and research data in Horizon 2020 (European Commission, 2013). These Guidelines, in line with the EU advocacy strategies for open access established within the European Commission's 7th Framework Programme, have been conceived to mandate open access to scientific publications resulting from research funded under Horizon 2020.
Pursuant to international perspectives, in a specific section of a recent law regarding the valorisation of culture, also the Italian legislation recognised the principle of open access stating that at least 50% of scientific publications derived from publicly financed studies must have free and immediate open access (Italy. Statutes, 2013). In this regard, research institutions are required to adopt policies intended to promote open access for scientific results and, as pointed out by Moscon (2013, p. 3): ‘The recognition by legislators on the importance of open access is a development of great significance, but it is not sufficient. The success of open access depends on a radical change of values and customs of the scientific community, as well as on necessary economic, organizational and educational investments’.
Open access publishing models
Open access is also attracting more and more attention from journal editors of specific disciplines, including biomedicine. In fact, several papers (Petersen, 2014; Kahn, 2014; Viale, 2013; Calder, 2012) have contributed to increased knowledge on the open access mechanism and advantages among researchers, opinion leaders and policy makers. In this framework the open access paradigm gained rapid success and new business perspectives have arisen for publishers, who began to offer open access options for subscription-based journals in their portfolio (hybrid open-access journals) (Poltronieri, et al., 2013). As a consequence of a growing number of journals with appealing features, authors are giving credit to open access journals that are increasingly competitive as compared to those tracked by Journal Citation Reports (hereafter, 'Reports') released by Thomson Reuters.
At the same time, the huge number of open access journals available, over 10,000 titles as indexed in the Directory of Open Access Journals (hereafter, 'Directory'), may make researchers feel confused because of a perceived low quality of open access papers, caused by aggressive market strategies of unscrupulous publishers (Broome, 2014). The research community continues to be alarmed by detrimental phenomena related to publishing practices by companies with little or no experience as academic publishers (so-called would-be or predatory publishers) that aim to make profits by publishing poorly-written papers (Pierce, 2014). In this respect, however, it is worth noting that initiatives are already in place to make authors aware of this phenomenon (Beall, n.d.).
Open access journals and the impact factor
The current debate over open access to scientific literature has also led to considering the impact factor of open access journals as a constant concern of the research community aiming to have its works cited. The main reason of the researchers’ impact factor-driven decision to publish in these journals closely relates to funds assigned and to career advancement (PLoS Medicine Editors, 2006). Thus, notwithstanding the boom in innovative metrics dedicated to measuring research impact (Barbaro, Gentili and Rebuffi, 2014; Mabile, et al., 2013), scientists maintain, in most cases, their choice of a journal provided with an impact factor. This reasoning followed by scholars worldwide, combined with the assumption of a large citation impact of open access papers, inspired studies concerning the relationship between open access journals and impact factor (Giglia, 2010). Despite the critical points raised by impact factor detractors (Seglen, 1997; Ramsden, 2009; Fooladi et al., 2013), the pressure exerted on authors by the logic of impact factor is gradually pushing open access journals towards consolidated standards of reputation.
The present work analyses the number of open access journals which have acquired impact factor in the years 2010-2012 and is intended to describe the trend of open access publishing, in order to give researchers appropriate information on the degree of reputation achieved by open access journals. All journals surveyed for the mentioned years were also examined with respect to pertaining subject categories to highlight the distribution of journals per content area.
As a case study, the journals in which the researchers affiliated with the Istituto Superiore di Sanità in Italy published in the same period (2010-2012) were considered, since this research institution represents a small-scale research community in the biomedical field.
Methods
Overview
The focus of the analysis was twofold. Firstly, to calculate the actual increase in open access journals as indexed in the Directory from 2002 to 2012, compared with the growth of journals indexed by Journal Citation Reports in the same years. Secondly, to determine the amount of open access journals with impact factor in the most recent years (2010-2012), also with respect to their content domain. Furthermore, the biomedical field was investigated by carrying out a case study on the literary production of Istituto Superiore di Sanità research staff for the same three years.
Tools for identifying journals: the Directory of Open Access Journals and Journal Citation Reports
The Directory of Open Access Journals and Journal Citation Reports are two well-known tools for the identification of world’s leading scholarly journals. In order to identify open access journals provided with impact factor, journals were selected if they appeared in both databases and these were used for the analysis carried out in this study.
The Directory is a database of peer-reviewed open access journals founded by the University of Lund on May 2003, which enables searches on over 10,000 journals and which is recognised as the authoritative source of information on open access journals. For each journal title, the Directory displays a basic profile providing information on subject, publisher, journal web site and possible article publication fees. Since early 2014, new criteria for journal inclusion have been adopted (Directory..., 2016). Thus, to be indexed in this database, publishers are now required to fill in a detailed application form providing information about their journals including content licensing, copyright permissions, transparency, quality of editorial process etc.
Journal Citation Reports is one of the databases included in the Web of Science, the citation indexing service maintained by Thomson Reuters. It is neutral in its journal selection criteria, thus including both subscription-based journals and open access journals. Journal Citation Reports is a recognised source of quantitative data based on the citation analysis of published articles. It is intended to monitor academic journals in the science and social science fields and to provide, on an annual basis and among other bibliometric values, the impact factor of each individual journal. Inclusion in Journal Citation Reports represents a parameter of impact for journals, which have to meet rigorous criteria to be entered in this database (Testa, 2012). Journal Citation Reports also allows users to sort and compare journal data pertaining to similar subject categories and to understand the journals' influence in the scholarly landscape. Journals are arranged in subject areas and may be assigned to more than one category in the case of titles covering different subjects. In this analysis the Reports structure of journal subject categories is adopted to identify the content domain pertaining to all journals surveyed.
Subject grouping and ranking of journals
In the present work, the large number of subject categories identified in
Journal citation reports have been grouped into eleven major disciplines obtained by matching the Reports detailed subject categories with Medical subject headings terms (US. National Library of Medicine, 2016), the controlled terminology produced by the National Library of Medicine and used for indexing papers included in PubMed. The class ‘medicine’, for instance, was adopted as an umbrella term encompassing those medical specialties such as allergy, anesthesiology, cardiac and cardiovascular system etc., which are specific subject categories in Journal citation reports. Then, the 159 the Reports' subject categories were grouped into the following eleven subject fields (disciplines) matching Medical subject headings terms: bBiological science disciplines, chemistry, earth sciences, health occupations, information science, natural science disciplines, mathematics, medicine, physics, technology-industry-agriculture, other (residual class covering history and philosophy of science, logic, medical ethics, and multidisciplinary sciences).
Journal citation reports sorts journals into quartiles to overcome the bias related to the direct comparison of impact factor scores of journals that are listed in diverse subject areas. Quartiles refer to a division into four equal percentiles of the journals listed in a given category. Thus, the indicators showing journal ranking in a discipline are represented by the quartiles Q1-Q4: from the highest position in Q1 to the lowest position in Q4. The Journal citation reports' quartile structure was considered for the purpose of the present analysis. In this regard, each journal indexed by the Directory and the Reports' databases has been examined with respect to both its subject categories and its quartile rankings. According to this category grouping, in some cases, the same journal is assigned to more than one subject category.
Survey of journals indexed by the Directory of open access journals and Journal citation reports
The analysis was started in late November 2013 by extrapolating the number of journals indexed each year from the Directory, starting from 2002 (the first year for which data was available in the database). At that time, the directory's interface allowed specific queries to be launched so the number of journals added each year could be easily obtained; this function has not been available since early 2014. Thus, the present analysis covered journals added to the Directory until the end of 2012.
The Directory journal list was then matched with the Reports list to highlight the trend of open access journals having gained impact factors in the time span considered (2010-2012) by this study. The same criteria were applied to process the publications produced by Istituto Superiore di Sanità research staff in the same years. The idea was to focus the analysis on a specific research field (biomedicine), to compare results with global data (journals of all disciplines included in both the Directory and Reports databases) so as to outline the trend of open access journals with impact factors in a defined subject area.
The literary production by Istituto Superiore di Sanità was then assumed as an interesting case study, as it represents the leading research institution in Italy in the field of public health.
Results
Journals indexed by the Directory of Open Access Journals and Journal citation reports from 2002 to 2012
Since its first collection of data, the Directory has shown a dramatic increase from 35 journals in 2002 to 7,889 in 2012 (Figure 1, upper panel). In 2002, the Reports already listed 5,876 journals, a number that reached 8,471 in 2012 (Figure 1, lower panel).
As shown in Figure 1, the percentage of annual increase in titles in the Directory tends to stabilise from 2006 onward, towards an average of about 20-30%. In the same period, a higher variability in the annual percentage increase rate can be observed for the Rreports journals and, with a few exceptions (2009 and 2010), the annual increase in Reports is far lower than 10%.
The total of journals indexed by Reports was 8,073, 8,336 and 8,471 in 2010, 2011 and 2012 respectively. In the same years, 783, 861 and 928 of these journals appeared were also indexed in the Directory. Thus, the corresponding percentage of journals listed in both the databases considered, in the three years examined, is equal to 9.7%, 10.3% and 11% in 2010, 2011 and 2012 respectively.
Year | No. of journals indexed in | |||
---|---|---|---|---|
Both DOAJ and JCR | JCR | % both DOAJ and JCR | ||
Total | 2010 | 783 | 8,073 | 9.7 |
2011 | 861 | 8,336 | 10.3 | |
2012 | 928 | 8,471 | 11.0 | |
ISS | 2010 | 27 | 371 | 7.3 |
2011 | 29 | 354 | 8.2 | |
2012 | 35 | 333 | 10.5 |
Interestingly, the 78 new titles in the Directory entering Reports in 2011 represent about 30% of the 263 Reports new entries, and in 2012 the 61 Directory journals represent about 45% of the 135 Reports new entries. The lower part of Table 1 reports data relating to Istituto Superiore di Sanità scientific paper production in 2010-2012. The Reports journals hosting Instituto-authored papers are 371, 354 and 333 in 2010, 2011 and 2012, respectively. Out of these figures, Directory titles were respectively 27 (7.3%) in 2010, 29 (8.2%) in 2011 and 35 (10.5%) in 2012. These results testify to an increasing percentage of journals included in both the Reports and the Directory databases over the considered years, as observed for the total number of journals.
Discipline distribution of surveyed journals
Figure 2 outlines the results regarding the discipline distribution of journals both in Journal citation reports and in the Directory in different areas of science. The x axis reports the disciplines and the y axis refers to the number of times each journal lies in a particular discipline. The total (Q1-Q4) frequency of occurrences (light-grey bar) and the frequency of journals ranked in the first quartile (Q1) (dark-grey bar) are reported.
Discipline grouping was performed according to that which has already been described in the Methods section above. For each journal title, the corresponding discipline in the Reports was counted as an occurrence, and the total number of occurrences recorded is reported on the x axis of the bar graph. The total (Q1-Q4) frequency of occurrences is visualized in the light-grey bar, while the dark-grey bar depicts the portion of journals ranked in the first quartile (Q1).
The most prevalent discipline identified was medicine, with 315 occurrences, followed by biological science disciplines and technology, industry, agriculture with 215 and 200 occurrences, respectively (Figure 2, upper panel). The journals where researchers affiliated with Istituto Superiore di Sanità published were mainly assigned to medicine and biological science disciplines (cumulatively representing 82.3% of total); 9 out the 11 total disciplines are present as well (Figure 2, lower panel).
In Figure 2, upper panel, as far as medicine is concerned, journals covered by both the Directory and the Reports were ranked in Q1 43 times (14% ) out of a total of 315, and as for biological science disciplines, those ranked in Q1 were recorded 40 times (19%) out of a total of 215. The highest percentage (22.6%, equivalent to 17 times) of journals that were ranked in Q1 out of a total of 75, was found for journals pertaining to earth sciences. As for the remaining disciplines, except for technology, industry, agriculture, they are present with less than 100 total recurrences, with a variable percentage rate in Q1, ranging from 10% (natural science disciplines) to 23% (earth sciences).
In the lower panel of Figure 2 results obtained on the frequency of disciplines assigned to journals recorded for ISS, indexed in both the Directory and the Reports databases and ranked in Q1, are summarised. Similarly to data observed for ‘total data’ (Figure 1, upper panel), the highest rate of occurrences (54) was found for journals ranked in Q1-Q4 quartiles and assigned to medicine, followed by those assigned to biological science disciplines (39 occurrences). As far as the number of times journals ranked in Q1 were assigned to a given discipline, the highest percentage rate was reached by medicine with 59% (equivalent to 32 times out of 54), followed by biological science disciplines showing the same percentage (59%) with 23 times out of 39.
Discussion
Principal results
This study analyses the number of open access journals which have acquired impact factor in the years 2010-2012, with the aim of describing the trend of open access publishing.
According to the data collected, since 2006 the number of journals in the Directory seems to have reached a constant rate of annual percentage increase of about 20-30% per year, which is far higher than the corresponding average increase observed for the Reports-covered journals (Figure 1). Besides, in the 2010-2012 year interval, there is an increasing percentage of Directory titles included in the Reports' database, ranging from 9.7 to 11 % (Table 1, total data). The percentage of new journals indexed on both databases compared to those indexed only by the Rreports in 2011 and 2012, goes from + 30% to +45% respectively. By now, these journals have improved their quality standards and tend increasingly to acquire impact factor, thus entering the Reports ranking lists. The adoption by the Directory of more strict journal inclusion criteria (Directory..., 2016) is also a contributing factor to the rapid affirmation and credibility of open access journals of academic interest, and will surely enhance further analyses based on selected well-reputed open access journals.
Thomson Reuters, leader in the citation analysis for research assessment, recently carried out a study on the number of open access journals indexed in Web of Science databases which support such a scenario. In fact, it was found that a remarkable number of journals, assumed as being open access on the basis of their accessibility over the Internet at no charge, 'met the rigorous selection criteria' envisaged to allow inclusion in Web of Science databases (Pringle, 2004). This result was welcomed by the Company as evidence that open access journals have increased their quality and impact. Therefore, it seems that open access journals which meet the criteria to be entered in Journal citation reports gain position at a fast rate in the Q1 ranking of the concerned disciplines.
In other words, the increasing weight of open access journals in those listed in Journal citation reports depicts a framework in which the authors themselves are those who influence the whole scientific publishing landscape by choosing the journals where they publish their articles. Thus, open access journals are becoming a driving force of a virtuous circle which favours data reproducibility in science, and researchers themselves may act as the engines of this mechanism (Meyer et al., 2012).
These considerations are also in line with the results of the analysis performed on articles published by researchers affiliated with Istituto Superiore di Sanità. The titles in the Directory included in the Reports were 7.3% in 2010, increased to 8.2% in 2011 and moved up quickly to 10.5% in 2012 (Table 1), showing the authors’ increasing interest in open access publishing, which deserves to be monitored through the years in order to ascertain a more meaningful trend. It is noteworthy that about 59% of articles by research staff were published in journals listed both in the Reports and the Directory.
Limitations
It may be argued that the reported results, derived from objective analysis of available data, are affected by some methodological limitations. As pointed out by Morrison (2014b), the adoption by the Directory in 2014 of stringent criteria about journal inclusion has led to incomplete matching of the actual number of Directory journals with those previously included in the database. A further element of uncertainty about the number of journals could be a delay in the update of the Directory. These elements can make the analysis of the open access growth misleading, which is the reason why it has been sustained that these limitations are equivalent to comparing apples with oranges.
However, also taking into account the above mentioned reasoning, the bias in the results of the present analysis seems not to be relevant in statistical terms. The only period for which the Directory provides data about both added and removed journals is for 2014 onwards; during 2014, 195 journals were removed from the Directory. Therefore, it may be assumed that the discrepancy relating to the total number of counted journals is about 200 journals a year. The number of journals in the Directory ranges from 5,212 in 2010 to 7,889 in 2012, allowing the estimation of the average error to be roughly between 4% and 8% per year, representing an acceptable value of uncertainty for this kind of analysis. Furthermore, it is unlikely that journals that did not comply with the more strict inclusion criteria adopted by the Directory would affect the double search on its and the Reports databases, thus leading to the assumption that the removed journals were those with impact factor.
In the time span considered, this analysis was carried out on the basis of the only search functions available and should be considered as a valuable starting point for further studies. It is worth noting, however, that future analyses to assess open access growth will also be hindered by the removal, since early 2014, of the filter used for searching the number of journals added each year.
Conclusions
The results of this analysis, beyond some methodological limitations, show a growth of open access scholarly publishing. In fact, a raising awareness of open access advantages, the push for the open access model by major science funding agencies, as well as the great attention devoted by scientific medical journals to the open access option testify for the quality and impact of scientific data reported in open access journals, especially in the biomedical field. As a consequence, the authoritative status achieved by open access journals has moved the open access publication offer from being just an opportunity to becoming a steering tool in the hands of authors, who make an important contribution to the re-shaping of the scholarly publishers’ editorial policies.
In line with this trend ‘even big conservative publishing houses like Elsevier and Nature have started featuring Open Access titles within their journal collections’ (Aguzzi, 2015, p. 145). Further evidence of this successful open access formula, persuading publishers to adopt the open access option, is the availability of a new tool for helping authors choose the journal that best meets their needs: a grid to measure the journal’s level of openness in terms of the authors’ rights to re-use published content. It was launched by the Public Library of Science, an open access pioneer publisher which, in collaboration with the Scholarly Publishing and Academic Resources Coalition (SPARC) and the Open Access Scholarly Publishers Association (OASPA), has created HowOpenIsIt? (Scholarly..., 2014), a guide and an accompanying grid to help authors identify the level of openness of publishers’ policies.
The open access paradigm, once conceived as a top-down disrupting strategy imposed on authors’ heads, is gaining the status of a bottom-up methodology for communicating science operated by authors themselves to give a boost to the publishing market.
Acknowledgements
The authors wish to thank Funda Topuz for revising the manuscript in English. Special thanks to Silvia Negrola for revising the manuscript and bibliography according to the Instructions and to Ugo Visconti for appropriate suggestions to improve the document.
About the authors
Elisabetta Poltronieri is currently working at the Publishing Unit of Istituto Superiore di Sanità (National Health Institute, ISS), Rome (Italy). Her main responsibilities include bibliographic editing and the diffusion of scientific publications authored by the Institute’s researchers. She can be contacted at elisabetta.poltronieri@iss.it ORCID No. orcid.org/0000-0002-7523-5993
Elena Bravo is senior researcher at the Department of Haematology, Oncology and Molecular Medicine, Istituto Superiore di Sanità, Rome, Italy. She works at the project for the optimisation of the Italian Network of Biobanks and from 2009 she is contributing to the Italian participation to BBMRI (Biobanking and Biomolecular Resources Research Infrastructure). She is the delegate of the Italian representative in European legal consortium BBMRI-ERIC. She can be contacted at elena.bravo@iss.it ORCID No. orcid.org/0000-0002-3473-6767
Moreno Curti is a medical doctor, responsible for the Documentation Service and the Scientific Library, Foundation IRCCS - Policlinico San Matteo, Pavia, Italy. He deals with organizational and managerial aspects related to the research activity of his Institution, with particular attention to the process of evaluation of the scientific production through the application of the main bibliometric indicators. He is coordinator of the 'Bibliosan' System, the network of all Italian biomedical research institutions' libraries. He can be contacted at: curtim@smatteo.pv.it ORCID No. orcid.org/0000-0003-4120-0861
Maurizio Ferri has been working at the Managament Data Unit of the Istituto Superiore di Sanità , Rome, Italy since 1987. He currently deals with designing and creating scientific databases to support the activity of ISS internal research staff. He also gained sound experience in development and teaching activity regarding the application of health informatics systems and software. He can be contacted at maurizio.ferri@iss.it
Cristina Mancini has been working at the Library of the Istituto Superiore di Sanità, Rome, Italy since 1989, first as reference librarian and then as responsible of the Serial publications unit. She is currently involved in planning, organizing and teaching activity referring to training courses on information retrieval tools for biomedical librarians and related health professions held at the ISS Library. She is also interested in programmes related to development and enhancement of electronic journal collections and projects on electronic journal digital archiving. She can be contacted at cristina.mancini@iss.it