Next Article in Journal
What Proportion of Systematic Reviews and Meta-Analyses Published in the Annals of Surgery Provide Definitive Conclusions—A Systematic Review and Bibliometric Analysis
Next Article in Special Issue
Should the Use of Patient Medical Information in Research Require the Approval of Attending Physicians?
Previous Article in Journal
RecSys Pertaining to Research Information with Collaborative Filtering Methods: Characteristics and Challenges
Previous Article in Special Issue
How to Protect the Credibility of Articles Published in Predatory Journals
 
 
Order Article Reprints
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Correction of the Scientific Production: Publisher Performance Evaluation Using a Dataset of 4844 PubMed Retractions

1
Independent Researcher, 700054 Iasi, Romania
2
Department of Immunology and Alergology, Emergency Clinical Hospital ‘St. Spiridon’, 700098 Iasi, Romania
3
Faculty of Medicine, “Grigore T. Popa” University of Medicine and Pharmacy of Iași, 700098 Iasi, Romania
*
Author to whom correspondence should be addressed.
Publications 2022, 10(2), 18; https://doi.org/10.3390/publications10020018
Received: 7 February 2022 / Revised: 23 March 2022 / Accepted: 19 April 2022 / Published: 21 April 2022
(This article belongs to the Special Issue Publication Ethics and Research Integrity)

Abstract

:
Background. Retraction of problematic scientific articles after publication is one of the mechanisms for correcting the literature available to publishers. The market volume and the busi-ness model justify publishers’ ethical involvement in the post-publication quality control (PPQC) of human-health-related articles. The limited information about this subject led us to analyze Pub-Med-retracted articles and the main retraction reasons grouped by publisher. We propose a score to appraise publisher’s PPQC results. The dataset used for this article consists of 4844 Pub-Med-retracted papers published between 1.01.2009 and 31.12.2020. Methods. An SDTP score was constructed from the dataset. The calculation formula includes several parameters: speed (article exposure time (ET)), detection rate (percentage of articles whose retraction is initiated by the edi-tor/publisher/institution without the authors’ participation), transparency (percentage of retracted articles available online and the clarity of the retraction notes), and precision (mention of authors’ responsibility and percentage of retractions for reasons other than editorial errors). Results. The 4844 retracted articles were published in 1767 journals by 366 publishers, the average number of retracted articles/journal being 2.74. Forty-five publishers have more than 10 retracted articles, holding 88% of all papers and 79% of journals. Combining our data with data from another study shows that less than 7% of PubMed dataset journals retracted at least one article. Only 10.5% of the retraction notes included the individual responsibility of the authors. Nine of the top 11 publishers had the largest number of retracted articles in 2020. Retraction-reason analysis shows considerable differences between publishers concerning the articles’ ET: median values between 9 and 43 months (mistakes), 9 and 73 months (images), and 10 and 42 months (plagiarism and overlap). The SDTP score shows, from 2018 to 2020, an improvement in PPQC of four publishers in the top 11 and a decrease in the gap between 1st and 11th place. The group of the other 355 publishers also has a positive evolution of the SDTP score. Conclusions. Publishers have to get involved actively and measurably in the post-publication evaluation of scientific products. The introduction of reporting standards for retraction notes and replicable indicators for quantifying publishing QC can help increase the overall quality of scientific literature.

1. Introduction

“One of the greatest criticisms in the blogosphere is not so much that the current rules and guidelines are weak or poor, but that enforcement and irregular application of those rules, particularly by COPE member journals and publishers, confuses the readership, disenfranchises authors who remain confused (despite having a stricter and more regulated system) and provides an imbalanced publishing structure that has weak, or limited, accountability or transparency.” [1]
The publication of scientific literature represents, globally, a market of considerable size, which reached a record value of $28 billion in 2019 (from $9.4 billion in 2011 [2]) and fell to $26.5 billion in 2020, with forecasts suggesting a recovery of losses by 2023. Revenues from the publication of articles in 2019 were $10.81 billion and those from the publication of books $3.19 billion, with derivative products representing the difference. The segment of medical publications ($12.8 billion in 2020) is constantly growing. Estimations show that in 2024 the medical literature will exceed the volume of the technical and scientific literature [3]. The continued growth was accompanied by a consolidation process that made the top five publishers in 2013 represent over 50% of all published articles. These changes occur in an atypical market where publishers have high profit margins [4], do not pay for purchased goods (authors are not paid), do not pay for quality control (peer review), and have a monopoly on the content of published articles (Ingelfinger law) [5]. In the case of medical literature, because of the direct impact that scientific errors or fraud may have on patients’ health, publishers have an ethical obligation to invest resources in post-publication quality control and disseminate identified issues as quickly and widely as possible. One of the methods is the retraction of scientifically, ethically, or legally invalid articles (questionable research practices (QRP), questionable publication practices (QPP)). The interest in correcting the literature seems to be confirmed by recent developments: the number of journals with at least one retracted article increased from 44 in 1997 to 488 in 2016 [6]. The continued growth may be due to an improved capacity to detect and remove problematic articles [7], which, despite a somewhat reluctant if not a resisting editorial environment [8,9] and the lack of significant progress in reporting [1,10] continues to expand in the publishing environment. For example, in the case of the PubMed retractions, 2020 was a record year in terms of retraction notes, targeting 878 articles published in more than 12 years [11].
The retractions in the biomedical journals indexed in PubMed represent a topic that was intensely researched in the last two decades, numerous articles having made valuable contributions in this field: the number of authors [12,13], author countries [14,15,16], publication types [12,13,17], exposure time/time to retraction, retraction reasons, citations received by retracted articles, financing of retracted research, impact factor of journals publishing retracted research, who is retracting, study types, and retraction responsibility [12,13,14,15,16,17,18,19,20,21,22,23,24,25]. However, there is little information on the PubMed article retractions at the publisher level and, therefore, an incomplete picture of the challenges/difficulties they face in the post-publication quality control of products delivered to consumers of scientific information. Several authors point out issues that arise when retracting a scientific article:
The process of retracting an article is a complex one, depending on several factors: who initiates the retraction, the context, the communication between the parties, and the editorial experience [26].
The clarity of the retraction notes leaves much to be desired and presents a significant variability between journals and/or publishers in relation to the COPE guidelines [1,27,28,29] although a uniform approach has long been required [15].
The individual contribution of the authors is rarely mentioned in the retraction notes, contrary to the COPE recommendations [29].
The online presence of retracted articles, required in the COPE guidelines, ensures more transparency and avoids the occurrence of “silent or stealth retractions” [30].
The role of publishers and editors avoiding what is called editorial misconduct is an important one [31]. Editorial errors (duplicate publication, accidental publication of wrong version/rejected article, wrong journal publication) were identified in different proportions: 7.3% (328 cases) in a 2012 study that analyzed 4449 articles retracted between 1928–2011 [32], 1.5% (5 cases) in another study [29], 5% in the Bar-Ilan study [33], and 3.7% in our study on PubMed retractions between 2009 and 2020 [11].
The level of involvement of publishers and editors in article retraction is variable [28,32] although most of them can initiate the retraction of an article without the authors’ consent [34].
The different efficiencies of the QRP and QPP detection mechanisms at the editorial level may explain the differences between publishers [35]; the possible application of post-publication peer review [36,37] can contribute both to the increase of the detection capacity and the reduction of the differences between journals/publishers.
Editorial policies are/can be implemented at the publisher level and could positively/negatively affect the performance of all journals in its portfolio. We considered it would be constructive to present an overview of retracted articles and a structure of retraction reasons of major publishers using a dataset obtained from the analysis of 4844 biomedical PubMed retractions from the period 2009–2020.
We also deemed it worthwhile to initiate a debate on the performance of publishers in correcting the scientific literature. For this reason, we propose a score based on four indicators: speed of article retraction, post-publication ability of the publisher/editors to detect QRP/QPP articles, transparency (measured by the online maintenance of retracted articles and the clarity of retraction notes), and the precision of the correction process (identification of those responsible and the degree of avoidance of editorial errors).
Our study has three main objectives:
  • Current numbers and progression of 2009–2020 human-health-related PubMed retracted articles and their retraction notes, grouped by publisher.
  • Retraction reasons for publishers on the first 10 positions.
  • Appraisal of publisher quality control and post-publication peer review effectiveness using a score based on several components.
What do we consider new in this article?
Presentation of the main retraction reasons and exposure time (ET) for leading publishers:
Dynamics of retraction notes at publisher level;
Evaluation of publisher performance for the three main retraction reasons;
Proposal of a score for measuring publisher post-publication QC performance (SDTP score: speed-detection-transparency-precision score)

2. Materials and Methods

The methodology used to collect the data is presented in detail in another article [11]. Here we describe the main elements: analyzed period: 2009–2020 (last update for retracted articles made on 31 January 2021); data source for retracted articles: PubMed csv files; source of data for publishers holding journals in which the retracted articles and notes were published: SCOPUS; and the period during which the data were analyzed: July 2020–May 2021.
Information extraction and partial processing: teamwork web application developed by the first author in which PubMed csv files were imported and analyzed.
Final processing: export from the web application to the SPSS, coding, and analysis.
For the publishing and editorial performance indicators, we have built a score (SDTP score, Table 1) consisting of four components and six values represented equally and calculated from the dataset collected: speed, detection rate, transparency, and precision.
In the case of publisher/editor involvement (2-Detection rate), we used in the calculation all articles that in the retraction note mentioned the involvement of the publisher, editor in chief, editorial board, institution, and Office of Research Investigation, without authors.
To measure the identification of the authors’ responsibility (5-Precision-Individual responsibility), we used as the calculation base the number of articles with more than one author and retractions for other reasons than editorial errors.

Calculation of SDTP Score

The values for each publisher are compared with the values of the entire set of 4844 retracted articles (2009–2020), 3931 articles (2009–2019), and 3361 articles (2009–2018). There are two situations:
A.
Values above average are considered poor performance. Examples: exposure time (speed); calculation formula for speed: 100 − (publisher value/baseline value) * 100]
B.
A percentage value above the average is considered a good performance. Example: detection rate, percentage of online papers, percentage of clear retraction notes, percentage of retractions in which individual author responsibilities are mentioned, percentage of retractions that are not due to editorial errors. Formula: [(publisher value/baseline value) * 100] − 100.
The values obtained are summed and form the SDTP score of the publisher.

3. Results

3.1. Retractions by Publishers

The 4844 retracted articles were published in 1767 journals. The average number of retracted articles is 2.74/journal. Several studies have reported publisher rankings, with the top positions being consistently occupied by publishers with a large number of publications [7,28,40]. This is also reflected in the results obtained by us. Forty-five publishers with more than 10 retractions account for 88% (n = 4261) of retractions and 79% (n = 1401) of journals (Table 2). The remaining 583 papers are published in 366 journals associated with 321 publishers.
The top 11 publishers have 3405 retracted articles (70.3%) in 1165 journals (65.9%). In the following, we will only analyze their evolution and performance. The rest of the publishers will be analyzed within a single group.

3.2. Retraction Notes/Publisher (2009–2020)

The year 2020 is the most consistent year for retracted articles for almost all publishers in the top 11, except PLOS, which peaked in 2019, SAGE in 2017, and E-Century Publishing in 2015 (Table 3). The year 2015 seems to be, for most publishers, the beginning of a greater interest in correcting the medical literature.

3.3. Publishers and Retraction Reasons

The retraction reasons of the top 11 publishers are presented in Table 4. Multiple reasons in one retraction note were added to the respective categories, thus explaining the publisher percentage sums higher than 100%.

3.4. Mistakes/Inconsistent Data

Of the 1553 cases, 1005 (64.7%) belong to the top 11 publishers, presented in Table 5. The rest of the publishers account for 548/1553 cases. In 229/1553 cases (14.7%, 95% CI 12.9–16.5), the reason for the retraction was data fabrication. For the top 11 publishers, there are 127/1005 (12.6%, CI95% 12.6–14.7) cases of data fabrication (48 Elsevier, 22 for Springer Nature and Wolters Kluwer, 12 for Wiley-Blackwell, 7 for SAGE, 5 for E-Century Publishing, 4 for Taylor & Francis and PLOS, and 3 for Hindawi). The other publishers have 102/548 articles retracted for data fabrication (18.6%, CI95% 15.3–21.8).
The lowest ET average belongs to Taylor & Francis (15.4 months) and the highest to PLOS (41.5 months). We note in the meantime that in most of the cases skewed distributions and median values ranging from an encouraging 9 months (Dove Medical Press and Spandidos Publications), 11 months (Wolters Kluwer), and 12 months (Taylor & Francis) to a rather unexpected 43 months for PLOS.

3.5. Images

Several publishers have a high number of retractions that were due to image problems: PLOS (174 of 275 papers, 63.3%), Elsevier (281 of 846 papers, 33.2%), and Springer Nature (132 of 749 papers, 17.62%), possibly signaling the implementation of procedures and technologies to detect problematic articles.
In the case of PLOS (details presented in Table 6), out of the 174 articles retracted for image problems, 150 (86.2%) were published between 2011–2015 and 90.9% (158) of the retraction notes were published between 2017–2020. This suggests that 2017 could be the year when the systematic and retroactive verification of the images in the articles published in 2009–2020 began. Only 10 articles published by PLOS between 2016–2020 (no articles in 2019 and 2020) were retracted because of images, suggesting the effectiveness of the measures implemented by this publisher and that, probably, the articles with questionable images were stopped before publication.
In the case of Elsevier, out of the total of 281 articles retracted for image issues, 246 (87.5%) were retracted between 2015–2020 (2016 is the first year with a significant number of retraction notes, almost half of those published in that year) and the period 2016–2020 was characterized by a slight decrease in problematic articles (74, 26.3%). Elsevier did not have any articles retracted in 2020 because of image problems. These data suggest an increased effectiveness in dealing with image issues. The year 2016 seems to mark the beginning of implementing the procedures and technologies for image analysis at this publisher (Table 7).
In the case of Springer Nature, the focus on images manifested itself a little later (2018–2019), with each year from 2009–2020 containing articles retracted because of the images (Table 8).
Exposure time (ET) for articles with image problems (Table 9) varies between median values of 9/11 months (Dove Medical Press/Spandidos Publications) and 73 months (PLOS). For most other publishers, the median values are between 43 and 63.5 months (average ET value is over 50 months), an exception being Springer Nature, with a median value of 28 months and an average of 35 months.

3.6. Plagiarism and Overlap

The total number of plagiarism and overlap cases for the top 11 publishers is 907 (893 unique articles from which 14 were retracted for both plagiarism and overlap): 509 for plagiarism and 397 for overlap. One of the publishers outperforms the others: with only 22 instances/21 articles representing less than 10% of their total retracted articles number, PLOS seems to have developed procedures that prevent the publication of articles that reuse text or plagiarize other scientific papers. However, post-publication average exposure time (ET) until retraction is the second largest of all publishers: 37.1 months.
Exposure time for plagiarism and overlap cases is presented in Table 10.
The median values of ET are between 10 months (Dove Medical Press) and 42 months (Hindawi) with major publishers relatively well-positioned: 15 months for Wolters Kluwer, 15.5 months for Taylor & Francis, 17 months for Wiley-Blackwell and SAGE, 19 months for Springer Nature, and 20 months for Elsevier. Average values of ET spread between 17.5 months (Dove Medical Press) and 40.1 months (Hindawi).
Median values for the top three retraction reasons are represented in Figure 1.

3.7. S(peed)D(etection)T(ransparency)P(recision) Score

To quantify the activity of the publishers, we calculated the SDTP score composed of six variables which, in our opinion, can provide an image of their involvement in ensuring the quality of the scientific literature. Baseline values of SDTP score components are shown in Table 11.
The SDTP score of top 11 publishers and for the whole group of publishers below the 11th place was calculated for the intervals 2009–2020 (4844 articles, Table 12), 2009–2019 (3931 articles, Table 13), and 2009–2018 (3361 articles, Table 14),
The scores for 2018, 2019, and 2020 show signs of a consistent approach (such as Elsevier and Wiley-Blackwell, SAGE, Spandidos Publications), in which the increase in the number of retracted items is associated with an improvement in the overall score. There are also signs of a decrease in the quality of the retraction notes (such as PLOS, Wolters Kluwer) or the lack of noticeable changes (such as Springer Nature). Some publishers (Taylor & Francis, Hindawi, Dove Medical Press) seem to manage the quality control of their published articles more effectively, recording, however, a decrease of their overall scores between 2018 and 2020. The group represented by the rest of the publishers also marks an increase in the SDTP score. Individual results for the top 11 publishers and the “rest of the publishers” group are displayed in Table 15, Table 16, Table 17, Table 18, Table 19, Table 20, Table 21, Table 22, Table 23, Table 24, Table 25 and Table 26 ( Publications 10 00018 i004 = performance degradation, Publications 10 00018 i003 = improvement of performance, Publications 10 00018 i005 = a difference ofless than/equal to 0.1 points is considered stationary).

4. Discussion

There are many with the opinion that the actual number of articles that should be retracted is much higher than the current number [14,18,41,42,43].
Strengthening editorial procedures may decrease the number of articles retracted after publication. The quality and structure of the peer review process (author blinding, use of digital tools, mandatory interaction between reviewers and authors, community involvement in review, and registered reports) does have a positive role in preventing the publication of problematic articles [44]. However, at the moment, the process of correcting the scientific literature seems to be on an upward trend [11] which leads us to believe that there are still enough articles already published that require further analysis.
The post-publication analysis of scientific articles requires considerable effort from publishers/editors. Their performance when it comes to controlling the quality of the scientific product depends not only on internal (organizational) factors but also on external factors such as “post-publication peer review” or the intervention of authors/institutions [36,45] This dependence of quality control on external factors is also reflected in our data by the low involvement rate of the institutions to which the authors are affiliated: out of a total of 4844 retraction notes, only 465 (9.6%) mention the involvement of an institution. This number may be underestimated, but even so, given that editorial procedures sometimes include communication with authors’ institutions, the lack of effective communication with them makes the work of editors/publishers difficult when it comes to the quick retraction of an article or clarifying the retraction reasons.
Despite the impediments generated by the complexity of editorial procedures, if, as suggested [10], retracting an article may be regarded as a practical way to correct a human error, it would probably be helpful to measure publisher performance when it comes to quality control over scientific articles.

4.1. How Many Journals?

We note in our study a concentration of retracted articles in a relatively small number of publishers, the first 11 having 70.3% of the total retracted articles and 65.9% of the total number of scientific journals in which these are published. The top 45 publishers account for 88% of all articles and 79% of all journals. In our study, the total number of journals that have retracted at least one article is 1767, representing a small share of the 34,148 journals indexed in PubMed [46] (https://www.ncbi.nlm.nih.gov/nlmcatalog/?term=nlmcatalog+pubmed[subset], accessed on 9 January 2022) up to end of 2020.
Our study included only articles related to human health, excluding 775 articles that did not meet the inclusion criteria. Even if we added another 775 journals (assuming an article/journal), the share would remain extremely low, below 10%. On 9 January 2022, using the term “retracted publication (PT)” we get a total of 10,308 records starting with 1951. Using the same logic (one retracted article/journal) would result, in the most optimistic scenario, in another 5464 journals with retracted articles, which, added to the 1767 in our batch, would bring the total to 7231, just over 20% of the total number of journals registered in PubMed. However, we are helped here by a study published in 2021 [25], which analyzed 6936 PubMed-retracted articles (up to August 2019) and identified 2102 different journals, of which 54.4% had only one article retracted.
Our dataset contains, from September 2019–January 2021, 169 journals that retracted at least one article (within the included articles set) and 59 journals with at least one retracted study (within the excluded articles set). Taking into account these figures and including journal overlaps, we can say that the number of journals in PubMed that reported at least one retraction is at most 2330, less than 7% of the total number of active or inactive PubMed journals (the inactive status of a journal is not relevant for our estimation as the evaluation of the number of journal titles took into account the period 1951–2020, the time interval in which all journals had periods of activity).
Mergers, acquisitions, name changes, or discontinuations make it challenging to assess the number/percentage of journals affected by retractions from a publishers’ portfolio. In the case of those with a smaller number of titles (PLOS, Spandidos, or Verducci), we see that post-publication quality control is implemented for almost all of their journals. When we talk about publishers with a medium number (>100) and a large number (>1000) of titles, the size of the quality problems of the published articles seems smaller. We are not sure if the data we found for an average publisher (out of 105 journals in the website portfolio [47] or 331 registered journals in PubMed or 94 active journal titles in Scopus, only 8 withdrew a total of 12 articles) or large (out of over 2000 of titles in PubMed only a little over 300 had articles retracted) reflect an effective quality control before publication or an insufficient quality control after publication.
Are over 90% of journals without a retracted article faultless? This is a question that is quite difficult to answer at this time, but we believe that the opinion that, in reality, there are many more articles that should be retracted [43] is justified.

4.2. Retraction Reasons

Of the 11 publishers analyzed, 9 recorded the highest number of retraction notes in 2020, which seems to reflect a growing interest in correcting the scientific literature.

4.3. Mistakes/Inconsistent Data

Detection of design and execution errors in research [48] may stop the publishing of the article and cause it to be rejected or corrected before publication. However, there are also situations in which the correction is necessary after publication, the errors not being detected in the peer review [49]. The leading retraction cause of scientific articles in our study is represented by mistakes/inconsistent data, with 1553 articles. The top 11 publishers have 1005 articles (64.7% of the total).
Data from the top 11 publishers show a large dispersion and an average ET duration of 15.4 months (Taylor & Francis), 17 months (Spandidos Publications), 19.7 months (Wolters Kluwer), and 41.6 months (PLOS). Median values, however, express better performances for most publishers (Table 5). At this time, we do not know if these values are due to delays in discovering errors, the length of the retraction procedure, or a systematic retroactive check implemented at the publisher or journal level.
The retraction notes mention 229 data fabrication cases, which justifies the need to develop and test the effectiveness of a set of statistical tools capable of detecting anomalies in published datasets (Hartgerink et al. 2019). In our opinion, the number of data fabrication cases may be underestimated: there are 293 cases in which researchers could not provide raw data, 180 cases of a lack of reproducibility, a lack of IRB approval in 134 cases, 47 cases of research misconduct, and 350 cases of fraudulent peer review. All can camouflage situations where data have been fabricated, even if this was not explicitly mentioned in the retraction. The first 11 publishers have 127/229 (55%) of the data fabrication cases.

4.4. Images

The images represent one of the retraction reasons, which has been growing in recent years, the increased interest in the subject highlighting its unexpected magnitude but also the development of tools that facilitate the detection of image manipulation in scientific articles [50,51,52,53,54,55].
The total number of image retractions is 1088. Of these, 805 (74%) belong to the top 11 publishers, and 587 of those (54%) belong to 3 publishers: Elsevier (281), PLOS (174), and Springer Nature (132)(see Table 9). Exposure time for articles retracted because of images is high for all publishers with two exceptions: Dove Medical Press (44 retractions, 19.3 months ET average, median 9 months) and Spandidos Publications (21 retractions, ET average 22.6 months, median 11 months). Springer Nature has a slightly better value (132 retractions, mean ET 35.2 months, median 28 months). The other publishers have values between 50 and 57 months, with medians between 43 and 63 months. PLOS has the highest ET: 70 months (median 73).
We reported in a previous study that 83% of image retractions were issued in the period 2016–2020 and the average value of ET is 49.2 months [11]. Therefore, the values mentioned above are not necessarily surprising, as they are the result of at least two factors: the relatively recent implementation at the editorial/publisher level of image analysis technologies (quite likely started between 2016–2018, see Table 6, Table 7 and Table 8) and the effort of publishers to analyze and retract from the literature articles published even 10–11 years ago. The good result of Dove Medical Press can be explained by the fact that most of the retracted articles are recently published and the retraction notes were given relatively quickly, in the period 2017–2020, being initiated by the publisher/editor in 33/44 cases. Fast turnaround times appear both at Spandidos Publications and at Springer Nature (Table 9). The percentages of initiation of the retraction by the publisher/editor are 50% and 56%, respectively. In these cases, it is possible that there is a workload that is easier to manage, shorter procedures/deadlines, or, simply, a better organization than competitors. The case of PLOS is a special one, the duration of 70 months of ET being 13 months longer than that of the penultimate place (Wolters Kluwer, 57 months); in 114/174 cases, the retraction was initiated at the editorial level. The profile of the published articles, the lack of involvement of the authors (only 25/174 cases involved the authors in one way or another), the slowness of the internal procedures, or the too-long time given to the authors to correct/provide additional information and data could explain this value. The extended correspondence period with the institutions is not supported by our data (for the 30 cases in which the institution involvement was mentioned in the retraction note, the average value of the ET is 72 months).
The values recorded by the other publishers seem to reflect an effort to correct the literature but also difficulties in managing a rapid retraction process.

4.5. Plagiarism and Overlap (Text and Figures, No Images)

Detection of plagiarism/overlap can not only be an obligation of the authors/institutions to which they are affiliated [56] but should also be an essential component of scientific-product quality control at the publisher level. The identification of a plagiarized paper/an overlap case after publication represents, in our opinion, a modest editorial performance, especially in the context in which methods and applications (with all their shortcomings) are more and more widespread [57,58].
The total number of articles retracted for plagiarism/overlap is 1201. In 18 of them, both plagiarism and overlap are registered simultaneously. The top 11 publishers have 893 articles (74.3%). In terms of the number of articles, the first place is occupied by Springer Nature (222), followed by Elsevier (174) and Wolters Kluwer (139).
In our study, we identified only one publisher with a good performance in terms of quantity (PLOS—only 8% of all articles retracted were plagiarism/overlap). SAGE (17.4% of all retracted articles) and Wiley-Blackwell (23.7% of all retracted articles) also have reasonable levels.
Regarding the speed of retraction of plagiarism/overlap cases, the best performance is recorded by Dove Medical Press (average ET 17.5 months, median 10 months). The highest value of ET is recorded in Hindawi (40.1 months on average, median 42 months). Surprisingly, although it has a small number of articles (21), PLOS has the second-highest ET value (37.1 months average, 37 months median). The rest of the publishers have values around the average for the whole lot (average 24 months, median 17 months).
No publisher falls below an average ET of 22 months, with one exception. Their lower performance seems to be influenced, similar to mistakes or image retractions, by the late detection of a small number of articles (skewed distributions with a median lower than average). However, this modest performance shows severe problems at the editorial/publisher level. More than three years to retract a plagiarized/overlap article indicates significant gaps in publishers’ detection and intervention capacities in this situation.

4.6. SDTP Score

The problem of correcting the scientific literature is one that, by the nature of the procedures to be followed, the resources to be allocated, and the complexity of the interactions needed to retract an article, sometimes exceeds the organizational capacity of a scientific journal [59,60,61,62]. We believe that publishers and editors’ early and effective involvement in stopping, discouraging publication, and retracting QRP and QPP can help increase the quality of the scientific literature as a whole. The implementation of an independent evaluation system can help such an approach.
The parameters we use to evaluate the performance of publishers aim at the speed of the internal procedures of the journals in their portfolio (speed), the proactive behavior of the editorial staff (detection rate), and the transparency and precision of the retraction notes.
The interval 2018–2020 is characterized by an increase in the number of retracted articles by 44%, from 3361 to 4844: the increase varies between 164.3% (Dove Medical Press) and 13.8% (SAGE). At the entire dataset level, there is a decrease in performance in ET (an increase from 24.65 months to 28.89 months) and precision (the percentage of identification of responsible authors decreases from 12.8% in 2018 to 10.5% in 2020). The rest of the components are improving (Table 11).
For our dataset of PubMed retractions from 2009–2020, the first two places are held by two publishers of different sizes (Taylor & Francis and Hindawi), followed by another medium-sized publisher (Dove Medical Press). The same publishers appear, in changed order, when analyzing the periods 2009–2018 (Dove Medical Press, Taylor & Francis, Hindawi) and 2009–2019 (Taylor & Francis, Dove Medical Press, Hindawi).
We notice that an increase (between 2018 and 2020) in the volume of retracted articles has different effects at the publisher level: Dove Medical Press goes from first place to third place with a halving of the SDTP score (Table 23), PLOS (Table 19) goes from fourth place (2018) to tenth place (2020), and its score changes from a positive to a negative value. Taylor & Francis goes from second place in 2018 to first place in 2020 with a slight decrease in the score (Table 21), and Spandidos Publication improves its score (Table 25). Operating since the end of 2017 at Taylor & Francis (Taylor & Francis 2017), Dove Medical Press seems to have gained extra speed by almost doubling the number of retracted articles, perhaps because of access to more resources. PLOS’s performance is affected by late retractions, the average ET increasing from 34.6 months in 2018 to 57.7 months in 2020.
The best and worst performances for the six parameters of the SDTP score are in Table 27 and Table 28.
If we look at the individual performances of publishers between 2018 and 2020 (Table 15, Table 16, Table 17, Table 18, Table 19, Table 20, Table 21, Table 22, Table 23, Table 24, Table 25 and Table 26), the picture presented is rather one of declining performance.
Compared to 2018, seven of the 11 publishers decreased their overall score in 2020. Only four publishers improve their performance between 2018 and 2020: Elsevier, Wiley-Blackwell, SAGE, and Spandidos Publications. The publishers below the 11th-place group also show an improvement in the score between 2018 and 2020.
Changes in score components between 2018 and 2020 show interesting developments:
Only two publishers see an increase in four out of six indicators (Wiley-Blackwell and Taylor & Francis);
The other publishers’ group (below 11th place) also has an improvement in four indicators;
Elsevier is the only publisher with three growing indicators;
The other eight publishers register decreases to at least four indicators.
The changes go toward narrowing the gap between 1st place and 11th place: in 2018, the 11th place has −78.1 points, and 1st place has 109.4 points; in 2020, 11th place has −59.2 and 1st place 82.9 points.
The evolution of scores and indicators, often contrasting with the rank for 2020, associated with the narrowing of the gap between 11th and 1st place, leads us to anticipate a series of developments in the future:
A possible improvement in the performance of the group of publishers below 11th place;
A greater homogeneity of results for the first 11 publishers but also for the entire publishing environment;
An improvement of the results for the big players;
A continuation of the decrease of specific indicators (like ET) following the appearance of retractions for old articles in which the information necessary for a complete retraction note can no longer be obtained;
Possible improvement for the involvement of publishers/editors or the editorial errors.
In this context, it is worth discussing whether the time required to retract an article reflects the publishers’ performance or if there is a need for more complex measuring instruments that consider the multiple dimensions of publishing quality control.

4.7. Limitations

A small number of errors (20) was discovered when analyzing PubMed records, concerning mainly the publication date or the retraction note date;
The interpretation of retraction notes may generate classification errors, when several retraction reasons are mentioned;
Modifications/completions made after the study by publishers or editors to the retraction notes on their sites may modify the figures obtained by us;
The score is obtained by simple summation without taking into account the lower/higher weight that can be assigned to a specific component.

4.8. Recommendations for Future Work

The growing number of PubMed-retracted articles indicates the need for follow-up studies to see if new data confirm this trend.
There are variations between publishers of exposure time for articles retracted because of mistakes or data inconsistencies. A study on this subject may provide solutions for preventing the publication of such articles.
Image retractions are relatively new. For this reason, we consider it worthwhile to study in-depth the factors that determine the retraction of an article because of the images. Is it about technological progress? Have editorial policies already been changed, or are they currently being modified? Is there an editorial re-evaluation of the post-publication external peer review? Do the current retraction guidelines cope with image retraction difficulties?
The time required to remove plagiarized or overlap articles is long. Given the age of the anti-plagiarism technologies, it is not very clear what factors determine the further appearance of such articles in the scientific literature. A study that answers this question would probably allow for an improvement in editorial policies.
Studying changes to the SDTP score for publishers discussed in this article could help them implement consistent policies or correct issues that are more difficult to highlight at this time because of a lack of data.

5. Conclusions

“Like a false news report, printed retractions do not automatically erase the error which often pops up in unexpected places in a disconcerting way. There is no instant “delete key” in science” [63].
Retraction of problematic articles from the scientific literature is a natural process that should involve all stakeholders, including publishers.
Only a small number of journals indexed in PubMed are reporting retracted articles. We estimate that by January 2021, less than 7% of all journals in PubMed had retracted at least one article.
However, the correction efforts are obvious for all publishers, regardless of their size. Exposure time (ET), the involvement of publishers and publishers in initiating retractions, the online availability of retracted articles, and specifying the responsibility of authors are aspects that can be improved for all publishers reviewed in this paper.
The clarity of the retraction notes and the editorial errors are two indicators for which the potential for progress is limited only to specific publishers. The COPE guidelines must not only be accepted but must also be implemented. In this context, we believe that introducing a reporting standard for retraction notes will allow, along with the introduction of new technologies and the exchange of information between publishers, and better quality control of the scientific literature, one that can be easily measured, reproduced, and compared. The SDTP score proposed by us is only a small step in this direction.

Author Contributions

Conceptualization, C.T., L.P. and B.T.; Data curation, C.T. and B.T.; Methodology, C.T.; Project administration, C.T.; Software, C.T.; Validation, L.P.; Writing—original draft, C.T.; Writing—review & editing, L.P., B.T. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data that support the findings of this study are available on request from the corresponding author.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. da Silva, J.A.T.; Dobránszki, J. Notices and Policies for Retractions, Expressions of Concern, Errata and Corrigenda: Their Importance, Content, and Context. Sci. Eng. Ethics 2017, 23, 521–554. [Google Scholar] [CrossRef] [PubMed]
  2. Van Noorden, R. Open access: The true cost of science publishing. Nature 2013, 495, 426–429. [Google Scholar] [CrossRef] [PubMed][Green Version]
  3. International Association of Scientific Technical and Medical Publishers: STM Global Brief 2021—Economics & Market Size. 2021. Available online: https://www.stm-assoc.org/2021_10_19_STM_Global_Brief_2021_Economics_and_Market_Size.pdf (accessed on 21 November 2021).
  4. Hagve, Martin: The Money behind Academic Publishing. 2020. Available online: https://tidsskriftet.no/en/2020/08/kronikk/money-behind-academic-publishing (accessed on 27 December 2021).
  5. Larivière, V.; Haustein, S.; Mongeon, P. The Oligopoly of Academic Publishers in the Digital Era. PLoS ONE 2015, 10, e0127502. [Google Scholar] [CrossRef]
  6. Brainard, J. Rethinking retractions. Science 2018, 362, 390–393. [Google Scholar] [CrossRef]
  7. Vuong, Q.-H.; La, V.-P.; Ho, M.-T.; Vuong, T.-T. Characteristics of retracted articles based on retraction data from online sources through February 2019. Sci. Ed. 2020, 7, 34–44. [Google Scholar] [CrossRef][Green Version]
  8. Marcus, A.; Oransky, I. What Studies of Retractions Tell Us. J. Microbiol. Biol. Educ. 2014, 15, 151–154. [Google Scholar] [CrossRef] [PubMed][Green Version]
  9. Friedman, H.L.; MacDonald, D.A.; Coyne, J.C. Working with psychology journal editors to correct problems in the scientific literature. Can. Psychol. Can. 2020, 61, 342–348. [Google Scholar] [CrossRef]
  10. Vuong, Q.-H. Reform retractions to make them more transparent. Nature 2020, 582, 149. [Google Scholar] [CrossRef]
  11. Toma, C.; Padureanu, L. An Exploratory Analysis of 4844 Withdrawn Articles and Their Retraction Notes. Open J. Soc. Sci. 2021, 09, 415–447. [Google Scholar] [CrossRef]
  12. Pantziarka, P.; Meheus, L. Journal retractions in oncology: A bibliometric study. Future Oncol. 2019, 15, 3597–3608. [Google Scholar] [CrossRef] [PubMed]
  13. Rapani, A.; Lombardi, T.; Berton, F.; Del Lupo, V.; Di Lenarda, R.; Stacchi, C. Retracted publications and their citation in dental literature: A systematic review. Clin. Exp. Dent. Res. 2020, 6, 383–390. [Google Scholar] [CrossRef]
  14. Steen, R.G. Retractions in the scientific literature: Do authors deliberately commit research fraud? J. Med. Ethics 2011, 37, 113–117. [Google Scholar] [CrossRef][Green Version]
  15. Fang, F.C.; Steen, R.G.; Casadevall, A. Misconduct accounts for the majority of retracted scientific publications. Proc. Natl. Acad. Sci. USA 2012, 109, 17028–17033. [Google Scholar] [CrossRef][Green Version]
  16. Decullier, E.; Huot, L.; Samson, G.; Maisonneuve, H. Visibility of retractions: A cross-sectional one-year study. BMC Res. Notes 2013, 6, 238. [Google Scholar] [CrossRef][Green Version]
  17. Wager, E.; Williams, P. Why and how do journals retract articles? An analysis of Medline retractions 1988–2008. J. Med. Ethics 2011, 37, 567–570. [Google Scholar] [CrossRef][Green Version]
  18. Nath, S.B.; Marcus, S.C.; Druss, B.G. Retractions in the research literature: Misconduct or mistakes? Med. J. Aust. 2006, 185, 152–154. [Google Scholar] [CrossRef]
  19. Redman, B.K.; Yarandi, H.N.; Merz, J.F. Empirical developments in retraction. J. Med. Ethics 2008, 34, 807–809. [Google Scholar] [CrossRef][Green Version]
  20. Samp, J.C.; Schumock, G.T.; Pickard, A.S. Retracted Publications in the Drug Literature. Pharmacother. J. Hum. Pharmacol. Drug Ther. 2012, 32, 586–595. [Google Scholar] [CrossRef]
  21. Steen, R.G.; Casadevall, A.; Fang, F.C. Why Has the Number of Scientific Retractions Increased? PLoS ONE 2013, 8, e68397. [Google Scholar] [CrossRef]
  22. Madlock-Brown, C.R.; Eichmann, D. The (lack of) Impact of Retraction on Citation Networks. Sci. Eng. Ethics 2015, 21, 127–137. [Google Scholar] [CrossRef]
  23. Mongeon, P.; Larivière, V. Costly collaborations: The impact of scientific fraud on co-authors’ careers. J. Assoc. Inf. Sci. Technol. 2016, 67, 535–542. [Google Scholar] [CrossRef][Green Version]
  24. Rosenkrantz, A.B. Retracted Publications within Radiology Journals. Am. J. Roentgenol. 2016, 206, 231–235. [Google Scholar] [CrossRef] [PubMed]
  25. Bhatt, B. A multi-perspective analysis of retractions in life sciences. Scientometrics 2021, 126, 4039–4054. [Google Scholar] [CrossRef]
  26. Williams, P.; Wager, E. Exploring Why and How Journal Editors Retract Articles: Findings from a Qualitative Study. Sci. Eng. Ethics 2011, 19, 1–11. [Google Scholar] [CrossRef]
  27. Bilbrey, E.; O’Dell, N.; Creamer, J. A Novel Rubric for Rating the Quality of Retraction Notices. Publications 2014, 2, 14–26. [Google Scholar] [CrossRef][Green Version]
  28. Cox, A.; Craig, R.; Tourish, D. Retraction statements and research malpractice in economics. Res. Policy 2018, 47, 924–935. [Google Scholar] [CrossRef][Green Version]
  29. Coudert, F.-X. Correcting the Scientific Record: Retraction Practices in Chemistry and Materials Science. Chem. Mater. 2019, 31, 3593–3598. [Google Scholar] [CrossRef][Green Version]
  30. da Silva, J.A.T. Silent or Stealth Retractions, the Dangerous Voices of the Unknown, Deleted Literature. Publ. Res. Q. 2016, 32, 44–53. [Google Scholar] [CrossRef]
  31. Shelomi, M. Editorial Misconduct—Definition, Cases, and Causes. Publications 2014, 2, 51–60. [Google Scholar] [CrossRef][Green Version]
  32. Grieneisen, M.L.; Zhang, M. A Comprehensive Survey of Retracted Articles from the Scholarly Literature. PLoS ONE 2012, 7, e44118. [Google Scholar] [CrossRef]
  33. Bar-Ilan, J.; Halevi, G. Temporal characteristics of retracted articles. Scientometrics 2018, 116, 1771–1783. [Google Scholar] [CrossRef]
  34. Resnik, D.B.; Wager, E.; Kissling, G.E. Retraction policies of top scientific journals ranked by impact factor. J. Med. Libr. Assoc. JMLA 2015, 103, 136–139. [Google Scholar] [CrossRef][Green Version]
  35. Fanelli, D. Why Growing Retractions Are (Mostly) a Good Sign. PLoS Med. 2013, 10, e1001563. [Google Scholar] [CrossRef][Green Version]
  36. Da Silva, J.A.T.; Dobránszki, J. Problems with Traditional Science Publishing and Finding a Wider Niche for Post-Publication Peer Review. Account. Res. 2015, 22, 22–40. [Google Scholar] [CrossRef]
  37. Ali, P.A.; Watson, R. Peer review and the publication process. Nurs. Open 2016, 3, 193–202. [Google Scholar] [CrossRef]
  38. COPE Council. COPE Guidelines: Retraction Guidelines. 2019. Available online: https://publicationethics.org/files/retraction-guidelines-cope.pdf (accessed on 1 December 2021).
  39. Kleinert, S. COPE’s retraction guidelines. Lancet 2009, 374, 1876–1877. [Google Scholar] [CrossRef]
  40. Tripathi, M.; Sonkar, S.K.; Kumar, S. A cross sectional study of retraction notices of scholarly journals of science. DESIDOC J. Libr. Inf. Technol. 2019, 39, 74–81. [Google Scholar] [CrossRef]
  41. Fanelli, D. How Many Scientists Fabricate and Falsify Research? A Systematic Review and Meta-Analysis of Survey Data. PLoS ONE 2009, 4, e5738. [Google Scholar] [CrossRef][Green Version]
  42. Poulton, A. Mistakes and misconduct in the research literature: Retractions just the tip of the iceberg. Med. J. Aust. 2007, 186, 323–324. [Google Scholar] [CrossRef]
  43. Oransky, I.; Fremes, S.E.; Kurlansky, P.; Gaudino, M. Retractions in medicine: The tip of the iceberg. Eur. Heart J. 2021, 42, 4205–4206. [Google Scholar] [CrossRef]
  44. Horbach, S.P.J.M.; Halffman, W. The ability of different peer review procedures to flag problematic publications. Scientometrics 2019, 118, 339–373. [Google Scholar] [CrossRef] [PubMed][Green Version]
  45. Knoepfler, P. Reviewing post-publication peer review. Trends Genet. 2015, 31, 221–223. [Google Scholar] [CrossRef] [PubMed][Green Version]
  46. National Library of Medicine. How do I find a list of journals in PubMed? 2022. Available online: https://support.nlm.nih.gov/knowledgebase/article/KA-04961/en-us (accessed on 9 January 2022).
  47. Karger Publishers: Journals—Current Program. 2022. Available online: https://www.karger.com/Journal/IndexListAZ (accessed on 9 January 2022).
  48. Makin, T.R.; De Xivry, J.-J.O. Ten common statistical mistakes to watch out for when writing or reviewing a manuscript. eLife 2019, 8, 1–13. [Google Scholar] [CrossRef]
  49. Schroter, S.; Black, N.; Evans, S.; Godlee, F.; Osorio, L.; Smith, R. What errors do peer reviewers detect, and does training improve their ability to detect them? J. R. Soc. Med. 2008, 101, 507–514. [Google Scholar] [CrossRef] [PubMed][Green Version]
  50. Bik, E.M.; Casadevall, A.; Fang, F.C. The Prevalence of Inappropriate Image Duplication in Biomedical Research Publications. mBio 2016, 7, e00809–e00816. [Google Scholar] [CrossRef] [PubMed][Green Version]
  51. Koppers, L.; Wormer, H.; Ickstadt, K. Towards a Systematic Screening Tool for Quality Assurance and Semiautomatic Fraud Detection for Images in the Life Sciences. Sci. Eng. Ethics 2016, 23, 1113–1128. [Google Scholar] [CrossRef][Green Version]
  52. Bucci, E.M. Automatic detection of image manipulations in the biomedical literature. Cell Death Dis. 2018, 9, 400. [Google Scholar] [CrossRef][Green Version]
  53. Bik, E.M.; Fang, F.C.; Kullas, A.L.; Davis, R.J.; Casadevall, A. Analysis and Correction of Inappropriate Image Duplication: The Molecular and Cellular Biology Experience. Mol. Cell. Biol. 2018, 38, e00309-18. [Google Scholar] [CrossRef][Green Version]
  54. Christopher, J. Systematic fabrication of scientific images revealed. FEBS Lett. 2018, 592, 3027–3029. [Google Scholar] [CrossRef][Green Version]
  55. Sabir, E.; Nandi, S.; AbdAlmageed, W.; Natarajan, P. BioFors: A Large Biomedical Image Forensics Dataset. arXiv 2021, arXiv:2108.12961. [Google Scholar]
  56. Zimba, O.; Gasparyan, A. Plagiarism detection and prevention: A primer for researchers. Reumatologia 2021, 59, 132–137. [Google Scholar] [CrossRef]
  57. Foltýnek, T.; Meuschke, N.; Gipp, B. Academic Plagiarism Detection. ACM Comput. Surv. 2020, 52, 1–42. [Google Scholar] [CrossRef][Green Version]
  58. Sagar, K.; Sharvari, G.; Dhiraj, A. Analysis of Plagiarism Detection Tools and Methods. SSRN J. 2021. [Google Scholar] [CrossRef]
  59. Wlodawer, A.; Dauter, Z.; Porebski, P.; Minor, W.; Stanfield, R.; Jaskolski, M.; Pozharski, E.; Weichenberger, C.X.; Rupp, B. Detect, correct, retract: How to manage incorrect structural models. FEBS J. 2018, 285, 444–466. [Google Scholar] [CrossRef][Green Version]
  60. Marks, D.F. The Hans Eysenck affair: Time to correct the scientific record. J. Health Psychol. 2019, 24, 409–420. [Google Scholar] [CrossRef][Green Version]
  61. Pelosi, A.J. Personality and fatal diseases: Revisiting a scientific scandal. J. Health Psychol. 2019, 24, 421–439. [Google Scholar] [CrossRef][Green Version]
  62. Cooper, A.N.; Dwyer, J.E. Maintaining the integrity of the scientific record: Corrections and best practices at The Lancet group. Eur. Sci. Ed. 2021, 47, e62065. [Google Scholar] [CrossRef]
  63. Kiang, N.Y.-S. How are scientific corrections made? Sci. Eng. Ethics 1995, 1, 347–356. [Google Scholar] [CrossRef]
Figure 1. Median values (in months) of the top three retraction reasons for the top 11 publishers.
Figure 1. Median values (in months) of the top three retraction reasons for the top 11 publishers.
Publications 10 00018 g001
Table 1. SDTP score components.
Table 1. SDTP score components.
Component and Rationale for Inclusion
1Speed Speed of retraction, measured in months [38]
2Detection rate Percentage of total retracted articles for which the retraction was initiated by/involved the editor, the editorial board, the publisher, or institutions, without authors [38]
3TransparencyaThe percentage of papers available online (“Retracted articles should not be removed from electronic archives or printed copies of the journal.” [39]).
bThe retraction note contains the reasons [38]
4PrecisionaIndividual responsibility of authors was clearly stated [38]. Articles with more than one author and no editorial errors were analyzed.
bPercentage of retractions for reasons other than editorial errors (not attributable to editorial errors).
Table 2. Publishers with more than 10 retracted articles.
Table 2. Publishers with more than 10 retracted articles.
PublisherArticlesJournals/Journal
1ELSEVIER8463092.74
2SPRINGER NATURE7493052.46
3WOLTERS KLUWER3461422.45
4WILEY-BLACKWELL3251711.9
5PLOS275739.28
6SAGE248475.28
7TAYLOR AND FRANCIS223952.35
8HINDAWI140522.69
9DOVE MEDICAL PRESS111234.83
10E-CENTURY PUBLISHING71710.14
11SPANDIDOS PUBLICATIONS71710.14
12OXFORD UNIVERSITY PRESS65391.67
13MARY ANN LIEBERT63262.42
14VERDUCI EDITORE56156
15AMERICAN CHEMICAL SOCIETY53173.12
16AMERICAN ASSOCIATION FOR CANCER RESEARCH4968.17
17AMERICAN SOCIETY FOR MICROBIOLOGY44104.4
18FRONTIERS MEDIA44182.44
19MULTIDISCIPLINARY DIGITAL PUBLISHING INSTITUTE42172.47
20NATIONAL ACADEMY OF SCIENCES36136
21PORTLAND PRESS32310.67
22BMJ PUBLISHING GROUP29171.7
23AMERICAN PHYSIOLOGICAL SOCIETY2683.25
24FUNDACAO DE PESQUISAS CIENTIFICAS DE RIBEIRAO PRETO20120
25AMERICAN MEDICAL ASSOCIATION19121.58
26SOCIETY FOR NEUROSCIENCE18118
27AMERICAN ASSOCIATION FOR THE ADVANCEMENT OF SCIENCE1735.66
28AMERICAN SOCIETY OF HEMATOLOGY17117
29AMERICAN SOCIETY FOR PHARMACOLOGY AND EXPERIMENTAL THERAPEUTICS1635.33
30BENTHAM1682
31CELL PHYSIOL BIOCHEM PRESS16116
32IMPACT JOURNALS1628
33MEDICAL SCIENCE INTERNATIONAL PUBLISHING16116
34AMERICAN DIABETES ASSOCIATION1527.5
35AMERICAN SOCIETY OF CLINICAL INVESTIGATION15115
36CUREUS, INC.14114
37THE COMPANY OF BIOLOGISTS LTD.1326.5
38KOWSAR PUBLISHING COMPANY1234
39KARGER1281.5
40AME PUBLISHING COMPANY1142.75
41AMERICAN THORACIC SOCIETY1133.66
42ASSOCIACAO BRASILEIRA DE DIVULGACAO CIENTIFICA11111
43FUTURE MEDICINE LTD1191.22
44INTERNATIONAL INSTITUTE OF ANTICANCER RESEARCH11111
45IOS Press1052
Table 3. Retracted articles and retraction notes by year for top 11 publishers.
Table 3. Retracted articles and retraction notes by year for top 11 publishers.
PublisherN2009201020112012201320142015201620172018201920202021
ELSEVIER8466887821118277706370684721
Retraction Notes 6153842705378100841161121293
SPRINGER NATURE7514864466874116876949475132
Retraction NotesRN818374552431239461587312712
WOLTERS KLUWER346333433343240273327192014
Retraction NotesRN412222621413240283044460
WILEY-BLACKWELL323444026393024301715181921
Retraction NotesRN1215212423262237244118564
PLOS27581128446456251111683
Retraction NotesRN120597715184392733
SAGE2485671225606728711119
Retraction NotesRN00872124021119911190
TAYLOR AND FRANCIS22319133221181410158154018
Retraction NotesRN639361118122117521595
HINDAWI14012915223725155630
Retraction NotesRN00006171421171420301
DOVE MEDICAL PRESS11124101137988121720
Retraction NotesRN00052431111614541
E-CENTURY PUBLISHING710000283466384
Retraction NotesRN000010924659170
SPANDIDOS PUBLICATIONS71001556718151103
Retraction NotesRN00021451171310180
RN for top publishers 3765135192198225345395392340424628
Total number of RN 5497195282300337481570544501570878
The top 11 publishers hold 70.3% (3405/4844) of retracted articles and 66% (1165/1767) of journals. The annual distribution of retraction notes varies between 66% of the total (2013) and 74% (2019).
Table 4. Retraction reasons for top 11 publishers (3405 retracted papers, 70.3%).
Table 4. Retraction reasons for top 11 publishers (3405 retracted papers, 70.3%).
Publisher MISTAKESIMAGESPLAGIARISMOVERLAPFRAUDETHICSAUTHORSHIPUNCLEAREDITORPROPERTYOTHER
ELSEVIER84632.98%33.2%9.8%10.87%2.6%6.38%4.73%5.79%4.25%1.3%1.3%
N279281839222544049361111
SPRINGER NATURE74929.64%17.62%18.42%12.02%17.09%8.94%5.74%0.5%2.4%4.67%1.07%
N222132138901286743418358
WOLTERS KLUWER34628.61%6.94%24.85%15.6%1.73%8.67%6.36%6.65%6.94%1.73%1.73%
N9924865463022232466
WILEY-BLACKWELL32536%18.8%9.2%14.5%1.5%9.5%7.1%2.2%6.5%2.2%1.8%
N1176130475312372176
PLOS27529.5%63.3%4%4%1.1%16.4%5.1%--0.7%1.1%0.4%
N81174111134514--231
SAGE24816.9%0.8%9.3%8.1%62.9%4.8%7.3%1.2%2.4%1.2%--
N42223201561218363--
TAYLOR AND FRANCIS22326.9%11.7%13%17%13%6.7%10.3%1.8%6.7%2.7%1.3%
N6026293829152341563
HINDAWI14022.1%27.1%33.6%16.4%0.7%9.3%10%0.7%--2.1%--
N31384723113141--3--
DOVE MEDICAL PRESS11127.9%39.6%14.4%16.2%2.7%14.4%4.5%----1.8%--
N314416183165----2--
E-CENTURY PUBLISHING7126.8%2.8%36.6%1.4%8.5%2.8%4.2%--25.4%1.4%--
N192261623--181--
SPANDIDOS PUBLICATIONS7133.8%29.6%28.2%5.6%1.4%7%11.3%----1.4%1.4%
N2421204158----11
Table 5. Mistakes/Inconsistent data per publisher.
Table 5. Mistakes/Inconsistent data per publisher.
ArticlesET (95% CI)MedianRangeIQR
Elsevier
Springer Nature
Wolters Kluwer
Wiley-Blackwell
PLOS
SAGE
Taylor & Francis
Hindawi
Dove Medical Press
E-Century Publishing
Spandidos Publications
27929.03 (25.87–32.19)2013333
22227.76 (24.04–31.48)1714029
9919.69 (15.26–24.12)119916
11726.19 (22.45–29.93)239826
8141.58 (34.92–48.24)4311757
4221.36 (16.05–26.67)176127
6015.43 (11.34–19.53)127216
3129.74 (20.24–39.25)168141
3131.90 (16.62–47.19)913946
1922.58 (15.48–29.68)264826
2417.17 (9.68–24.65)97020
Table 6. Evolution of the PLOS image retractions.
Table 6. Evolution of the PLOS image retractions.
Retraction Note YearTotal
2010201320142015201620172018201920202021
Publication Year200900000002204
2010100010224010
2011010021755021
201202000221110128
201300120381121248
201400002471712042
2015000102134011
201600000003104
201700000011103
201800000011103
Total13135122956613174
Table 7. Elsevier image retractions.
Table 7. Elsevier image retractions.
Retraction Note YearTotal
201020112012201320142015201620172018201920202021
Publication Year200911211143322021
2010112233163112035
201100033474464035
2012002311107335035
201300023526227029
201400001325544024
201500000144685028
2016000000254105026
201700000002673119
201800000000268117
2019000000000111012
Total22611121847393650562281
Table 8. Springer Nature image retractions.
Table 8. Springer Nature image retractions.
Retraction Note YearTotal
200920112012201320142015201620172018201920202021
Publication Year20091311001000018
20100221101100109
20110212001120009
201200122011021010
20130001021001218
201400002111414014
201500000023322012
201600000013291117
2017000000002510118
20180000000005207
2019000000000213116
20200000000000404
Total1757539101327405132
Table 9. Image retractions by publisher.
Table 9. Image retractions by publisher.
ArticlesET (95% CI)MedianRangeIQR
Elsevier28150.40 (46.84–53.95)4813350
Springer Nature13235.20 (30.54–39.86)2814731
Wolters Kluwer2457.04 (40.12–73.97)63.512779
Wiley-Blackwell6155.11 (46.39–63.84)5112760
PLOS17470.39 (66.40–74.37)7312833
SAGE257.5
Taylor & Francis2653.92 (40.74–67.11)4312646
Hindawi3850.76 (43.42–58.11)48.510029
Dove Medical Press4419.36 (12.18–26.55)910317
E-Century Publishing222.5
Spandidos Publications2122.57 (11.71–33.44)118722
Table 10. Exposure time (ET) for plagiarism and overlap.
Table 10. Exposure time (ET) for plagiarism and overlap.
PlagiarimOverlapCasesArticlesET (95% CI)MedianRangeIQR
Elsevier839217517426.7 (23.43–29.97)2011426
Springer Nature1389022822223.34 (20.92–25.75)1910024
Wolters Kluwer865414013924.08 (20.04–28.12)1510535
Wiley-Blackwell3047777725.17 (19.44–30.9)1712426
PLOS1111222137.1 (23.58–50.62)378155
SAGE2320434122.83 (17.52–28.14)176516
Taylor & Francis2938676623.14 (17.58–28.69)15.59628
Hindawi4723706940.14 (33.02–47.27)4215150
Dove Medical Press1618343317.55 (10.7–24.4)108423
E-Century Publishing261272724.56 (17.04–32.08)226334
Spandidos Publications204242430.13 (20.89–39.36)33.58240
Table 11. Changes of main components of SDTP score for 2018–2020.
Table 11. Changes of main components of SDTP score for 2018–2020.
201820192020
Speed (ET in months)24.6526.7428.89 Publications 10 00018 i001
Detection54.3%55.5%54.7% Publications 10 00018 i002
Transparency—online article68.5%70.7%72.1% Publications 10 00018 i002
Transparency—clear retraction note93.8%94.3%94.9% Publications 10 00018 i002
Precision—authors role12.8%11.9%10.5% Publications 10 00018 i001
Precision—no editorial errors95.7%95.8%96.3% Publications 10 00018 i002
Publications 10 00018 i001 = reduction Publications 10 00018 i002 = improvement
Table 12. SDTP scores and rank for the period 2009–2020 (n = 4844). * Calculated for 4427 articles with more than one author and no editorial errors as retraction reasons.
Table 12. SDTP scores and rank for the period 2009–2020 (n = 4844). * Calculated for 4427 articles with more than one author and no editorial errors as retraction reasons.
ScoreRankSpeedDetection
(REP)
Transparency Precision
ET Online articleClear retraction noteAuthors role clear *No Editor errors
Baseline4844 28.8954.7%72.1%94.9%10.5%96.3%
Elsevier846 33.3945.5%88.2%94.2%11%95.7%
Points−6.57−15.6−16.822.3−0.74.9−0.6
Springer749 25.0257.5%62.5%99.5%5.8%97.6%
Points−32.8913.4 5.2 −13.3 4.8 −44.2 1.3
Wolters346 23.466.2%70.8%93.4%6.293.1%
Points−7.481921−1.8−1.6−40.7−3.3
Wiley325 28.739.4%43.4%97.8%11.3%93.5%
Points−59.2110.7−28−39.83.17.7−2.9
PLOS275 57.7680.4%100%100%5.9%99.3%
Points−49.810−99.946.938.75.4−443.1
SAGE248 24.9586.3%22.2%98.8%12.4597.6%
Points26513.657.7−69.24.118.51.3
Taylor and Francis223 2365.9%39%98.2%19.893.3%
Points82.9120.420.5−46.93.588.5−3.1
Hindawi140 37.1872.9%99.3%99.3%10.4100%
Points50.22−28.733.237.74.6−0.53.9
Dove Medical Press111 28.4374.8%100%100%6.5%100%
Points48.131.636.738.75.4−38.23.9
E−Century Publishing71 18.1878.9%100%100%0%74.6%
Points2.9637.144.238.75.4−100−22.5
Spandidos Publications71 22.9443.7%98.6%100%9.8%100%
Points40.4420.6−20.136.75.4−6.13.9
Rest of publishers1439 25.7448.6%75.4%89.2%13.5%97.2%
27.9(5)10.9−11.24.6−628.70.9
Table 13. SDTP scores and rank for the period 2009–2019 (n = 3931). * Calculated for 3569 articles with more than one author and no editorial error as retraction reason.
Table 13. SDTP scores and rank for the period 2009–2019 (n = 3931). * Calculated for 3569 articles with more than one author and no editorial error as retraction reason.
ScoreRankSpeedDetection
(REP)
Transparency Precision
ET Online articleClear retraction noteAuthors role clear *No Editor errors
Baseline3931 26.7455.5%70.7%94.3%11.9%95.8%
Elsevier714 31.5846.1%88.7%93.4%11.8%95.7%
Points−11.87−18.1−16.925.5−0.9−1.3−0.1
Springer611 22.6658.6%60.6%99.5%6.7%97.2%
Points−30.3915.25.7−14.35.5−43.91.5
Wolters300 21.5467%70.3%92.7%7.1%93.7%
Points−4.5619.420.8−0.6−1.7−40.2−2.2
Wiley264 28.4239%40.9%97.312.9%92%
Points−70.911−6.3−29.6−42.13.27.9−4
PLOS199 50.0274.4%100%100%8.2%99%
Points−3410−87.134.141.46−31.73.3
SAGE229 25.4488.2%21.4%98.7%10.5%98.7%
Points−10.184.959.1−69.74.7−12.13
Taylor and Francis159 21.6762.3%27%97.5%28.1%91.2%
Points103.711912.3−61.83.4135.6−4.8
Hindawi109 28.8268.8%99.1%100%12.5%100%
Points71.53−7.824.140.264.64.4
Dove Medical Press56 18.1371.4%100%100%9.2%100%
Points90.3232.228.841.46−22.54.4
E−Century Publishing54 13.6785.2%100%100%066.7%
Points19.5548.953.641.46−100−30.4
Spandidos Publications53 16.5735.8%98.1%100%11.3%100%
Points46.5438−35.438.76−5.24.4
Rest of publishers1183 25.1747.3%75.7%88.3%15.3%96.7%
20.9(5)5.9−14.67.1−6.4280.9
Table 14. SDTP scores and rank for the period 2009–2018 (n = 3361). * Calculated for 3037 articles with more than one author and no editorial errors as retraction reasons.
Table 14. SDTP scores and rank for the period 2009–2018 (n = 3361). * Calculated for 3037 articles with more than one author and no editorial errors as retraction reasons.
ScoreRankSpeedDetection
(REP)
Transparency Precision
ET Online articleClear retraction noteAuthors role clear *No Editor errors
Baseline3361 24.6554.3%68.5%93.8%12.8%95.7%
Elsevier602 29.9945.5%88.5%92.5%12.6%95.3%
Points−8.38−21.7−16.229.2−1.4−2.2−0.4
Springer538 21.8859.1%58.2%99.4%7.2%97.8%
Points−30.71011.28.9−156−442.2
Wolters256 17.3865.6%67.2%92.2%8.4%93.4%
Points10.2629.520.9−1.9−1.7−34.6−2
Wiley246 26.8739%39.4%97.2%12.9%92.3%
Points−78.111−9−28.1−42.53.60.5−3.6
PLOS107 34.656.1%100%100%15.1%99.1%
Points36.54−40.43.3466.617.43.6
SAGE218 25.8989.4%21.6%98.6%10%99.5%
Points−21.99−564.7−68.55.1−22.24
Taylor and Francis138 22.4259.4%25.4%97.1%31.6%91.3%
Points100.4299.4−62.93.5146−4.6
Hindawi89 24.1867.4%98.9%100%14.1%100%
Points91.431.924.244.46.69.84.5
Dove Medical Press42 18.8671.4%100%100%12.5%100%
Points109.4123.531.5466.6−2.74.5
E−Century Publishing45 15.2784.4%100%100%0%60%
Points8.9738.155.5466.6−100−37.3
Spandidos Publications43 14.2634.9%97.7%100%9.3%100%
Points31.8542.1−35.842.66.6−27.64.5
Rest of publishers1037 24.3847.3%75.4%87.8%15.6%96.5%
14.2(5)1.1−12.810.1−6.421.40.8
Table 15. Elsevier 2018–2020 change of SDTP score.
Table 15. Elsevier 2018–2020 change of SDTP score.
Elsevier20182020
Articles602846+40.5%
Speed−21.7−15.6 Publications 10 00018 i003
Detection rate−16.2−16.9 Publications 10 00018 i004
Transparency—online article29.222.3 Publications 10 00018 i004
Transparency—clear retraction note−1.4−0.7 Publications 10 00018 i003
Precision—authors role−2.24.9 Publications 10 00018 i003
Precision—no editorial errors−0.4−0.6 Publications 10 00018 i004
General−8.3−6.5 Publications 10 00018 i003
Rank87
Table 16. Springer Nature 2018–2020 change of SDTP score.
Table 16. Springer Nature 2018–2020 change of SDTP score.
Springer Nature20182020
Articles538749+39.2%
Speed11.213.4 Publications 10 00018 i003
Detection rate8.95.2 Publications 10 00018 i004
Transparency—online article−15−13.3 Publications 10 00018 i003
Transparency—clear retraction note64.8 Publications 10 00018 i004
Precision—authors role−44−44.2 Publications 10 00018 i004
Precision—no editorial errors2.21.3 Publications 10 00018 i004
General−30.7−32.8 Publications 10 00018 i004
Rank109
Table 17. Wolters Kluwer 2018–2020 change of SDTP score.
Table 17. Wolters Kluwer 2018–2020 change of SDTP score.
Wolters Kluwer20182020
Articles256346+35.1%
Speed29.519 Publications 10 00018 i003
Detection rate20.921 Publications 10 00018 i005
Transparency—online article−1.9−1.8 Publications 10 00018 i005
Transparency—clear retraction note−1.7−1.6 Publications 10 00018 i005
Precision—authors role−34.6−40.7 Publications 10 00018 i004
Precision—no editorial errors−2−3.3 Publications 10 00018 i004
General10.2−7.4 Publications 10 00018 i004
Rank68
Table 18. Wiley-Blackwell 2018–2020 change of SDTP score.
Table 18. Wiley-Blackwell 2018–2020 change of SDTP score.
Wiley-Blackwell20182020
Articles246325+32.1%
Speed–90.7 Publications 10 00018 i003
Detection rate–28.1–28 Publications 10 00018 i005
Transparency—online article–42.5–39.8 Publications 10 00018 i003
Transparency—clear retraction note3.63.1 Publications 10 00018 i004
Precision—authors role0.57.7 Publications 10 00018 i003
Precision—no editorial errors–3.6–2.9 Publications 10 00018 i003
General–78.1–59.2 Publications 10 00018 i003
Rank1111
Table 19. PLOS 2018–2020 change of SDTP score.
Table 19. PLOS 2018–2020 change of SDTP score.
PLOS20182020
Articles107275+157%
Speed−40.4−99.9 Publications 10 00018 i004
Detection rate3.346.9 Publications 10 00018 i003
Transparency—online article4638.7 Publications 10 00018 i004
Transparency—clear retraction note6.65.4 Publications 10 00018 i004
Precision—authors role17.4−44 Publications 10 00018 i004
Precision—no editorial errors3.63.1 Publications 10 00018 i004
General36.5−49.8 Publications 10 00018 i004
Rank410
Table 20. SAGE 2018–2020 change of SDTP score.
Table 20. SAGE 2018–2020 change of SDTP score.
SAGE20182020
Articles218248+13.8%
Speed−513.6 Publications 10 00018 i003
Detection rate64.757.7 Publications 10 00018 i004
Transparency—online article−68.5−69.2 Publications 10 00018 i004
Transparency—clear retraction note5.14.1 Publications 10 00018 i004
Precision—authors role−23.2−25.4 Publications 10 00018 i004
Precision—no editorial errors−22.218.5 Publications 10 00018 i003
General−21.926 Publications 10 00018 i003
Rank95
Table 21. Taylor & Francis 2018–2020 change of SDTP score.
Table 21. Taylor & Francis 2018–2020 change of SDTP score.
Taylor&Francis20182020
Articles138223+61.6%
Speed920.4 Publications 10 00018 i003
Detection rate9.420.5 Publications 10 00018 i003
Transparency—online article–62.9–46.9 Publications 10 00018 i003
Transparency—clear retraction note3.53.5 Publications 10 00018 i005
Precision—authors role14688.5 Publications 10 00018 i004
Precision—no editorial errors–4.6–3.1 Publications 10 00018 i003
General100.482.9 Publications 10 00018 i004
Rank21
Table 22. Hindawi 2018–2020 change of SDTP score.
Table 22. Hindawi 2018–2020 change of SDTP score.
Hindawi20182020
Articles89140+57.3%
Speed1.9−28.7 Publications 10 00018 i004
Detection rate24.233.2 Publications 10 00018 i003
Transparency—online article44.437.7 Publications 10 00018 i004
Transparency—clear retraction note6.64.6 Publications 10 00018 i004
Precision—authors role9.8−0.5 Publications 10 00018 i004
Precision—no editorial errors4.53.9 Publications 10 00018 i004
General91.450.2 Publications 10 00018 i004
Rank32
Table 23. Dove Medical Press 2018–2020 change of SDTP score.
Table 23. Dove Medical Press 2018–2020 change of SDTP score.
Dove Medical Press20182020
Articles42111+164.3%
Speed23.51.6 Publications 10 00018 i004
Detection rate31.536.7 Publications 10 00018 i003
Transparency—online article4638.7 Publications 10 00018 i004
Transparency—clear retraction note6.65.4 Publications 10 00018 i004
Precision—authors role−2.7−38.2 Publications 10 00018 i004
Precision—no editorial errors4.53.9 Publications 10 00018 i004
General109.448.1 Publications 10 00018 i004
Rank12
Table 24. E-Century Publishing 2018–2020 change of SDTP score.
Table 24. E-Century Publishing 2018–2020 change of SDTP score.
E-Century Publishing20182020
Articles4571+57.8%
Speed38.137.1 Publications 10 00018 i004
Detection rate55.544.2 Publications 10 00018 i004
Transparency—online article4638.7 Publications 10 00018 i004
Transparency—clear retraction note6.65.4 Publications 10 00018 i004
Precision—authors role−100−100 Publications 10 00018 i005
Precision—no editorial errors−37.3−22.5 Publications 10 00018 i003
General8.92.9 Publications 10 00018 i004
Rank76
Table 25. Spandidos Publications 2018–2020 change of SDTP score.
Table 25. Spandidos Publications 2018–2020 change of SDTP score.
Spandidos Publications20182020
Articles4371+73.2%
Speed42.120.6 Publications 10 00018 i004
Detection rate−35.8−20.1 Publications 10 00018 i003
Transparency—online article42.636.7 Publications 10 00018 i004
Transparency—clear retraction note6.65.4 Publications 10 00018 i004
Precision—authors role−27.6−6.1 Publications 10 00018 i003
Precision—no editorial errors4.53.9 Publications 10 00018 i004
General31.840.4 Publications 10 00018 i003
Rank54
Table 26. Group of publishers below 11th place 2018–2020 change of SDTP score.
Table 26. Group of publishers below 11th place 2018–2020 change of SDTP score.
All Other Publishers20182020
Articles10371439+38.7%
Speed1.110.9 Publications 10 00018 i003
Detection rate−12.8−11.2 Publications 10 00018 i003
Transparency—online article10.14.6 Publications 10 00018 i004
Transparency—clear retraction note−6.4−6 Publications 10 00018 i003
Precision—authors role21.428.7 Publications 10 00018 i003
Precision—no editorial errors0.80.9 Publications 10 00018 i005
General14.227.9 Publications 10 00018 i003
Rank55
Table 27. Best SDTP score performance 2009–2020.
Table 27. Best SDTP score performance 2009–2020.
Points
ETE-Century Publishing37.1
DetectionSAGE57.7
Transparency (online article)PLOS, Dove Medical Press, E-Century Publishing38.7
Transparency (clear retraction notes)PLOS, Dove Medical Press, E-Century Publishing and Spandidos Publications5.4
Precision (authors role)Taylor & Francis88.5
Precision (no editorial errors)Hindawi, Dove Medical Press, Spandidos Publications3.9
Table 28. Worst SDTP score performances 2009–2020.
Table 28. Worst SDTP score performances 2009–2020.
Points
ETPLOS−99.9
DetectionWiley-Blackwell−28
Transparency (online article)SAGE−69.2
Transparency (clear retraction notes)Rest of publishers−6
Precision (authors role)E-Century Publishing−100
Precision (no editorial errors)E-Century Publishing−22.5
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Toma, C.; Padureanu, L.; Toma, B. Correction of the Scientific Production: Publisher Performance Evaluation Using a Dataset of 4844 PubMed Retractions. Publications 2022, 10, 18. https://doi.org/10.3390/publications10020018

AMA Style

Toma C, Padureanu L, Toma B. Correction of the Scientific Production: Publisher Performance Evaluation Using a Dataset of 4844 PubMed Retractions. Publications. 2022; 10(2):18. https://doi.org/10.3390/publications10020018

Chicago/Turabian Style

Toma, Catalin, Liliana Padureanu, and Bogdan Toma. 2022. "Correction of the Scientific Production: Publisher Performance Evaluation Using a Dataset of 4844 PubMed Retractions" Publications 10, no. 2: 18. https://doi.org/10.3390/publications10020018

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop