Introduction
The field of epigenetics has experienced remarkable research attention in recent years and there have been major advances in elucidation of how histone modification can regulate gene expression. Proteins in the sirtuin family are highly conserved from bacteria to humans. These proteins play an important role in epigenetic regulation. Epigenetics involves genetic control by factors other than an individual's DNA sequence.1 Epigenetics is involved in many normal cellular processes. Consider the fact that our cells all have the same DNA, but our bodies contain many different types of cells: neurons, liver cells, pancreatic cells, inflammatory cells, and others. Epigenetic processes are natural and essential to many organism functions, but if they occur improperly, there can be major adverse health and behavioral effects.2 While epigenetic changes are required for normal development and health, they can also be responsible for some disease states. The first human disease to be linked to epigenetics was cancer, in 1983.
Many bibliometric analysis methods and tools have been developed to help researchers in different research fields. Statistical analysis of the strength of this relationship between articles can help researchers identify the intellectual base of the discipline, important authors, and other bibliometric information. This article aims to sort out and summarize the research on epigenetics through bibliometric analysis.
Review of Literature
Kringel et al analysed the pubmed database search yielded 3,051 hits on epigenetics and drugs, starting in 1992 and peaking in 2016.3 Annual citations increased to a plateau in 2000 and show a downward trend since 2008. Approved and investigational drugs in the Drug Bank database included 122 compounds that interacted with 68 unique epigenetic enzymes. Mills and Rahal reviewed of genome-wide association studies (GWAS) from 2005 to 2018 (3,639 studies; 3,508 traits) reveals extraordinary increases in sample sizes, rates of discovery and traits studied.4 A longitudinal examination shows fluctuating ancestral diversity, still predominantly European Ancestry (88% in 2017) with 72% of discoveries from participants recruited from three countries (US, UK, Iceland). US agencies, primarily NIH, fund 85% and women are less often senior authors.
Carlos Olmeda Gomez analysed the literature on epigenetics published between 2009 and 2017 using qualitative and visualization techniques.5 CiteSpace V software was used to establish an intellectual overview, based on 13,295 scientific articles and review papers were drawn from Web of science core collection in January 2018. Document co-citations were analysed and a variety of graphics was created. Zhang et al analysed the DNA methylation in CVDs from January 1, 2001, to September 15, 2021, were searched and confirmed from the Web of Science.6 A total of 2,617 publications were included in 912 academic journals by 15,584 authors from 963 institutions from 85 countries/regions. Among them, the United States of America, China, and England were the top 3 countries contributing to the field of DNA methylation.
Barbosa et al performed a quantitative analysis regarding genetic polymorphisms on the GMR scientific production between 2009 and 2013.7 We used the keywords polymorphism AND genetics OR molecular marker in order to conduct the literature survey. We found 423 articles related to genetic polymorphism and 87% were original articles. Six countries account for about 89% of publications and China is responsible for 56% of all publications. Shao et al elucidated the status of oncology research from 2001 to 2010.8 Studies published in 30 representative oncology journals were retrieved from the Web of Science (2001-2010) to compose our dataset. Knowledge domain visualisation, co-citation analysis and social network analysis methods were used. Over the past 10 years, America has led oncology research, while China is the sole developing country to be ranked in the top 10.
Jia et al studied for publications on the Web of Science database in the field of atherosclerosis related to epigenetics was conducted from the earliest mention to 31 December 2020.9 Data on total and annual publications, citations, impact factors, Hirsch (H)-index, citation times, most prolific authors, and frequently published journals were collected for quantitative and qualitative comparison. A total of 1,848 publications related to epigenetics and atherosclerosis were found. Haotian Jiang analysed the N6-methyladenosine publications were retrieved from Web of Science Core Collection and PubMed from 2000 to 2021, with keywords ‘m6A’ and ‘cancer’, and analyzed in biblioshiny and VOSviewer.10 A total of 1,013 documents were included, and China and the USA were the top countries with close collaboration.
Dario Kringel analyzed computational functional genomics of drugs with epigenetic interactions.3 PubMed database search yielded 3,051 hits on epigenetics and drugs, starting in 1992 and peaking in 2016. Annual citations increased to a plateau in 2000 and show a downward trend since 2008. Approved and investigational drugs in the Drug Bank database included 122 compounds that interacted with 68 unique epigenetic enzymes. Chenchen Zhu analysed a total of 867 publications met the inclusion criteria, which spanned the period from 2000 to 2022 indexed in Web of Science database.11 Over the years, the number of publications and the frequency of citations exhibited a clear upward trend in general, reaching a peak in 2021. The major contributing countries in terms of the number of publications were China, the United States, and Japan. Andy Wai Kan Yeung examined the incidence rate of reviews being mislabelled by Scopus, and compared this rate with Web of Science (WoS), PubMed and official websites of publishers. Top 400 cited publications defined by Scopus as ‘articles’ were examined.12 Their contents were evaluated to see if any were actually reviews. These publications were cross-checked in WoS, PubMed and publisher websites to identify the assigned document type labels. Out of the 400 Scopus ‘articles’, 117 were reviews (29.3%).
Le Guan et al conducted a comprehensive search of the literature from January 1981 to December 2021 using the Web of Science core database.13 The medical subject term ‘visceral pain’ was searched. A total of 5,047 articles were included in the analysis. The number of articles on visceral pain has continued to grow steadily over the past 40 years. The United States (1,716 articles), University of California (159 articles), and Neurogastroenterology and Motility (276 articles) were the country, institution, and journal with the most publications, respectively. Zou and Sun studied on 3131 publications of Web of Science to identify the current research status and research trends in this field.14 The results show that since 2010, the number of publications has been growing rapidly. Cooperative network analysis shows that the United States, the University of Toronto and Roger S Mcintyre are the most influential countries, research institutes and scholars, respectively. Insulin resistance, obesity, and metabolic syndrome are hot topics in this field.14, 15
Materials and Methods
The research sample included 9,677 articles related to epigenetics obtained from the Web of science database from January 2012 to December 2021. Bibliometric exploration of web of science listed publications on the topic of epigenetics. In the present study, only articles and reviews were included. The information for the documents that meet the requirements contained year of publication, source title, author, affiliation, subjects which were exported into excel format. The date of the retrieval was 20th August 2022.
Data Analysis and Interpretations
Year-wise distribution of publications
A total of 9,677 epigenetics publications were published during 2012-2021. The highest number of publications was 1,183 (12.22%) published in 2021. The least no. of publication is found to be of 2012, because it was the first period of the study. The average number of publications published per year was 967.7. It reveals that research productivity does not remain constant each year. It increases or decreases year after year. The research productivity of epigenetics fluctuates year after year.
There was variation in annual growth during the study period and suddenly decreased from 12.30 in 2014 to -1.31 in 2015. Where as in suddenly increased up to 4.88 in the year 2016, it was decreased to -3.92 in 2017 and it was increased to 23.35 in the year 2018. Likewise, there was a variation after year as indicated in Figure 1 in the AGR for the publications. The significant reason for variations is that there is no constant growth of publications every year in the area of study.
Table 1
Table 2
The study reveals that the maximum number of publications has been published in English language with 9,593 publications (99.13%), followed by German language with 36 publications (0.37), French language ranks third position with 22 publications (0.23%). And the remaining languages such as Spanish, Russian, Japanese and other languages are constituted in negligible percentage. The English language superiority was found in every year in total productivity of the discipline.
Most productive authors
Table 3
The list of top 10 prolific authors in terms of productivity count is listed in Table 3. The list is ranked in the order of decreasing productivity. It is found that, Zhang Y is the most productive author with 65 (0.67%) publications followed by Wang Y with 62 (0.64%) publications, Xu Y with 60 (0.62%) publications, Li Y with 59 (0.61%) publications, Liu Y with 56 (0.58%) publications, Wang J with 48 (0.50%) publications, and Chen Y with 45 (0.46%) publications. And a total of 46,566 authors are contributed entire research output of the study period.
Author’s productivity
In a study of author productivity, a large number of authors were classified according to the number of articles they had published during a certain period. To calculate Author productivity a formula has been applied. The formula is mathematically represented as below:
Average Author per Paper = No. of Authors/No. of Papers
Productivity per Author = No. of Papers/No. of Authors
Table 4
It is one of the productivity of the total authors involved in contributing the research productivity. But the contribution of individual authors can’t measured in these methods of measurements. Table 4 depicts the data pertaining to author productivity and average author per paper. It is revealed from the table 4 that the average number of authors per articles is 4.77 for 9,677 articles published between the periods 2012 to 2021.
The average productivity per author for the period 2012 to 2021 is 0.21. It is also clear from above table that for the years 2012, 2015, 2016 and for the years 2013, 2017, 2018 years are equal average productivity per author is recorded i.e., 0.23 and 0.21 respectively.
Degree of collaboration
The degree of collaboration is defined as the ratio of the number of collaborative research papers to the total number of research papers in the discipline during a certain period of time. The formula suggested by Subramanyam (1983) is used. It is expressed as:
C = Nm/Nm + Ns
Where, C is the degree of collaboration in a discipline. Nm is the number of multiauthored research papers in the discipline published during a year. Ns is the number of single authored papers in the discipline published during the same year.
Table 5
The authorship pattern was analysed to determine the percentage of single and multiple authors. Table 5 presents the single and multiple authors productivity pattern on yearly basis. There were 8,646 (89.35%) multi authored and only 1,031 (10.65%) single authored publications. The productivity patterns on the epigenetics publications are much contributed by the multiple authors than the single author since 2012 to 2021.
The degree of collaboration is determined by using this formula based on this study, the result of degree of collaboration C = 0.89. i.e, 89 percents of collaborative author's articles is published in this study. The degree of collaboration in producing research output on epigenetics research has shown a fluctuating trend during the study period.
Collaboration Index means number of authors per joint papers. Analysis in the Table 5 shows the variation in the Collaboration Index. It varies from 4.29 in 2016 and highest collaboration notices in 2019 i.e. 5.40. The average collaboration index is 4.77. It implies the research team falls between 4 and 5 authorship patterns in field of epigenetics.
Highly productive countries
Table 6
Analysis of worldwide research activity revealed that publications on epigenetics originated from 117 countries. 3,941 (40.72%) articles published in the United States ranked first place, which was 25.02% higher than those in China, whose publication number was 1,526 (15.77%), thereby ranking second. India should ranks 11th in the list, which is a developing country with 266 publications. The countries are ordered according to the total number of publications.
Table 7
The 10 most productive institutions are shown in Table 7, taking into account the number of publications since 2012. The 9,677 epigenetics articles were published by 8,145 different authoring organizations. These institutions are related to the number of publications per author and the frequency per affiliation; Thus, University of Michigan is the institution where the most research related to epigenetics analysis has been developed, with 141 publications, followed by Johns Hopkins University with 136 publications. Interestingly, most of the universities in this table are from USA, it is no surprise that US organizations occupy the top five spots.
Most preferred source titles
Table 8
A total of 1,945 journals published publications related to deep learning in epigenetics research. The top 10 journals are presented in Table 8. The highest count belonged to the Proceedings of the National Academy of Sciences of the United States of America (n=263), followed by International Journal of Molecular Sciences (n=168). Among these journals, Genes Development had the highest impact factor (12.89).
High productivity subject areas
Table 9
The scientific literature on epigenetics research is spread over 87 different subjects. The top 10 subjects with their frequencies are shown in Table 9. It is found that Biochemistry Molecular Biology has highest number of articles with 1,345 (13.90%) followed by Research Experimental Medicine contributing 1,017 (10.51%) articles. Behavioral Sciences occupy the third position with 990 (10.23%) articles. The fourth highest articles belonged to the subject Neurosciences Neurology with 935 (9.66%), Oncology with 817 (8.44%) and Genetics Heredity with 682 (7.05%) articles respectively.
Conclusions
In the present analysis, research and development of epigenetic scientometric information queried from web of science database.
Summary Points
In total, 9,677 documents were included in this study from 2000 to 2021, with 7,840 articles and 1,837 reviews.
Review articles were the lowest publication rate compared to original research.
The publications showed fluctuated during the study period.
USA and the China were the top two ranked countries of corresponding authors. Meanwhile, USA institutions occupy the top five spots.
Multi authored documents contained an average of 8.9 authors, with only 11% of the documents being single authored. The collaboration index (CI) averaged 4.77.
The United States has been leading the rank for years, but China’s publications have been gradually growing.
USA and University of Michigan were the most influential country and institution, respectively.
9,677 articles related to epigenetics were published in 1945 academic journals. Proceedings of the National Academy of Sciences of the United States of America published the most papers (n = 263; 2.72%).
Overall, the present study uses multiple complementary bibliometric methods to generate a panoramic view of the recent developments in epigenetics research.