Review Open Access Logo

Evolving Genetic Research in Rheumatoid Arthritis: Mapping Research Trends and Global Contributions

Wan Nur Amalina Zakaria 1, 2, *
Wan Syamimee Wan Ghazali 3, 2
Nur Diyana Mohd Shukri 4, 2
Adi Wijaya 5
  1. Human Genome Centre, School of Medical Sciences, Health Campus
  2. Hospital Pakar Universiti Sains Malaysia, 16150 Kubang Kerian, Kelantan, Malaysia
  3. Department of Internal Medicine, School of Medical Sciences, Health Campus
  4. Department of Immunology, School of Medical Sciences, Health Campus
  5. Department of Health Information Management, Universitas Indonesia Maju, Jakarta, Indonesia
Correspondence to: Wan Nur Amalina Zakaria, Human Genome Centre, School of Medical Sciences, Health Campus; Hospital Pakar Universiti Sains Malaysia, 16150 Kubang Kerian, Kelantan, Malaysia. Email: [email protected].
Volume & Issue: Vol. 13 No. 2 (2026) | Page No.: 8257-8270 | DOI: 10.15419/y029q988
Published: 2026-02-28

Online metrics


Statistics from the website

  • HTML Views: 0
  • PDF Views: 0
  • XML Views: 0

Statistics from Dimensions

This article is published with open access by BioMedPress. This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0) which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited. 

Abstract

Background: Significant advancements in genetic research have profoundly expanded our understanding of rheumatoid arthritis, elucidating numerous loci and pathways associated with disease susceptibility, progression, and therapeutic response. Methods: This bibliometric study mapped global research trends in genetics and rheumatoid arthritis using 3,731 publications indexed in Scopus from 1953 to June 17, 2025. Publications from 2025 were excluded from the total publication and citation trend analyses owing to incomplete data. Results: Publication volume ascended progressively from the 1990s, peaking in 2022. China leads in total output, while the United States demonstrates the highest citation impact and research influence. The United States, the United Kingdom, the Netherlands, Sweden, and Spain emerged as pivotal research hubs, contributing the majority of the most prolific authors and highly cited articles. Key publishing platforms include Arthritis and Rheumatism and Annals of the Rheumatic Diseases. Keyword co-occurrence analysis revealed dynamic shifts in research priorities: traditional topics such as Human Leukocyte Antigen (HLA), shared epitope, and rheumatoid factors are waning, while emerging domains include machine learning, ribonucleic acid sequencing (RNA-seq), Mendelian randomization, genome-wide association studies (GWAS), bioinformatics, and biomarkers. Conclusion: These findings highlight the evolving paradigm of rheumatoid arthritis genetics research and offer insights to guide future investigations and bolster international collaboration.

INTRODUCTION

Rheumatoid arthritis is a systemic, chronic inflammatory pathology and an autoimmune disorder1. It is characterized by progressive and destructive arthropathy stemming from synovial inflammation and hyperplasia, along with an elevation in autoantibodies such as rheumatoid factor (RF) and anti-cyclic citrullinated peptide (anti-CCP)2. In addition to musculoskeletal involvement, rheumatoid arthritis is associated with extra-articular manifestations, particularly cardiovascular disease, which contributes to diminished life expectancy among affected individuals3. Rheumatoid arthritis is a complex polygenic disease influenced by the interplay of genetic predisposition and environmental triggers, both of which play instrumental roles in disease susceptibility and progression4.

Significant strides in genetic research have markedly enhanced our understanding of rheumatoid arthritis pathogenesis, identifying numerous loci and molecular pathways involved in disease susceptibility, severity, and therapeutic response5. These insights have highlighted critical immune-related cell types and mechanisms, providing the groundwork for personalized medicine approaches. Current and future challenges include leveraging genetic variants as instrumental variables to establish causal relationships with intermediate phenotypes (e.g., transcriptomic markers), developing polygenic risk scores for diagnosis and treatment stratification, and identifying novel therapeutic targets4.

While interest in rheumatoid arthritis genetics continues to intensify, bibliometric analyses of this expanding research domain remain limited. Extant studies have primarily focused on specific subtopics, such as deoxyribonucleic acid (DNA) methylation6, non-coding ribonucleic acids (RNA)7, genome-wide association studies (GWAS)8, RNA-binding proteins9, and microRNAs10. However, a comprehensive evaluation of the global research landscape concerning rheumatoid arthritis genetics is still lacking. This study aims to fill that gap by mapping publication trends, research impact, collaborative networks, leading journals, influential authors, the most frequently cited articles, and the thematic evolution within the field. By offering an in-depth bibliometric overview, this analysis provides valuable insights that can guide future research directions and inform clinical applications11,12.

METHODS

Database selection

Scopus was selected as the primary data source for this study due to its comprehensive coverage of global scientific output. Widely recognized by the international research community, Scopus is considered a reliable, high-quality bibliometric database for conducting quantitative analyses in academic research13,14.

Search strategy

A systematic search was conducted in Scopus on June 17, 2025, using the query: TITLE ("rheumatoid arthritis") AND TITLE (gene OR genes OR genetic OR genetics OR genome OR polymorphism OR snp OR allele* OR haplotype* OR genotype OR mutation OR chromosome OR rna OR mrna OR dna). The TITLE field was utilized in the Scopus search to enhance specificity by retrieving only articles explicitly focused on the genetic aspects of rheumatoid arthritis. This minimized the inclusion of studies where genetic terms appear incidentally in abstracts or keywords, thereby reducing noise and sharpening the analytical focus. The search strategy was validated by reviewing the relevance of the retrieved articles. Only English-language original research articles focusing on the genetic aspects of rheumatoid arthritis were included, while reviews, conference papers, editorials, letters, and non-English publications were excluded. A total of 3,731 documents represent the eligible records retrieved directly from Scopus after filtering. Figure 1 depicts the PRISMA flow diagram of the study screening and selection process.

Figure 1

Flow diagram of search strategy

Data Analysis

Citation analysis was performed using Harzing’s Publish or Perish software (version 8)16 on data exported in RIS format, from which citation metrics were extracted. Country co-authorship and author keywords co-occurrence maps were generated using VOSviewer (version 1.6.20)17 based on data in CSV format. For all VOSviewer analyses, the default resolution parameter (resolution = 1.0) was applied. In the country co-authorship analysis, a minimum threshold of five documents per country was set, resulting in the inclusion of 62 countries in the network visualization. For the author keyword co-occurrence analysis18, a minimum occurrence threshold of 10 occurrences per keyword was applied. To improve analytical accuracy and reduce term fragmentation, a thesaurus file was employed to standardize synonymous and variant terms, including singular and plural forms as well as commonly used abbreviations. For example, “cytokine” was merged with “cytokines,” “major histocompatibility complex” with “MHC,” and “interleukin-1” with “IL-1”. This standardization enhanced the precision and interpretability of the VOSviewer analyses.

RESULTS

A total of 3,731 documents were included in the final analysis to map publication trends, research impact, collaborative networks, leading journals, influential authors, the most cited articles, and the thematic evolution within the field.

Annual Trends in Scientific Publications and Citations

The annual trends in scientific publications and citations from 1953 to 2024 demonstrate a clear evolution in research activity (Figure 2). Early output was negligible, with fewer than five papers per year until the 1980s. A gradual rise began in the late 1980s, marked by a spike in 1987, when 11 publications received 2,659 citations, reflecting the influence of seminal works. A sustained increase followed through the 1990s, highlighting growing interest in the field. The most productive epoch occurred between 2000 and 2015, with publication volume and citation counts peaking at 173 papers in 2016 and 6,146 citations in 2005. Although annual output remained high beyond 2016, citations have declined slightly, likely due to the citation lag for newer publications. Despite this, the continued high publication rate suggests enduring interest in this research area.

Figure 2

Trends in the number of publications and citations from 1953 to 2024 (year 2025 is not shown because of incomplete data)

Country Contributions to Publications and International Collaboration Analysis

Table 1 presents the top 10 contributing countries in rheumatoid arthritis and genetics research based on bibliometric indicators. China leads in total publications (663), followed by the United States (608) and the United Kingdom (409). Despite producing fewer papers than China, the United States ranks highest in citation impact, with the most total citations (38,878), the highest average citations per publication (63.94), and the top h-index (95), reflecting both the volume and influence of its research output. European countries, such as Sweden, the Netherlands, and the United Kingdom, demonstrate strong citation performance relative to their output. Sweden, in particular, attained the highest average citations per publication (87.31) and cited publication (90.41), indicating high-impact research despite fewer publications. Meanwhile, East Asian countries like Japan, South Korea, and China contribute substantially to publication volume, though citation rates vary—Japan shows moderate impact (45.03), while China’s lower average (17.65) suggests a growing but still maturing influence.

Table 1

Top 10 countries that contributed to the publications

CountryTPNCPTCC/PC/CPhg
China6635941170317.6519.704378
United States6085873887863.9466.2395172
United Kingdom4093922524061.7164.3977142
Japan2632501184445.0347.3855100
Spain2382301052044.2045.744891
Netherlands2132081696779.6681.5765125
France183175903749.3851.644490
Sweden1751691527987.3190.4158121
South Korea168164733043.6344.704079

VOSviewer was utilized to visualize the international co-authorship network. As shown in Figure 3, the analysis encompassed 62 countries with at least five publications, stratified into seven clusters based on co-authorship frequency. These clusters underscore distinct regional and thematic research collaborations, with each cluster representing a group of countries more likely to publish together. The United States (turquoise cluster), along with the United Kingdom, Netherlands, and Sweden (red cluster), and Spain (turquoise cluster), stood out as key contributors, showing both high publication output and strong citation impact. Their high total link strength values suggest central roles in global research networks. In contrast, countries like Thailand and Lithuania show weaker international collaboration, highlighting room for growth. Emerging nations such as China, Iran, Pakistan, and several Middle Eastern countries (yellow nodes) are becoming more active, with increasing publication counts. Although many of their outputs are recent, resulting in lower average citations, their growing publication volume suggests strong potential for future impact.

Figure 3

Co-authorship map of countries in rheumatoid arthritis and genetics research. (a) Network visualization showing collaboration clusters among countries; node size represents the number of documents, and link thickness indicates the strength of co-authorship relationships. (b) Overlay visualization illustrating the temporal evolution of country collaborations, where node size reflects publication output and node color (purple to yellow) represents the average publication year (2005–2021)

Leading Journals

Table 2 highlights the top ten journals contributing to rheumatoid arthritis and genetics research, emphasizing both publication volume and citation impact. leads in output with 302 articles and over 25,000 citations, reflecting its historical significance, though it has since transitioned to . Its legacy is underscored by strong metrics, including a CiteScore of 21.4. In current influence, stands out with the highest CiteScore (33.2) and a robust citation profile, ranking second in volume but maintaining top-tier impact. Other key contributors include , , and , each showing strong bibliometric performance. , for example, garnered over 5,000 citations from 131 papers, supported by a CiteScore of 8.5. Generalist and interdisciplinary journals like and also feature prominently, reflecting the expanding interface between rheumatology, immunology, and broader biomedical research. , despite fewer papers, demonstrates high impact with a CiteScore of 10.8 and strong visibility across disciplines.

Table 2

Top 10 Most Productive Journals

Source TitleTPTCPublisherCite Score (2024)SJR (2024)SNIP (2024)
Arthritis and Rheumatism*30225908John Wiley & Sons21.4*3.784*2.788*
Annals of the Rheumatic Diseases1909466Elsevier33.25.7314.435
Journal of Rheumatology1674929Journal of Rheumatology5.91.1351.028
Arthritis Research and Therapy1315010Springer Nature8.51.5871.274
Rheumatology1053798Oxford University Press9.91.7211.373
Rheumatology International1052060Springer Nature7.00.9651.155
Clinical and Experimental Rheumatology991853Clinical and Experimental Rheumatology S.A.S.5.91.0040.952
Plos One932231Public Library of Science5.40.8031.065
Clinical Rheumatology841107Springer Nature5.90.8550.943
Frontiers in Immunology621059Frontiers Media S.A.10.81.9411.259

Most Prolific Authors

Table 3 highlights the top ten most prolific authors, showcasing their affiliations, countries, and bibliometric impact. Martín J (Spain) leads in total publications (99) and cited papers (97), reflecting both productivity and consistent scholarly influence. Barton A (UK), despite fewer publications (73), achieves the highest average citations per paper (110.90), emphasizing the impact of his contributions. Other highly influential figures include Gregersen PK (USA) with the most total citations (12,849), and Huizinga TWJ (Netherlands), with a remarkable average citations per paper of 132.62. Swedish researchers Klareskog L and Padyukov L stand out with high citation averages (155.00 and 163.22, respectively), reinforcing Sweden's research prominence. Spanish authors like Balsa A and González-Gay MA further underscore Spain’s active role in the field, while Bae SC and Lee YH represent South Korea’s growing influence through consistent output and strong citation metrics.

Table 3

Top 10 Most Prolific Authors

Author’s NameAffiliationCountryTPNCPTCC/PC/CPhg
Martín JCSIC - Instituto de Parasitologia y Biomedicina Lopez Neyra (IPBLN)Spain9997605661.1762.433776
Balsa AHospital Universitario La PazSpain8383329539.7039.703154
Bae SCHanyang UniversitySouth Korea8282457355.7755.772966
Barton AUniversity of ManchesterUnited Kingdom73718096110.90114.034173
González-Gay MAUniversidad de CantabriaSpain7170464365.3966.332768
Klareskog LKarolinska InstitutetSweden696610695155.00162.054269
Lee YHKorea University MedicineSouth Korea6968210430.4930.942543
Gregersen PKNorthwell Health SystemUnited States656512849197.68197.683865
Huizinga TWJLeids Universitair Medisch CentrumNetherlands65648620132.62134.694165
Padyukov LKarolinska UniversitetssjukhusetSweden60599793163.22165.983560

Most Influential Articles

Table 4 highlights the 10 most highly cited publications in rheumatoid arthritis and genetics, representing foundational work that has shaped current scientific understanding. Leading the list is Gregersen et al. (1987)19, and their shared epitope hypothesis remains a seminal contribution with 2,265 citations and a lasting impact on molecular rheumatoid arthritis research. Okada et al. (2014)20, published in , with 1,807 citations and the highest annual citation rate (164.27), marking a transformative step in linking GWAS findings to therapeutic development. Similarly, Begovich et al. (2004)21 identified a key single-nucleotide polymorphism (SNP) in protein tyrosine phosphatase non-receptor type 22 (PTPN22), earning 1,256 citations and influencing subsequent genetic studies on autoimmune susceptibility. Several other top-cited articles delve into GWAS and functional genomics, such as those by Stahl et al. (2010)22 and Suzuki et al. (2003)23, which expanded the map of rheumatoid arthritis-associated loci. Padyukov et al. (2004)27 made a pivotal contribution by linking smoking with Human Leukocyte Antigen (HLA)–DR alleles, offering early evidence of gene–environment interactions. Liu et al. (2013)25 brought epigenetics to the forefront with insights into DNA methylation in rheumatoid arthritis. Earlier landmark studies by MacGregor et al. (2000)24 on heritability and Firestein et al. (1990)26 on cytokine gene expression continue to be influential, reflecting their foundational role in both clinical and molecular aspects of rheumatoid arthritis research.

Table 4

Top 10 influential articles

Authors and yearTitleCitesCites/Year
Gregersen et al. (1987)19The shared epitope hypothesis. an approach to understanding the molecular genetics of susceptibility to rheumatoid arthritis. https://doi.org/10.1002/art.1780301102226559.58
Okada et al. (2014)20Genetics of rheumatoid arthritis contributes to biology and drug discovery. https://doi.org/10.1038/nature128731807164.27
Begovich et al. (2004)21A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis. https://doi.org/10.1086/422827125659.81
Stahl et al. (2010)22Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. https://doi.org/10.1038/ng.582108772.47
Suzuki et al. (2003)23Functional haplotypes of PADI4, encoding citrullinating enzyme peptidylarginine deiminase 4, are associated with rheumatoid arthritis. https://doi.org/10.1038/ng120697844.45
MacGregor et al. (2000)24Characterizing the quantitative genetic contribution to rheumatoid arthritis using data from twins. https://doi.org/10.1002/1529-0131(200001)43:1%3C30::aid-anr5%3E3.0.co;2-b95638.24
Liu et al. (2013)25Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis. https://doi.org/10.1038/nbt.248777964.92
Firestein et al. (1990)26Quantitative analysis of cytokine gene expression in rheumatoid arthritis. https://doi.org/10.4049/jimmunol.144.9.334756716.20
Padyukov et al. (2004)27A gene–environment interaction between smoking and shared epitope genes in HLA–DR provides a high risk of seropositive rheumatoid arthritis. https://doi.org/10.1002/art.2055356026.67
Tokuhiro et al. (2003)28An intronic SNP in a RUNX1 binding site of SLC22A4, encoding an organic cation transporter, is associated with rheumatoid arthritis. https://doi.org/10.1038/ng126753924.50

Visualization Analysis of Dynamic Changes in Co-Occurrence of the Author’s Keywords

VOSviewer was used to generate a keyword co-occurrence map based on 102 author keywords grouped into four thematic clusters (Figure 4). Cluster 1 (red) is the largest and most recent cluster, focusing on the molecular underpinnings of rheumatoid arthritis and the development of personalized treatment strategies. Prominent keywords include rheumatoid arthritis, methotrexate, inflammation, autoimmune diseases, gene expression, biomarkers, and cytokines. Emerging keywords like bioinformatics (2021.8), machine learning (2023.1), and RNA-seq (2020.1) reflect a data-driven shift in rheumatoid arthritis research. Recent interests include Mendelian randomization (2023.3), COVID-19 (2022.7), and interstitial lung disease (2022.9). The green cluster (Cluster 2) captures classical genetic epidemiology and comparative autoimmune research. Recurring keywords are polymorphisms, SNPs, meta-analysis, systemic lupus erythematosus (SLE), and autoantibodies. Although many terms are older (e.g., juvenile rheumatoid arthritis, 2002), their continued relevance reflects enduring efforts in gene discovery. The blue cluster (Cluster 3) represents well-established immunogenetic domains with clinical relevance. Main keywords include HLA, HLA-DRB1, rheumatoid factors, anti-CCP, PTPN22, and shared epitope. Keywords such as cardiovascular disease, atherosclerosis, and disease activity reflect links between genetics and clinical outcomes. Many keywords are either older (e.g., HLA, 2004.2) or more recent, such as anti-CCP (2016.0) and ACPA (2016.2). The yellow cluster (Cluster 4) explores functional polymorphisms and genotype-phenotype relationships. The keywords include gene polymorphisms, haplotypes, genotypes, tumor necrosis factor (TNF), vitamin D receptor (VDR), etanercept, and TNF-α. Some keywords, such as vitamin D (2019.4) and TNF-α (2018.5), reflect recent interest in functional gene variants. This cluster also contains older keywords related to foundational methods like restriction fragment length polymorphism (RFLP, 2005.5) and immunogenetics (2002.3).

Figure 4

Co-occurrence of author keywords in rheumatoid arthritis and genetics research. (a) Network visualization displaying keyword clusters based on co-occurrence relationships; node size indicates the frequency of keyword occurrence, and link thickness represents the strength of co-occurrence. (b) Overlay visualization showing the temporal distribution of keywords, with node color (purple to yellow) indicating the average publication year

DISCUSSION

Our results indicate a clear trend of growth in the number of publications on rheumatoid arthritis and genetics over time. The annual trends from 1953 to 2024 reveal a steady evolution, beginning with negligible publication rates until the 1980s. A pivotal turning point occurred in 1987, when a high-impact study on the shared epitope hypothesis by Gregersen et al. generated a significant citation spike, signaling foundational contributions to the field. Research activity accelerated throughout the 1990s, culminating in the most productive epoch between 2000 and 2015. This rise may have been facilitated by the concurrent launch of the Human Genome Project (HGP) in October 1990. The HGP was a multinational project involving the United States, the United Kingdom, France, Japan, Canada, Germany, and China, undertaking to sequence the entire human genome and classify human genes 29. Although citations have slightly declined since 2016, likely due to the natural lag in citation accrual, publication rates have remained consistently high, indicating sustained global interest and active investigation in this domain.

The analysis of country contributions and international collaboration highlights both established and emerging global players. While China leads in publication volume, the United States excels in citation impact and research influence, as reflected in its top average citations per publication and h-index. European countries like Sweden and the Netherlands also show high citation efficiency, producing fewer but highly impactful papers. VOSviewer’s co-authorship network visualization indicates that these influential nations are central to global collaboration clusters, with the United States, United Kingdom, Netherlands, Sweden, and Spain forming key hubs. In contrast, countries with lower publication output, such as Thailand and Lithuania, display limited international connectivity. However, emerging contributors, particularly China, Iran, Pakistan, and several Middle Eastern nations, are becoming more active, indicating expanding global participation and growing research potential despite currently modest citation impact. Notably, China’s high publication output is largely driven by large-scale initiatives like the China Kadoorie Biobank (CKB), GWAS consortia, and other cohort studies.

The leading journals, authors, and most-cited articles reveal a tightly interlinked landscape of scholarly influence, scientific advancement, and interdisciplinary growth. High-impact journals like and serve as key publication venues for seminal works, such as Gregersen et al. (1987)19, MacGregor et al. (2000)24, and Padyukov et al. (2004)27, highlighting their role in disseminating groundbreaking genetic discoveries. The prominence of generalist outlets like and immunology-focused platforms like signals the increasing cross-pollination between rheumatology, genomics, and broader biomedical research. This aligns with the evolving scope of top-cited papers, which span from early heritability studies to cutting-edge GWAS, epigenetics, and gene–environment interactions. Importantly, this pattern is not only of academic interest but also highlights the translational trajectory of rheumatoid arthritis genetics research. For instance, GWAS that identified variants in the Janus kinase (JAK)- signal transducer and activator of transcription (STAT) or JAK–STAT signaling pathway provided the scientific foundation for the development of JAK inhibitors such as tofacitinib and baricitinib, which are now approved for clinical use in rheumatoid arthritis5,20,30.

A strong correlation also emerges between the productivity of influential authors and the top-cited articles. Researchers such as Martín (Spain), Gregersen (United States), and Klareskog (Sweden), all of whom appear in the top 10 author rankings, are also behind some of the most cited articles, reinforcing their leadership in shaping the field. The keyword co-occurrence analysis using VOSviewer reveals a rich and evolving thematic structure in rheumatoid arthritis and genetics research, segmented into four distinct yet interconnected clusters. Cluster 1 (red) emerges as the largest and most recent, showcasing the field’s shift toward molecular precision and personalized therapy31,32,33. Core concepts such as rheumatoid arthritis, methotrexate, gene expression, and cytokines anchor this cluster in the disease's biological basis and treatment landscape. However, the appearance of newer keywords like machine learning, RNA-seq, and bioinformatics signals a growing reliance on high-throughput and computational methodologies to dissect complex genetic patterns34,35,36. The inclusion of Mendelian randomization, COVID-19, and interstitial lung disease further reflects an expanding research scope that now incorporates global health concerns and comorbidity profiling37,38,39.

The second largest cluster (green) emphasizes polymorphisms, SNPs, and meta-analysis, as well as comparisons with related autoimmune diseases like SLE40,41. The persistent use of older keywords such as juvenile rheumatoid arthritis underscores how foundational studies in genetic susceptibility continue to inform current hypotheses. These insights highlight opportunities for future studies to explore shared genetic mechanisms across autoimmune conditions and to refine genotype-based risk stratification models42,43. Clinically, such advancements could support the development of more targeted diagnostic tools and personalized therapeutic strategies in rheumatoid arthritis and related disorders44. In contrast, Cluster 3 (blue) focuses on well-established immunogenetic hallmarks that continue to shape clinical practice. Keywords like HLA, shared epitope, and PTPN22 are deeply embedded in rheumatoid arthritis's pathogenic narrative45,46, while the presence of anti-CCP, cardiovascular disease, and disease activity highlights a maturing focus on translational outcomes47,48. Lastly, Cluster 4 (yellow) bridges basic and translational science through exploration of functional gene variants and genotype–phenotype relationships. Keywords like VDR, etanercept, and TNF-α underscore the genetic underpinnings of therapeutic response49,50,51, with vitamin D and gene polymorphisms suggesting newer angles on immune modulation52,53. The coexistence of older methods such as RFLP and immunogenetics with newer molecular targets illustrates the ongoing synthesis of traditional genetic tools with modern approaches54,55.

Beyond bibliometric trends, a closer look at the four keyword clusters reveals both consensus and ongoing controversies in rheumatoid arthritis genetics. In Cluster 1, molecular and transcriptomic studies consistently implicate cytokine dysregulation, particularly IL6, TNF-α, and JAK–STAT pathway components, as core mediators of inflammation and drug response5,31,33. These findings underpin the rationale for targeted therapies such as JAK inhibitors, yet cross-cohort transcriptomic analyses show inconsistent predictive signatures for methotrexate or TNF inhibitor efficacy. Cluster 2 highlights strong consensus on shared autoimmune susceptibility genes such as HLA-DRB1 and PADI4, which exhibit reproducible risk effects across ancestries19,23,46. However, variants such as PTPN22 display marked population-specific heterogeneity, strongly associated with rheumatoid arthritis in European and North American cohorts but weak or absent in East Asian and African populations21,45. These inconsistencies suggest allelic heterogeneity or gene–environment interactions shaping regional disease risk. In Cluster 3, HLA-DRB1 remains the most robust universal locus, whereas other loci such as STAT4 and CTLA4 show divergent significance across populations, as demonstrated by studies in Iraqi, Mexican, and Algerian cohorts46,47,53. Such discrepancies limit the transferability of polygenic risk scores and highlight the need for ancestry-inclusive GWAS. Finally, Cluster 4 highlights the translational challenges in linking functional polymorphisms to treatment response. Variants in VDR, TNF, and TP53 have been correlated with biologic response, remission, or disease activity in some studies, yet replication remains inconsistent and clinical utility uncertain49,50,52,55.

Collectively, these patterns illustrate a maturing yet heterogeneous field, where established immunogenetic mechanisms coexist with ancestry-specific associations and persistent barriers to clinical translation. This study has several limitations that warrant consideration. First, the analysis relied exclusively on Scopus as the data source. While Scopus provides broad coverage, reliable citation metadata, and strong compatibility with bibliometric tools13,14, the use of a single database may have excluded relevant publications indexed only in Web of Science or PubMed. Variations in indexing standards, author affiliations, and institutional naming conventions across databases also pose challenges for data harmonization. Second, our search strategy required the presence of inclusion search terms in article titles, which may have led to the omission of some relevant studies where these terms were reported only in abstracts or keywords. Third, the analysis was based on raw citation counts rather than normalized measures. As a result, some influential recent publications may be underrepresented in citation-based rankings. Fourth, the overlay visualization generated using VOSviewer is based on the average publication year. While this approach is useful for indicating the relative recency of research themes, it does not capture the frequency dynamics of keyword usage over time. Accordingly, our findings should be interpreted in light of these constraints.

CONCLUSIONS

Bibliometric analysis of rheumatoid arthritis and genetics research reveals a field marked by steady growth and evolving scientific priorities. From foundational breakthroughs in the late 20th century to the recent surge in interest in pharmacogenetics, epigenetics, and data-driven methodologies, the field has matured into a highly interdisciplinary domain. Thematic keyword clusters further reflect the field’s transition from traditional immunogenetics toward personalized medicine, complex comorbidity research, and computational genomics, signaling a future direction centered on precision healthcare and integrative approaches to disease understanding and management.

Research Gaps and Future Agenda

Although data-driven and integrative genomic approaches are increasingly shaping rheumatoid arthritis research, several knowledge gaps remain that limit the translation of genetic discoveries into clinical benefit. Current GWAS are still dominated by European ancestry cohorts, resulting in limited transferability of risk loci and polygenic risk scores to underrepresented populations5,8,20. Furthermore, functional follow-up studies integrating multi-omics layers such as epigenomic QTLs, single-cell transcriptomics, and proteomics remain sparse25,31,33. These gaps hinder the identification of causal variants and the mapping of gene regulatory networks underlying extra-articular manifestations such as interstitial lung disease or cardiovascular comorbidities3,39. Moreover, while polygenic risk scores and Mendelian randomization offer tools for prediction and causal inference, their clinical validation and incorporation into personalized treatment strategies remain nascent34,36,51.

Future research should prioritize the following questions to accelerate scientific and translational progress:

  1. How can multi-ancestry GWAS improve fine-mapping accuracy and reveal population-specific risk variants in rheumatoid arthritis?

  2. What role can multi-omics Mendelian randomization play in clarifying causal pathways linking genetic variants to extra-articular manifestations (e.g., lung fibrosis, atherosclerosis)?

  3. How can integrative analyses of epigenomic QTLs and single-cell data refine our understanding of disease-relevant immune cell subtypes?

  4. What strategies can enhance the clinical utility, equity, and calibration of polygenic risk scores across diverse ancestries?

  5. How can predictive machine learning models be combined with genomic data to guide therapeutic response and drug repurposing?

Addressing these questions will be pivotal for transforming genetic discoveries into precision healthcare tools for rheumatoid arthritis.

ABBREVIATIONS

ACPA: anti-citrullinated protein antibody, anti-CCP: anti-cyclic citrullinated peptide, CKB: China Kadoorie Biobank, COVID-19: coronavirus disease 2019, CTLA4: cytotoxic T-lymphocyte-associated protein 4, DNA: deoxyribonucleic acid, GWAS: genome-wide association studies, HGP: Human Genome Project, HLA: Human Leukocyte Antigen, JAK-STAT: Janus kinase - signal transducer and activator of transcription, PTPN22: protein tyrosine phosphatase non-receptor type 22, QTLs: epigenomic quantitative trait loci, RFLP: restriction fragment length polymorphism, RNA: ribonucleic acid, SLE: systemic lupus erythematosus, SNP: single-nucleotide polymorphism, TNF: tumor necrosis factor, VDR: vitamin D receptor.

ACKNOWLEDGEMENTS

None.

AUTHORS’ CONTRIBUTIONS

(I) Conceptualization, methodology development, data curation, formal analysis, and drafting of the original manuscript: WNAZ (II) Administrative support, critical review, and manuscript editing: AW, WSWG, NDMS (III) Data acquisition, validation, and analysis: WNAZ, AW (IV) Final approval of the manuscript and accountability for the integrity of the work: All authors.

FUNDING

None.

AVAILABILITY OF DATA AND MATERIALS

Supplementary files, including the bibliometric dataset (.csv) and reference file (.ris), are available in Zenodo at

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

 

Declaration of generative AI and AI-assisted technologies in the writing process

The authors declare that they have not used generative AI (a type of artificial intelligence technology that can produce various types of content including text, imagery, audio and synthetic data).

Competing interests

The authors declare that they have no competing interests.

  1. M. Jahid, K. U. Khan, Rehan-Ul-Haq, R. S. Ahmed. Overview of rheumatoid arthritis and scientific understanding of the disease. Mediterranean Journal of Rheumatology 2023; 34(3): 284-291.
  2. M. V. Sokolova, G. Schett, U. Steffen. Autoantibodies in rheumatoid arthritis: historical background and novel findings. Clinical Reviews in Allergy & Immunology 2022; 63(2): 138-151.
  3. N. Johri, S. Varshney, S. Gandha, A. Maurya, P. Mittal, S. Jangra, R. Garg, A. Saraf. Association of cardiovascular risks in rheumatoid arthritis patients: management, treatment and future perspectives. Health Sciences Review 2023; 8: 100108.
  4. S. D. Sharma, S. H. Leung, S. Viatte. Genetics of rheumatoid arthritis. Best Practice & Research Clinical Rheumatology 2024; : 101968.
  5. L. Chen, J. Zhao, Q. Meng. From genetic variants to therapeutic targets: insights into understanding rheumatoid arthritis. Frontiers in Immunology 2025; 16: 1556971.
  6. X. Huang, L. Huang, X. Gao, C. Liu. Global research trends in DNA methylation in rheumatoid arthritis: a bibliometric analysis and visual analysis. Medicine (Baltimore) 2024; 103(1): e36218.
  7. Z. Wei, H. Li, S. Lv, J. Yang. Current situation and trend of non-coding RNA in rheumatoid arthritis: a review and bibliometric analysis. Frontiers in Immunology 2024; 14: 1301545.
  8. W. H. Wang, M. H. Xia, X. R. Liu, S. F. Lei, P. He. A bibliometric analysis of GWAS on rheumatoid arthritis from 2002 to 2024. Human Heredity 2025; 90(1): 18-32.
  9. W. Zhang, J. Song, S. Xian, S. Xie, Y. Liu, Y. Yao, X. Tong, X. Wu, Y. Li, H. Zhang, B. Lu. A bibliometric analysis of rheumatology: knowledge structure and research trends of RNA-binding proteins in rheumatic diseases. Clinical Rheumatology 2025; : 1-16.
  10. X. Pang, F. Xu, C. Fan, H. Jiang. Global research trends of microRNAs in rheumatoid arthritis: bibliometrics and visualization analysis. Clinical Rheumatology 2025; 44(1): 53-66.
  11. R. Ahmad, K. F. Azman, R. Yahaya, N. Shafin, N. Omar, A. H. Ahmad, R. Zakaria, A. Wijaya, Z. Othman. Brain-derived neurotrophic factor (BDNF) in schizophrenia research: a quantitative review and future directions. AIMS Neuroscience 2023; 10(1): 5.
  12. W. N. Zakaria, S. Y. Yoon, A. Wijaya, A. H. Ahmad, R. Zakaria, Z. Othman. Global trends and themes in genetic counseling research. European Journal of Human Genetics 2023; 31(10): 1181-1184.
  13. J. Baas, M. Schotten, A. Plume, G. Côté, R. Karimi. Scopus as a curated, high-quality bibliometric data source for academic research in quantitative science studies. Quantitative Science Studies 2020; 1: 377-386.
  14. R. Zakaria, P. Vit, A. Wijaya, A. H. Ahmad, Z. Othman, B. Mezzetti. Evolution of blueberry (Vaccinium corymbosum L), raspberry (Rubus idaeus L) and strawberry (Fragaria x ananassa Duch.) research: 2012–2021. Journal of Berry Research 2022; 12(3): 365-381.
  15. R. Zakaria, A. Ahmi, A. H. Ahmad, Z. Othman. Worldwide melatonin research: a bibliometric analysis of the published literature between 2015 and 2019. Chronobiology International 2021; 38(1): 27-37.
  16. A. W. Harzing. Publish or perish 2007; :
  17. N. Van Eck, L. Waltman. Manual for VOSviewer version 1.6.20 2023; :
  18. N. Shafin, C. A. Ismail, M. Z. Mustafa, N. Ghani, A. H. Ahmad, Z. Othman, A. Wijaya, R. Zakaria. Thematic analysis of multiple sclerosis research by enhanced strategic diagram. Multiple Sclerosis Journal 2022; 28(14): 2160-2170.
  19. P. K. Gregersen, J. Silver, R. J. Winchester. The shared epitope hypothesis. An approach to understanding the molecular genetics of susceptibility to rheumatoid arthritis. Arthritis and Rheumatism 1987; 30(11): 1205-1213.
  20. Y. Okada, D. Wu, G. Trynka, T. Raj, C. Terao, K. Ikari, Y. Kochi, K. Ohmura, A. Suzuki, S. Yoshida, R. R. Graham. Genetics of rheumatoid arthritis contributes to biology and drug discovery. Nature 2014; 506(7488): 376-381.
  21. A. B. Begovich, V. E. Carlton, L. A. Honigberg, S. J. Schrodi, A. P. Chokkalingam, H. C. Alexander, K. G. Ardlie, Q. Huang, A. M. Smith, J. M. Spoerke, M. T. Conn. A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis. American Journal of Human Genetics 2004; 75(2): 330-337.
  22. E. A. Stahl, S. Raychaudhuri, E. F. Remmers, G. Xie, S. Eyre, B. P. Thomson, Y. Li, F. A. Kurreeman, A. Zhernakova, A. Hinks, C. Guiducci. Genome-wide association study meta-analysis identifies seven new rheumatoid arthritis risk loci. Nature Genetics 2010; 42(6): 508-514.
  23. A. Suzuki, R. Yamada, X. Chang, S. Tokuhiro, T. Sawada, M. Suzuki, M. Nagasaki, M. Nakayama-Hamada, R. Kawaida, M. Ono, M. Ohtsuki. Functional haplotypes of PADI4, encoding citrullinating enzyme peptidylarginine deiminase 4, are associated with rheumatoid arthritis. Nature Genetics 2003; 34(4): 395-402.
  24. A. J. MacGregor, H. Snieder, A. S. Rigby, M. Koskenvuo, J. Kaprio, K. Aho, A. J. Silman. Characterizing the quantitative genetic contribution to rheumatoid arthritis using data from twins. Arthritis and Rheumatism 2000; 43(1): 30-37.
  25. Y. Liu, M. J. Aryee, L. Padyukov, M. D. Fallin, E. Hesselberg, A. Runarsson, L. Reinius, N. Acevedo, M. Taub, M. Ronninger, K. Shchetynsky. Epigenome-wide association data implicate DNA methylation as an intermediary of genetic risk in rheumatoid arthritis. Nature Biotechnology 2013; 31(2): 142-147.
  26. G. S. Firestein, J. M. Alvaro-Gracia, R. Maki. Quantitative analysis of cytokine gene expression in rheumatoid arthritis. Journal of Immunology 1990; 144(9): 3347-3353.
  27. L. Padyukov, C. Silva, P. Stolt, L. Alfredsson, L. Klareskog. A gene–environment interaction between smoking and shared epitope genes in HLA–DR provides a high risk of seropositive rheumatoid arthritis. Arthritis and Rheumatism 2004; 50(10): 3085-3092.
  28. S. Tokuhiro, R. Yamada, X. Chang, A. Suzuki, Y. Kochi, T. Sawada, M. Suzuki, M. Nagasaki, M. Ohtsuki, M. Ono, H. Furukawa. An intronic SNP in a RUNX1 binding site of SLC22A4, encoding an organic cation transporter, is associated with rheumatoid arthritis. Nature Genetics 2003; 35(4): 341-348.
  29. F. Moraes, A. Góes. A decade of human genome project conclusion: scientific diffusion about our genome knowledge. Biochemistry and Molecular Biology Education 2016; 44(3): 215-223.
  30. D. Diogo, Y. Okada, R. M. Plenge. Genome-wide association studies to advance our understanding of critical cell types and pathways in rheumatoid arthritis: recent findings and challenges. Current Opinion in Rheumatology 2014; 26(1): 85-92.
  31. C. He, Z. Ren, X. Xia, Z. Xue, F. Zhang, M. Qian, Y. Shu, G. Yin, Q. Xie, H. Xu. A single-cell RNA-seq dataset of synovial fluid from rheumatoid arthritis treated with TNF-α/JAK inhibitor. Scientific Data 2025; 12(1): 1-7.
  32. G. Li, H. Chen, J. Shen, Y. Ding, J. Chen, Y. Zhang, M. Tang, N. Xu, Y. Fang. Unveiling new therapeutic horizons in rheumatoid arthritis: an in-depth exploration of circular RNAs derived from plasma exosomes. Journal of Orthopaedic Surgery and Research 2025; 20(1): 109.
  33. Q. Ren, K. Ding, W. Jiang, W. Zhu, Y. Gao. Molecular crosstalk and potential causal mechanisms of rheumatoid arthritis and sarcopenia co-morbidity: a gene integration analysis. Experimental Gerontology 2025; 203: 112729.
  34. M. Binvignat, B. Y. Miao, C. Wibrand, M. M. Yang, D. Rychkov, E. Flynn, J. Nititham, W. Tamaki, U. Khan, A. Carvidi, M. Krueger. Single-cell RNA-seq analysis reveals cell subsets and gene signatures associated with rheumatoid arthritis disease activity. JCI Insight 2024; 9(16): e178499.
  35. X. Li, S. Ding, P. Zhang, J. Yan, X. Yu, X. Wang, H. Zhan, Z. Wang. Revealing the impact of autophagy-related genes in rheumatoid arthritis: insights from bioinformatics. Heliyon 2024; 10(9): e29849.
  36. H. Nisar, K. Javed, A. Arshad, H. Habib, K. I. Sahibzada, S. Shahid. Identification of potential candidate genes in rheumatoid arthritis using integrated machine learning and WGCNA approach on transcriptomic data. Network Modeling Analysis in Health Informatics and Bioinformatics 2025; 14(1): 1-17.
  37. Y. Fan, S. Huang, D. Wu, M. Chu, J. Zhao, J. Zhang, Y. Wang, Y. Gui, X. Ye, G. Wang, Y. Geng. Immune features revealed by single‐cell RNA and single‐cell TCR/BCR sequencing in patients with rheumatoid arthritis receiving COVID‐19 booster vaccination. Journal of Medical Virology 2024; 96(4): e29573.
  38. M. Chen, J. Lei, Z. Liu, R. Zhao, Y. Jiang, K. Xu, T. Zhang, C. Suo, X. Chen. Association between red blood cell distribution width and genetic risk in patients with rheumatoid arthritis: a prospective cohort study and Mendelian randomization analysis. BMC Rheumatology 2025; 9(1): 1.
  39. B. Kaczmarczyk, C. de la Calle-Fabregat, A. Conde, A. C. Duarte, N. Mena-Vazquez, A. Fernandez-Nebro, A. Triguero-Martinez, S. Castañeda, R. Dos-Santos Sobrin, A. Mera-Varela, C. Lopez-Pedrera. DNA methylome biomarkers of rheumatoid arthritis-associated interstitial lung disease reflecting lung fibrosis pathways, an exploratory case–control study. Scientific Reports 2025; 15(1): 15123.
  40. H. Ebrahimiyan, E. Madreseh, M. Akhtari, E. Farhadi, A. Jamshidi, M. Mahmoudi. TP53 gene polymorphism (rs1042522) in rheumatoid arthritis and systemic lupus erythematosus: an updated systematic review and meta-analysis. Current Rheumatology Reviews 2025; :
  41. R. Pauline, G. Kumar Subbaraj. Association of tumor necrosis factor-α (rs1800629) with RA patients and receptor activator of nuclear factor-kappa B (rs1805034) gene polymorphism with rheumatoid arthritis with osteoporosis susceptibility: a systemic review and meta-analysis. Egyptian Rheumatology & Rehabilitation 2025; 52(1): 13.
  42. J. Lim, K. Kim. Genetic variants differentially associated with rheumatoid arthritis and systemic lupus erythematosus reveal disease-specific biology. Scientific Reports 2019; 9(1): 2739.
  43. H. Lu, J. Zhang, Z. Jiang, M. Zhang, T. Wang, H. Zhao, P. Zeng. Detection of genetic overlap between rheumatoid arthritis and systemic lupus erythematosus using GWAS summary statistics. Frontiers in Genetics 2021; 12: 656545.
  44. L. Shakeel, A. Shaukat, N. Khaliq, A. Kashif, A. Mujeeb, Z. Adnan, J. Taj, A. Akilimali. Rheumatoid arthritis: a comprehensive overview of genetic markers, emerging therapies, and personalized medicine. Annals of Medicine and Surgery (London) 2025; 87(2): 696-710.
  45. M. S. Al-Balushi, I. Antony, A. H. Al-Shirawi, H. Al-Riyami, J. Z. Al-Busaidi, C. Y. Koh, K. M. Al-Naamani, S. S. Hasson, A. A. Al-Jabri, E. A. Said. HLA-B58 and HLA-C2 alleles are associated with the occurrence of rheumatoid arthritis among Omanis. Journal of Clinical Medicine 2025; 14(4): 1219.
  46. S. Louahchi, N. Khaldoun, I. Allam, S. Douaoui, A. Galleze, C. Touil-Boukoffa, R. Raache, F. Hanni, R. Djidjik. STAT4 variants and HLA-shared epitope allele involvement in rheumatoid arthritis susceptibility: a study in Algerian patients. Gene Reports 2025; 40: 102243.
  47. K. M. Bravo-Villagra, J. F. Muñoz-Valle, C. J. Baños-Hernández, S. Cerpa-Cruz, J. E. Navarro-Zarza, I. Parra-Rojas, J. A. Aguilar-Velázquez, S. García-Arellano, A. López-Quintero. STAT4 gene variant rs7574865 is associated with rheumatoid arthritis activity and anti-CCP levels in the western but not in the southern population of Mexico. Genes (Basel) 2024; 15(2): 241.
  48. C. Xu, L. W. Khin, H. Z. Tam, L. L. Goh, E. T. Koh, R. Dalan, K. P. Leong. Haptoglobin 2-2 genotype is associated with increased risk of cardiovascular disease in patients with rheumatoid arthritis: a matched case-control study. Frontiers in Medicine (Lausanne) 2024; 11: 1442858.
  49. A. Latini, G. De Benedittis, P. Conigliaro, C. Bonini, C. Morgante, M. Iacovantuono, A. D’Antonio, A. Bergamini, G. Novelli, M. S. Chimenti, C. Ciccacci. The rs11568820 variant in the promoter region of vitamin D receptor gene is associated with clinical remission in rheumatoid arthritis patients receiving tumor necrosis factor inhibitors. Genes (Basel) 2024; 15(2): 234.
  50. J. J. J. Balcázar, D. F. G. Sánchez, G. Q. López. CD20, CTLA4, CXCL9, IL18RAP, IL-6, SOCS2, and TNF as potential biomarkers for rheumatoid arthritis disease progression: systematic review of RNA-seq studies. Heliyon 2025; 11(9): e43108.
  51. M. E. Husni, J. Lin, J. Zhang, U. M. Chandrasekharan. Defining a personalized treatment approach to rheumatoid arthritis: using genetic markers of TNFi response. Seminars in Arthritis and Rheumatism 2025; 70: 152579.
  52. R. A. Alabdulwahed, S. O. Khoja, M. F. Elshal, S. M. Bahlas, S. M. Khojah. Modulation of CD28 and p38 expression by treatment with vitamin D in patients with rheumatoid arthritis. Journal of Biochemical Technology 2019; 10(3): 76-82.
  53. A. S. Mahmood, A. S. Kadhim, Y. W. Issa. Effects of Vitamin D3 level on the gene expression of Immune checkpoint Cytotoxic T-lymphocytes antigen-4 in Iraqi patients with rheumatoid arthritis. Advancements in Life Sciences 2023; 10S(1): 90-94.
  54. S. Aghaei, M. H. Sahami-Fard, S. Gharibi, L. Feizi, E. Farashahi-Yazd, M. Mahmoudi, H. Nikkhah, M. Akhlaghi. The association of SDF-1 and CXCR4 gene polymorphisms with rheumatoid arthritis susceptibility in the Iranian population. Human Gene 2025; 44: 201402.
  55. F. Nezhad Shamohammadi, G. Daryabor, M. R. Ataollahi, S. Meri, K. K. Eklund, K. Kalantar. Epstein-Barr virus infection and toll-like receptor 9 gene variants affect the susceptibility to rheumatoid arthritis. Future Virology 2025; :

Comments