INTRODUCTION
The genus Alnus in the family Betulaceae (Fagales) comprises approximately 29–35 species (Gryta et al., 2017). These include tree and shrub species distributed across the northern temperate zone (Benson and Silvester, 1993; Colagar et al., 2016), with the highest species diversity (18–23 species) in Asia (Kennedy et al., 2010; Haq et al., 2021). Because members of the genus Alnus typically occupy soils with limited nitrogen availability, they have evolved a unique ecological strategy to thrive in such environments (Benson and Silvester, 1993). They establish symbiotic associations with the nitrogen-fixing actinomycete Frankia, which induces specialized root nodules that supply nitrogen to the host plant. This symbiotic relationship confers rapid growth and resilience to Alnus species under nutrient-poor conditions, enabling them to colonize and stabilize marginal habitats (Lee and Tsai, 2018; Yuan et al., 2023). Although the genus Alnus is ecologically unified by its Frankia symbiosis, its taxonomic history and phylogenetic relationships remain complex and merit clarification.
The evolutionary relationships within the birch family (Betulaceae) have been clarified through integration of morphological evidence, molecular datasets, and the fossil record (Bousquet et al., 1992; Wang et al., 2013). Natural hybridization with subsequent introgression is common in this family, and polyploidy has been reported in several lineages (Atkinson, 1992). While these processes are important drivers of diversification in the Betuloideae, they also obscure phylogenetic signal and complicate taxonomic circumscription, particularly in the genera Betula and Alnus (Järvinen et al., 2004; Thomson et al., 2015; Gryta et al., 2017). Historically, the genus Alnus was treated as part of Betula but was subsequently recognized as a distinct genus, a separation later reinforced by morphological and molecular studies (Bousquet et al., 1992; Savard et al., 1993). Furlow’s (1979) monograph further stabilized this framework by renaming Alnaster as subg. Alnobetula and Gymnothyrsus as subg. Alnus, and by elevating Clethropsis to subgeneric rank based on a suite of diagnostic traits.
Within the genus Alnus, recent molecular phylogenies consistently support a three-subgenus framework, with a core clade comprising subg. Alnus and subg. Clethropsis sister to subg. Alnobetula (Navarro et al., 2003; Chen and Li, 2004; Ren et al., 2010; Rochet et al., 2011). Subg. Alnus is characterized by stalked shoot buds and flowering from late winter to spring, and it includes widely distributed species such as A. glutinosa. This historical and phylogenetic context explains why species limits and deeper relationships in Alnus remain challenging. Hybridization and polyploidization blur boundaries, while morphological convergence across subgenera can mask evolutionary signal.
Chloroplasts, the organelles responsible for photosynthesis in green plants, possess genomes that are most often uniparentally inherited. In angiosperms, chloroplast genomes generally span ~120–160 kb and display a conserved quadripartite architecture: two inverted repeats (Ira and IRb) flanking the large single-copy (LSC) and small single-copy (SSC) regions (Chumley et al., 2006; Wicke et al., 2011). The advent of next-generation sequencing (NGS) has enabled plastome-scale studies in many lineages, allowing reconstruction of structural variation, well-supported phylogenies, divergence-time estimation, and identification of gene losses (Blazier et al., 2016; Daniell et al., 2016; Mohanta et al., 2020; Yu et al., 2022; Zhang et al., 2022; Liu et al., 2023). Within the genus Alnus, chloroplast (plastome) genome studies have become increasingly common, documenting genome structure and phylogenetic relationships (Lee et al., 2019; Zhang et al., 2021; Yang et al., 2022). However, plastome studies have not been conducted for A. hirsuta.
Alnus hirsuta Turcz. is a Northeast Asian alder distributed across Korea, China, Japan, and Mongolia. It is morphologically distinguished from the closely related A. japonica by its generally broader ovate to suborbicular leaves (Huh and Huh, 1999). Alnus hirsuta has attracted pharmacological interest because diarylheptanoids isolated from this species exhibit antioxidative and anti-inflammatory activities, underscoring its value as a useful plant resource (Hu and Wang, 2011). Owing to its rapid growth, stress tolerance, and soil-stabilizing root system, A. hirsuta is used for roadside shelterbelts and riparian buffers and is also planted as a street tree in many regions. These attributes collectively highlight the species’ ecological and applied significance and justify focused comparative work on its genome and evolution.
In this study, we aimed to assemble and annotate the complete chloroplast genome of A. hirsuta using NGS. We also aimed to comprehensively analyze chloroplast-genome features—including genome size, GC content, gene content, and organization—together with codon usage and repeat sequences, based on the annotated plastome. Furthermore, we aimed to clarify the phylogenetic placement of A. hirsuta and its relationships with congeners by constructing a phylogenetic tree from concatenated protein-coding genes sampled across Alnus and representative Betulaceae. This provides a theoretical basis for understanding Alnus evolution and for species identification.
MATERIALS AND METHODS
Plant sampling, DNA extraction, and sequencing
Fresh leaf tissue of A. hirsuta was collected from Samhyeri, Yean-myeon, Andong-si (36°43’45.3”N, 128°54’10.6”E) and deposited in the herbarium of Kyungpook National University (KNU) (specimen number: KNU_NI_001). The sample was identified by KyoungSu Choi (Department of Biology, Kyungpook National University). Total genomic DNA was extracted from silica-dried leaves using the DNeasy Plant Mini Kits (Qiagen Corporation, Valencia, CA, USA). Sequencing was performed on the Illumina NovaSeq 6000 platform (Illumina, Inc., San Diego, CA, USA) with paired-end 150-bp reads and an average insert size of 500 bp.
Plastome assembly and annotation
The complete chloroplast genome of A. hirsuta was assembled with GetOrganelle v.1.7.7.1 (Jin et al., 2020). Gene annotation was performed with GeSeq, and tRNA loci were identified with tRNAscan-SE v2.0.9 (Chan et al., 2021). The circular genome map was generated with OGDRAW v1.3.1 (Greiner et al., 2019).
Comparative genomics, divergence hotspot, and repeat analysis
We compiled plastomes from 19 Alnus species for comparative analyses of structural variation. IRscope (Amiryousefi et al., 2018) was used to examine inverted-repeat (IR) lengths and junction positions (LSC–IRb, IRb– SSC, SSC–IRa, IRa–LSC) across all plastomes. Whole plastomes were aligned with MAFFT (Katoh, 2002), and nucleotide diversity (Pi) was estimated in DnaSP v6.9 (Rozas et al., 2017) with a 600-bp sliding window and a 200-bp step. Simple sequence repeats (SSRs) were identified with MISA (Beier et al., 2017), using minimum repeat thresholds of 10, 5, 4, 3, 3, and 3 for mono-, di-, tri-, tetra-, penta-, and hexa-nucleotide motifs, respectively.
Phylogenetic analysis
Phylogenetic inference was based on 76 chloroplast protein-coding genes from the complete chloroplast genomes of 19 Alnus species, with Casuarina glauca as the outgroup. The sequences were aligned with MAFFT v7.4.0.9 (Katoh, 2002) and concatenated into a single data set. A maximum-likelihood (ML) tree was constructed in Geneious Prime (https://www.geneious.com) using RAxML v8.2.11 (Stamatakis, 2014) under the GTRGAMMA model with 1,000 bootstrap replicates.
RESULTS AND DISCUSSION
Plastome feature of A. hirsuta
The nucleotide sequence of A. hirsuta has been deposited in GenBank. The chloroplast genome of A. hirsuta was 160,879 bp in length (Fig. 1) and displayed the typical quadripartite structure, comprising a pair of IRs (26,052 bp), LSC (89,571 bp), and SSC (19,204 bp). Alnus hirsuta exhibited 112 annotated genes, including 78 protein-coding genes, 30 tRNA genes, and 4 rRNA genes. The GC content of the plastome of A. hirsuta was 36.4%.
To provide broader phylogenomic context, we incorporated plastomes from 19 Alnus species from the National Center for Biotechnology Information into our comparative analyses (Table 1). Among these 19 species, plastome lengths ranged from 160,013 bp (A. firma) to 161,117 bp (A. lanata). The plastomes showed no variation in gene number and shared identical GC content. In most angiosperms, chloroplast genomes are typically maternally inherited and undergo limited recombination, which contributes to their highly conserved structures among closely related species (Daniell et al., 2016). Previous studies on Betulaceae (Yang et al., 2022) have demonstrated that chloroplast genomes are highly conserved, and our analyses further confirmed this high level of conservation within the genus Alnus.
IR boundary analysis
Although plastomes remain strongly conserved in overall architecture, shifts at IR boundaries are widely reported. Among closely related species, such boundary movements are typically subtle and involve only the gain or loss of a few genes (Goulding et al., 1996; Yoo et al., 2021; Amenu et al., 2022; Choi et al., 2023; Li et al., 2025; Yan et al., 2025).
We found that the IR boundary analysis of the 19 Alnus plastomes revealed a highly conserved overall structure, with only minor lineage-specific variations (Fig. 2). The LSC/IRb junction was consistently located within the rps19 gene, with approximately 20–37 bp extending into the IRb region. At the IRa/LSC boundary, trnH was adjacent to the junction, whereas rps19 was partially duplicated in IRa.
At the IRb/SSC and SSC/IRa junctions, two distinct boundary configurations were identified, primarily governed by the length and positioning of ndhF and ycf1. In the predominant configuration (Type I), the IRb/SSC boundary lied within ndhF, producing a short ndhF fragment (from single-digit to a few hundred bp) that was duplicated into IRb and overlapped ycf1. The S SC/IRa boundary was typically situated within ycf1, which extended several kilobases into IRa. In the alternative configuration (Type II), the ndhF extension into IRb was markedly reduced or absent, resulting in a slightly different junction architecture. Consistent with this genus-wide pattern, A. hirsuta conformed to Type I—ndhF crossed into IRb and ycf1 extended into IRa—while retaining the conserved LSC/IR boundaries (i.e., rps19 within IRb and trnH adjacent to the IRa/LSC junction). Despite these minor lineage-specific differences, the plastomes of the genus Alnus exhibited an overall stable and conserved IR–SC boundary framework, consistent with previous studies reporting minimal divergence of IR–SC junctions within Betulaceae (Yang et al., 2022).
SSRs analysis
SSRs are tandemly repeated DNA motifs of 1–10 bp per repeat unit, dispersed throughout the genome. Because of their high allelic variability, SSRs are informative molecular markers widely used for genotyping and population-genetic analyses (Fu et al., 2016; Ahmad et al., 2018). We identified 100 SSRs in A. hirsuta, comprising 52 mononucleotides, 21 dinucleotides, 8 trinucleotides, 16 tetranucleotides, 2 pentanucleotides, and 1 hexanucleotide (Fig. 3).
The total number of chloroplast SSRs across the 19 Alnus plastomes ranged from 89 in A. firma to 112 in A. nitida. Mononucleotide repeats were the most abundant, with the highest count in A. ferdinandi-coburgii (58). Dinucleotide repeats reached a maximum of 24 in A. incana. Trinucleotide, tetranucleotide, pentanucleotide, and hexanucleotide repeats were comparatively rare, with maximum counts of 12 (A. cremastogyne), 18 (A. alnobetula), 5 (A. nitida, A. rubra, A. ferdinandi-coburgii, A. lanata, and A. cremastogyne), and 3 (A. incana), respectively.
Comparative divergence hotspots
Plastome-wide alignments across the family Betulaceae typically show high similarity, with variability concentrated in the single-copy regions. In the present study, the sliding-window analysis (Fig. 4) using the Alnus dataset also revealed higher nucleotide diversity in the LSC and SSC than in the IRs, and non-coding intervals were more variable than protein-coding regions. Peaks of diversity within the genus Alnus were centered on intergenic spacers adjacent to cemA–petA, ycf2, rpl32–trnL, and psaC–ndhE. These patterns agreed with the broader Betulaceae tendency for divergence hotspots to cluster in non-coding LSC/SSC spacers, though some locus-level differences were evident. Previous family-wide analyses (Yang et al., 2022) highlighted five coding genes with elevated nucleotide polymorphisms (Pi) (psaI, ycf1, rpl22, psaJ, and cemA) and identified rpl22 as a promising barcode owing to its moderate variability and suitable length. They also recovered highly variable non-coding intervals, including trnT_GGT–psbD, trnE_TTC–trnT_GGT, ndhC–trnV_UAC, trnH–psbA, and ycf4–cemA. In contrast, our findings identified cemA–petA and psaC–ndhE as prominent lineage-specific peaks, whereas some previously noted hotspots, including trnT–psbD, were less pronounced. Together, these results indicate that the regional architecture of variability is conserved across Betulaceae, but the exact positions of maxima vary among lineages. Hence, it is important to tailor chloroplast markers to the focal clade. Within the genus Alnus, spacers flanking cemA, psaC/ndhE, and rpl32–trnL, along with ycf2, appear especially informative for phylogenetic inference and species delimitation.
Phylogenetic analysis of Alnus
Previous molecular phylogenetic studies (Navarro et al., 2003; Chen and Li, 2004; Colagar et al., 2016; Gryta et al., 2017) have consistently indicated the monophyly of the genus Alnus, but the taxonomic relationships within subgenus Alnus remained poorly resolved. In nrITS-based phylogenetic analyses (Navarro et al., 2003; Colagar et al., 2016), A. hirsuta exhibited a highly ambiguous taxonomic placement. Although analyses based on the chloroplast genome (Meucci et al., 2021; Zhang et al., 2021; Yang et al., 2022) and nuclear genes have been conducted for the genus Alnus, these analyses and datasets did not include A. hirsuta. The inferred molecular phylogeny shows that all species of the genus form a monophyletic group (Fig. 5). Subgenus Alnobetula (A. firma and A. alnobetula subsp. alnobetula) separates from other Alnus species. However, one species of the subgenus Clethropsis (A. maritima) and two species of subgenus Alnus (A. rubra and A. acuminata) were intermingled (100% bootstrap values). We found that A. hirsuta is closely related to A. fauriei with a 100% bootstrap value and forms a sister group to A. formosana. In conclusion, our findings elucidate the relationships among the close relatives of A. hirsuta, which had previously remained unclear.








