The complete chloroplast genome of Diarthron linifolium (Thymelaeaceae), a species found on a limestone outcrop in eastern Asia

Article information

Korean J. Pl. Taxon. 2021;51(4):345-352
Publication date (electronic) : 2021 December 31
doi : https://doi.org/10.11110/kjpt.2021.51.4.345
1Department of Medical & Biological Sciences, The Catholic University of Korea, Bucheon 14662, Korea
2Department of Biology, Daejeon University, Daejeon 34520, Korea
3InfoBoss Inc., Seoul 07766, Korea
4InfoBoss Research Center, Seoul 07766, Korea
*Author for correspondence: soh42@dju.kr, starflr@infoboss.co.kr

Editor: Tae-Soo JANG

Received 2021 November 04; Revised 2021 November 25; Accepted 2021 December 02.

Abstract

Diarthron linifolium Turcz. is an annual herb usually found in sandy soil or limestone areas. Plants in the genus Diarthron are known to have toxic chemicals that may, however, be potentially useful as an anticancer treatment. Diarthron linifolium is a unique species among the species of the genus distributed in Korea. Here, we determine the genetic variation of D. linifolium collected in Korea with a full chloroplast genome and investigate its evolutionary status by means of a phylogenetic analysis. The chloroplast genome of Korean D. linifolium has a total length of 172,644 bp with four subregions; 86,158 bp of large single copy and 2,858 bp of small single copy (SSC) regions are separated by 41,814 bp of inverted repeat (IR) regions. We found that the SSC region of D. linifolium is considerably short but that IRs are relatively long in comparison with other chloroplast genomes. Various simple sequence repeats were identified, and our nucleotide diversity analysis suggested potential marker regions near ndhF. The phylogenetic analysis indicated that D. linifolium from Korea is a sister to the group of Daphne species.

Diarthron linifolium Turcz. is an annual herb mainly distributed in eastern Asia including western China, Mongolia, Far East Russia, and Korea. Plants belonging to D. linifolium are usually found in sandy soil or limestone area. Plants of Diarthron are known to be very toxic (Tan, 1982). It has been shown that species of Diarthron contain sesquiterpenoids in the roots with antineoplastic activities, suggesting that they may be useful for anticancer treatment (Sun et al., 2018). The Korean plants of D. linifolium are the southernmost population of the species and isolated from its main distribution range. It is recently designated as a vulnerable species listed by the Korean government (https://species.nibr.go.kr) because it is hardly found, and its distribution is confined in limestone area. Thus, detailed understanding of genetic diversity and biogeographic origin is needed for the conservation of the species.

Tan (1982) summarized the taxonomic status of the genus Diarthron by suggesting new combinations of taxonomic ranks and subgroups representing three subgenera, Diarthron (two species in two sections, Diarthron and Arthrochlamys), Dendrostellera (eight species), and Stelleropsis (nine species in two sections, Stelleropsis and Turcomanica), which are mainly distinguished from habit (herbaceous vs. suffrutescent or shrubby) or floral morphology (sericeous-villous vs. glabrous or sparsely pubescent of perianth) (Tan, 1982). Diarthron linifolium is similar to Diarthron vesiculosum (Fisch. & C. A. Mey. ex Kar. & Kir.) C. A. Mey. in gross morphology and habit in that both species are annual herb with slender stems and racemes but distinctly recognized by the number of stamens, 4 or 5 in D. linifolium but 8 in D. vesiculosum. Interestingly, the distribution of D. linifolium is not overlapped that of D. vesiculosum that is more concentrated in northwestern Central Asia.

The phylogenetic position of Diarthron including D. linifolium is not fully assessed so far. The close relationship of Diarthron to Stellera was proposed based on the gross morphology (Tan, 1982) and molecular phylogeny (Galicia-Herbada, 2006) but other molecular phylogenetic studies with species in the family, Thymelaeaceae, suggested that Diarthron is more closely related to Daphne and Thymelaea with its sister relationship to Stellera and Wikstroemia (Van der Bank et al., 2002; Beaumont et al., 2009). In contrast, Qian et al. (2021) suggested closer relationship of Diarthron to Stellera and Wikstroemia with moderate bootstrap support and posterior probability from nuclear rDNA internal transcribed spacer (ITS) sequence analysis (Qian et al., 2021).

Recent rapid development of next-generation and third-generation sequencing technologies (Roberts et al., 2013; Deamer et al., 2016; Goodwin et al., 2016) have facilitated organelle genome research, resulting in more than 10,000 chloroplast genomes in National Center for Biotechnology Information (NCBI). It indicates that chloroplast genomes of closely related species within the family will be enough to conduct comparative analysis of Diathron. At least 29 chloroplast genomes of Thymelaeaceae are available (Supplementary Table 1), which is favorable to conduct comparative analysis using chloroplast genome of D. linifolium. As part of genetic and biogeographic study of D. linifolium, we first determined and presented the complete chloroplast genome of D. linifolium in this study and conducted comparative analyses.

Materials and Methods

De novo assembly and annotation of Diarthron linifolium chloroplast genome

The sample of D. linifolium for DNA isolation was collected at the limestone beds in the lakeshore of the Lake Cheongpungho, Jecheon-si, Chungcheongbuk-do, Korea (37º1′44″N, 128º11′15″E). Voucher specimen was deposited in the herbarium of Daejeon University (TUT) under the number Oh 8252. Total DNA was extracted from fresh leaves using DNeasy Plant Mini Kit (QIAGEN, Hilden, Germany). The DNA library for the high-throughput short read sequencing was prepared with the protocol provided by the Illumina TruSeq DNA Library Prep Kit v2 (Illumina, San Diego, CA, USA) and the whole genome sequencing was conducted with Illumina NovaSeq6000 at Macrogen, Korea. 6.9-Gbp raw reads were used for chloroplast de novo genome assembly with Velvet v1.2.10 (Zerbino and Birney, 2008) after filtering raw reads using Trimmomatic v0.33 (Bolger et al., 2014). After getting first draft of chloroplast genome sequences, gap was filled with GapCloser v1.12 (Zhao et al., 2011) and all bases from the assembled sequences were confirmed using BWA v0.7.17 (Li, 2013) and SAMtools v1.9 (Li et al., 2009). All bioinformatic processes for de novo assembly was conducted by the pipeline, Genome Information System (GeIS; http://geis.infoboss.co.kr), which has been efficiently utilized and optimized in the previous studies (Min et al., 2019; Yun et al., 2019; Kim et al., 2021a, 2021b; Park et al., 2021b).

Geneious Prime v2020.2.4 software (Biomatters Ltd, Auckland, New Zealand) was used for chloroplast genome annotation for which we utilized information from Daphne genkwa chloroplast genome (MT754180) (Yoo et al., 2021) by transferring annotations with corrections of exceptional cases including missing start or stop codons. tRNA was predicted and confirmed using tRNAScan-SE v2.0.6 (Schattner et al., 2005).

Identification of simple sequence repeats on D. linifolium chloroplast genome

Simple sequence repeats (SSRs) were identified on the chloroplast genome sequence using the pipeline of the SSR database (SSRDB; http://ssrdb.infoboss.co.kr/) which has been utilized in various organelle genomic researches (Kim et al., 2019; Lee et al., 2020; Park et al., 2020a, 2021a; Choi et al., 2021). The SSR is conventionally recognized as the nucleotide array composed of repeats with one or up to six base pair units. For example, monoSSR refers an array of nucleotide repeats containing a particular base and hexaSSR an array of nucleotide repeats containing six base pair unit. The overall length of SSR is mostly over 10 base pairs. In this study, we tried to classify SSR with more criteria which has been applied in previous analyses (Gandhi et al., 2010; Chen et al., 2015; Cheng et al., 2016; Shukla et al., 2018; Jeon and Kim, 2019; Li et al., 2019). The criteria applied are (1) ‘normal SSR’ as a conventional definition from monoSSR to hexaSSR, (2) ‘extented SSR’ referring from heptaSSR (repeats of 7 bp unit) to decaSSR (repeats of 10 bp unit), and (3) ‘potential SSR’ referring specific cases with only 2 units in pentaSSR and hexaSSR. These criteria have been applied and provided better understanding of SSR patterns in previous analyses in chloroplast genomes of Dysphania species (Kim et al., 2019), Arabidopsis thaliana (L.) Heynh. (Park et al., 2020a), Chenopodium album L. (Park et al., 2021a), and mitochondrial genome of Rosa rugosa Thunb. (Park et al., 2020b).

Nucleotide diversity analysis of chloroplast genomes

We calculated nucleotide diversity in the full chloroplast genomes from eight taxa of the genus Daphne and Diarthron linifolium using the method proposed by Nei and Li (Nei and Li, 1979) with the perl script. Nucleotide diversities were scanned along the genome with 500-bp window size and 200-bp step size for overlapped sliding windows. Our script for nucleotide diversity calculation has been utilized in previous studies (Kim et al., 2019; Lee et al., 2020; Park et al., 2020a, 2020c, 2021a; Lee and Park, 2021; Park and Xi, 2021).

Construction of phylogenetic trees

Whole chloroplast genomes of 31 Thymelaeaceae including one outgroup species, Aquilaria rostrata (Hishamuddin et al., 2020), were aligned using MAFFT v7.450 software (Katoh and Standley, 2013). The maximum-likelihood (ML) and Neighbor-joining (NJ) trees were inferred in MEGA X (Kumar et al., 2018) using heuristic search with Nearest Neighbor Interchange (NNI) branch swapping, Tamura-Nei model, and uniform rates among sites. All other options were set as default. We performed bootstrap analyses with 1,000 and 10,000 pseudoreplicates for ML and NJ methods, respectively, with the same option. Posterior probability of each node was estimated by Bayesian inference using MrBayes v3.2.6 (Huelsenbeck and Ronquist, 2001) implemented in the software package, Geneious Prime v2020.2.4. The HKY85 model with gamma rates was used as a molecular model. A Markov chain Monte Carlo algorithm was employed for 1,100,000 generations with four chains running simultaneously. To build a consensus tree, we sampled trees every 200 generations after removing trees from the first 100,000 generations as burn-in.

Results and Discussion

Complete chloroplast genome of Diarthron linifolium

Chloroplast genome of D. linifolium is 172,644 bp and has four subregions: 86,158 bp of large single copy (LSC), 2,858 bp of small sing copy (SSC) regions, and two inverted repeat regions (IR; 41,814 bp) separating LSC and SSC in circular genome structure (Fig. 1). In the LSC and SSC regions, we identified 138 genes (92 protein-coding genes, eight rRNAs, and 38 tRNAs) while 28 genes (16 protein-coding genes, eight tRNAs, and four rRNAs) are duplicated in two IR regions (Fig. 1). The SSC region of Diarthron linifolium chloroplast genome is extremely short, which is also found in the chloroplast genomes of neighbor species, including Daphne species (Cho et al., 2018; Yan et al., 2019a, 2019b, 2021) except Daphne genkwa Siebold & Zucc. (Yoo et al., 2021). The overall GC content is 36.3% and those in the LSC, SSC, and IR regions are 35.0%, 29.3%, and 38.9%, respectively. Chloroplast genome sequence can be accessed via accession number MW566785 in GenBank of NCBI at https://www.ncbi.nlm.nih.gov. The associated BioProject, SRA, and Bio-Sample numbers are PRJNA692680, SRR13485453, and SAMN17360683, respectively.

Fig. 1.

Circular map, simple sequence repeat analysis, and nucleotide diversity analysis of Diarthron linifolium complete chloroplast genome. A. The genes located outside of the circle are transcribed clockwise, while those located inside are transcribed counterclockwise. The dark gray plot in the inner circle corresponds to GC content. Large single copy (LSC), small single copy (SSC), and inverted repeat (IR) are indicated with LSC, SSC, and IR (IRA and IRB), respectively. B. X-axis presents simple sequence repeat (SSR) types based on motif length and Y-axis shows frequency of SSRs. Based on SSR types, different colors were used along with normal SSRs, potential SSRs, and extended SSRs. Each bar contains two parts: genic SSRs displayed in the bottom of bars and intergenic SSRs occupied in the upper part of each bar. C. X-axis shows chloroplast coordination and Y-axis presented nucleotide diversity calculated based on the sliding window method (See “Materials and Methods”). Blue graph indicates nucleotide diversity along with chloroplast genomic coordination and black labels on the peak indicate position of peaks based on gene names around them. Blue-colored arrows indicate LSC and SSC regions and orange-colored arrows mean IR regions.

Identification of SSRs on Diarthron linifolium chloroplast genomes

SSRs have been used for developing molecular markers (Huang et al., 2015; Li et al., 2020a, 2020b). We identified 90 SSRs as normal SSRs, 542 potential SSRs, and 63 extended SSRs from the analysis using SSRDB pipeline (See “Materials and Methods”) (Fig. 1B, Supplementary Table 2). Our results indicated that the number of identified SSRs from D. linifolium is higher than those identified in Chenopodium album (Park et al., 2021a), Arabidopsis thaliana (Park et al., 2020a) and four Dysphania species (Kim et al., 2019). This result can be explained that the chloroplast genome size of D. linifolium is larger than those in aforementioned species. In the case of the Plantago depressa Willd. (Kwon et al., 2019), and Plantago fengdouensis (Z. E. Chao & Yong Wang) Yong Wang & Z. Yu Li (Wang et al., 2020) chloroplast genomes, of which length is similar to D. linifolium, numbers of normal and potential SSRs are still smaller (Park et al., unpubl. data), suggesting that D. linifolium chloroplast genome contains relatively large number of SSRs.

In detail, monoSSR displayed the high proportion (71.11%; 64 monoSSRs) among normal SSRs and HeptaSSR exhibited the highest among extended SSRs (66.67%; 42 HeptaSSRs) (Fig. 1B). This trend of numbers of normal and extended SSRs was also found in all four species. Seventeen genic normal (18.88%), 171 genic potential (31.55%), and 13 genic extended SSRs (20.63%) were identified (Fig. 1B), displaying the different proportion of genic SSRs along with three types of SSRs. Among them, ycf1 and ycf2 contained the largest number of SSRs (46 and 36, respectively) (Supplementary Fig. S1). These SSRs may have more intraspecific or interspecific variations because these genes displayed high nucleotide diversities in various plant species (Hong et al., 2017; Jiang et al., 2017; Liu et al., 2018; de Souza et al., 2019; Kim et al., 2019; Li et al., 2019; Park et al., 2020c, 2021a; Loeuille et al., 2021), which can be considered as molecular markers to distinguish species or intraspecific taxa (Neubig et al., 2009; Dong et al., 2015). The remaining genic SSRs will not have many variations in comparison to those in ycf1 and ycf2, which will be utilized as molecular markers to recognize relatively high level of taxa.

Nucleotide diversity analysis of Diarthron linifolium and Daphne chloroplast genomes

Using eight Daphne and Diarthron linifolium chloroplast genomes, the nucleotide diversity was investigated to understand variable regions on chloroplast genomes. We excluded two Daphne genkwa chloroplast genomes because it was clustered into the group consisting of Wikstroemia species (Yoo et al., 2021). Overall nucleotide diversity was 0.00244, which is lower than those calculated in four Dysphaina species (0.0068) (Kim et al., 2019) and eight Plantago species (0.01751) (Park et al., unpubl. data) but higher than those from Agrimonia species (0.000727) (Park et al., unpubl. data) and four Viburnum species (0.00176) (Park et al., 2020c).

Eight peaks over 0.01 in nucleotide diversity were identified (Fig. 1C). Seven peaks were found in intergenic regions but one peak in the genic region harboring ndhF (Fig. 1C). The peak in the gene, ndhF was located in the junction between IR and SSC and only two out of nine chloroplast genomes were annotated as ndhF genic region (Supplementary Fig. S2). These regions with high nucleotide diversity can be considered as candidates of molecular markers to distinguish species of Daphne and Diarthron.

Phylogenetic analysis of Thymelaeaceae chloroplast genomes

Phylogenetic tree exhibited that Diarthron linifolium was placed as a sister of the group composed of species in the genus Daphne (Fig. 2). Our results of genome data are consistent with previous analyses of a few chloroplast regions such as rbcL (Van der Bank et al., 2002; Beaumont et al., 2009) and whole plastid genomes (Yoo et al., 2003; He et al., 2021). Although more species of Diarthron should be included in further phylogenetic analysis, the current chloroplast genome data suggest that Diarthron is distantly related with Stellera. Similarity between Diarthron and Stellera in gross morphology, both are adapted to dry environment, may have derived independently. Diarthron is easily distinguished from Stellera by having 4-merous flowers, while Stellera has 5- or 6-merous flowers.

Fig. 2.

Phylogenetic tree inferred from 31 chloroplast genomes in Thymelaeaceae. Maximum-Likelihood tree is presented. The numbers above branches indicate bootstrap support values of maximum-likelihood (1,000 bootstrap repeats) and Neighbor-joining methods (10,000 bootstrap repeats), and posterior probabilities from Bayesian inference.

Supplementary Materials

Online supplemental data can be found at https://www.e-kjpt.org.

Supplementary Table 1.

List of the 31 chloroplast genomes used in this study.

Supplementary Table 2.

List of the SSRs identified on the D. linifolium chloroplast genome

Supplementary Fig. S1.

Number of simple sequence repeats (SSRs) located in genic regions along with protein-coding genes (PCGs)

Supplementary Fig. S2.

Nucleotide alignment of the nine Daphne and Diarthron chloroplast genomes around the small single copy (SSC) region.

Acknowledgements

We are grateful to Hye Ryun Na for the help of fieldwork and Yun-Gyeong Choi for the laboratory assistance. This work was supported by The Catholic University of Korea under the new faculty support grant (No. M-2020-B0014-00004) to S-TK.

Notes

Conflicts of Interest

Sang-Hun OH, a contributing editor of the Korean Journal of Plant Taxonomy, was not involved in the editorial evaluation or decision to publish this article. All remaining authors have declared no conflicts of interest.

References

Beaumont AJ, Edwards T. J., Manning J, Maurin O, Rautenbach M, Motsi MC, Fay MF, Chase MW, Van der Bank M. 2009; Gnidia (Thymelaeaceae) is not monophyletic: Taxonomic implications for Thymelaeoideae and a partial new generic taxonomy for. Gnidia Botanical Journal of the Linnean Society 160:402–417.
Bolger A.M, Lohse M., Usadel B. 2014;Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30:2114–2120.
Chen J, Hao Z, Xu H, Yang L, Liu G, Sheng Y, Zheng C, Cheng W, Cheng T, Shi J. 2015;The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng. Frontiers in Plant Science 6:447.
Cheng J, Zhao Z, Li B, Qin C, Wu Z, Trejo-Saavedra D. L., Luo X, Cui J, Rivera-Bustamante RF, Li S, Hu K. 2016;A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum . Scientific Reports 6:18919.
Cho W.-B., Han E.-K, Choi G, Lee J.-H. 2018;The complete chloroplast genome of Daphne kiusiana, an evergreen broad-leaved shrub on Jeju Island. Conservation Genetics Resources 10:103–106.
Choi N.J, Xi H, Park J. 2021;A comparative analyses of the complete mitochondrial genomes of fungal endosymbionts in Sogatella furcifera, white-backed planthoppers. International Journal of Genomics 2021:6652508.
de Souza UJB, Nunes R., Targueta CP, Diniz-Filho JAF, de Campos Telles M. P.. 2019;The complete chloroplast genome of Stryphnodendron adstringens (Leguminosae-Caesalpinioideae): Comparative analysis with related Mimosoid species. Scientific Reports 9:14206.
Deamer D, Akeson M, Branton D. 2016;Three decades of nanopore sequencing. Nature Biotechnology 34:518–524.
Dong W, Xu C, Li C, Sun J, Zuo Y, Shi S, Cheng T, Guo J, Zhou S. 2015; ycf1, the most promising plastid DNA barcode of land plants. Scientific Reports 5:8348.
Galicia-Herbada D. 2006;Origin and diversification of Thymelaea (Thymelaeaceae): Inferences from a phylogenetic study based on ITS (rDNA) sequences. Plant Systematics and Evolution 257:159–187.
Gandhi SG, Awasthi P, Bedi Y. S. 2010;Analysis of SSR dynamics in chloroplast genomes of Brassicaceae family. Bio-information 5:16–20.
Goodwin S, McPherson JD, McCombie WR. 2016;Coming of age: Ten years of next-generation sequencing technologies. Nature Reviews Genetics 17:333–351.
He L, Zhang Y, Lee S. Y. 2021;Complete plastomes of six species of Wikstroemia (Thymelaeaceae) reveal paraphyly with the monotypic genus Stellera . Scientific Reports 11:13608.
Hishamuddin MS, Lee S. Y., Ng W. L., Ramlee S. I., Lamasudin DU, Mohamed R. 2020;Comparison of eight complete chloroplast genomes of the endangered Aquilaria tree species (Thymelaeaceae) and their phylogenetic relationships. Scientific Reports 10:13034.
Hong S.-Y, Cheon K.-S, Yoo K.-O, Lee H.-O, Cho K.-S, Suh J.-T, Kim S.-J, Nam J.-H, Sohn H.-B, Kim Y.-H. 2017;Complete chloroplast genome sequences and comparative analysis of Chenopodium quinoa and C. album . Frontiers in Plant Science 8:1696.
Huang J, Yang X, Zhang C, Yin X, Liu S, Li X. 2015;Development of chloroplast microsatellite markers and analysis of chloroplast diversity in Chinese jujube (Ziziphus jujuba Mill.) and wild jujube (Ziziphus acidojujuba Mill.). PLoS ONE 10:e0134519.
Huelsenbeck J.P, Ronquist F. 2001;MRBAYES: Bayesian inference of phylogenetic trees. Bioinformatics 17:754–755.
Jeon J.-H, Kim S.-C. 2019;Comparative analysis of the complete chloroplast genome sequences of three closely related East-Asian wild roses (Rosa sect. Synstylae; Rosaceae). Genes 10:23.
Jiang D, Zhao Z, Zhang T, Zhong W, Liu C, Yuan Q, Huang L. 2017;The chloroplast genome sequence of Scutellaria baicalensis provides insight into intraspecific and interspecific chloroplast genome diversity in Scutellaria . Genes 8:227.
Katoh K, Standley D. M. 2013;MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Molecular Biology and Evolution 30:772–780.
Kim M, Xi H, Park J. 2021a;Genome-wide comparative analyses of GATA transcription factors among 19 Arabidopsis ecotype genomes: Intraspecific characteristics of GATA transcription factors. PLoS ONE 16:e0252181.
Kim M, Xi H, Park S, Yun Y, Park J. 2021b;Genome-wide comparative analyses of GATA transcription factors among seven Populus genomes. Scientific Reports 11:16578.
Kim Y, Park J, Chung Y. 2019;Comparative analysis of chloroplast genome of Dysphania ambrosioides (L.) Mosyakin & Clemants understanding phylogenetic relationship in genus Dysphania R. Br. Korean Journal of Plant Resources 32:644–668.
Kumar S., Stecher G, Li M, Knyaz C, Tamura K. 2018;MEGA X: Molecular evolutionary genetics analysis across computing platforms. Molecular Biology and Evolution 35:1547–1549.
Kwon W, Kim Y, Park C.-H, Park J. 2019;The complete chloroplast genome sequence of traditional medical herb, Plantago depressa Willd. (Plantaginaceae). Mitochondrial DNA Part B Resources 4:437–438.
Lee B, Park J. 2021;The complete chloroplast genome of Zoysia japonica Steud. isolated in Korea (Poaceae): Investigation of potential molecular markers on Z. japonica chloroplast genomes. Plant Biotechnology Reports 15:707–715.
Lee J, Park J, Xi H, Park J. 2020;Comprehensive analyses of the complete mitochondrial genome of Figulus binodulus (Coleoptera: Lucanidae). Journal of Insect Science 20:10.
Li B, Lin F, Huang P, Guo W, Zheng Y. 2020a;Development of nuclear SSR and chloroplast genome markers in diverse Liriodendron chinense germplasm based on low-coverage whole genome sequencing. Biological Research 53:21.
Li C, Zheng Y, Huang P. 2020b;Molecular markers from the chloroplast genome of rose provide a complementary tool for variety discrimination and profiling. Scientific Reports 10:12188.
Li H. 2013. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. arXiv preprint at https://arxiv.org/abs/1303.3997 .
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. 1000;Genome Project Data Processing Subgroup. 2009. The sequence alignment/map format and SAMtools. Bioinformatics 25:2078–2079.
Li W, Zhang C, Guo X, Liu Q, Wang K. 2019;Complete chloroplast genome of Camellia japonica genome structures, comparative and phylogenetic analysis. PLoS ONE 14:e0216645.
Liu W, Kong H, Zhou J, Fritsch PW, Hao G, Gong W. 2018;Complete chloroplast genome of Cercis chuniana (Fabaceae) with structural and genetic comparison to six species in Caesalpinioideae. International Journal of Molecular Sciences 19:1286.
Loeuille B, Thode V, Siniscalchi C, Andrade S, Rossi M, Pirani J. R. 2021;Extremely low nucleotide diversity among thirty-six new chloroplast genome sequences from Aldama (Heliantheae, Asteraceae) and comparative chloroplast genomics analyses with closely related genera. PeerJ 9:e10886.
Min J, Kwon W, Xi H, Park J. 2019;The complete chloroplast genome of Leucobryum juniperoideum (brid.) C. Müll. (Leucobryaceae, Bryophyta). Mitochondrial DNA Part B Resources 4:2962–2963.
Nei M, Li WH. 1979;Mathematical model for studying genetic variation in terms of restriction endonucleases. Proceedings of the National Academy of Sciences of the United States of America 76:5269–5273.
Neubig KM, Whitten WM, Carlsward BS, Blanco MA, Endara L, Williams NH, Moore M. 2009;Phylogenetic utility of ycf 1 in orchids: A plastid gene more variable than matK . Plant Systematics and Evolution 277:75–84.
Park J, Min J, Kim Y, Chung Y. 2021a;The comparative analyses of six complete chloroplast genomes of morphologically diverse Chenopodium album L. (Amaranthaceae) collected in Korea. International Journal of Genomics 2021:6643444.
Park J, Xi H. 2021;Investigation of nucleotide diversity based on 17 sea cucumber mitochondrial genomes and assessment of sea cucumber mitochondrial gene markers. Advances in Oceanography and Marine Biology 2:2021.
Park J, Xi H, Kim Y. 2020a;The complete chloroplast genome of Arabidopsis thaliana isolated in Korea (Brassicaceae): An investigation of intraspecific variations of the chloroplast genome of Korean A. thaliana . International Journal of Genomics 2020:3236461.
Park J, Xi H, Kim Y. 2021b;The complete mitochondrial genome of Arabidopsis thaliana (Brassicaceae) isolated in Korea. Korean Journal of Plant Taxonomy 51:176–180.
Park J, Xi H, Kim Y, Nam S, Heo K.-I. 2020b;The complete mitochondrial genome of new species candidate of Rosa rugosa (Rosaceae). Mitochondrial DNA Part B Resources 5:3435–3437.
Park J, Xi H, Oh S.-H. 2020c;Comparative chloroplast genomics and phylogenetic analysis of the Viburnum dilatatum complex (Adoxaceae) in Korea. Korean Journal of Plant Taxonomy 50:8–16.
Qian S, Zhang Y, Lee S. Y. 2021;Comparative analysis of complete chloroplast genome sequences in Edgeworthia (Thymelaeaceae) and new insights into phylogenetic relationships. Frontiers in Genetics 12:643552.
Roberts RJ, Carneiro MO, Schatz MC. 2013;The advantages of SMRT sequencing. Genome Biology 14:405.
Schattner P, Brooks AN, Lowe TM. 2005;The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Research 33(Suppl):W686–W689.
Shukla N, Kuntal H, Shanker A, Sharma SN. 2018;Mining and analysis of simple sequence repeats in the chloroplast genomes of genus Vigna . Biotechnology Research and Innovation 2:9–18.
Sun D.-X, Zhao D, Wei H.-Y, Ma X.-L, Shi L.-L, Zhang J. 2018;Four new sesquiterpenoids from the roots of Diarthron tianschanica with their antineoplastic activity. Molecules 23:1383.
Tan K. 1982;Studies in the Thymelaeaceae. III. The status of Diarthon, Dendrostellera, Stelleropsis and Stellera . Notes from the Royal Botanic Garden Edinburgh 40:213–221.
Van der Bank M, Fay M. F, Chase M. W. 2002;Molecular phylogenetics of Thymelaeaceae with particular reference to African and Australian genera. Taxon 51:329–339.
Wang Q, Mao L.-H, Ding B, Li M.-T, Fu Z.-X, Deng H.-P. 2020;The complete chloroplast genome of Plantago fengdouensis (Plantaginaceae): An endemic and endangered species from China. Mitochondrial DNA Part B Resources 5:51–52.
Yan F, Tao X, Wang Q.-L, Juan ZY, Zhang C.-M, Yu H. L. 2019a;The complete chloroplast genome sequence of the medicinal shrub Daphne giraldii Nitsche. (Thymelaeaceae). Mitochondrial DNA Part B Resources 4:2685–2686.
Yan F, Wang Q.-L, Zhang Y.-J, Zhang C.-M, Chen Y. 2019b;The complete chloroplast genome sequence of medicinal plant, Daphne tangutica Maxim. (Thymelaeaceae). Mitochondrial DNA Part B Resources 4:1776–1777.
Yan F, Zhang C.-Y, Wang Q.-L, Wang J.-D, Wang H.-P, Xu T, Wang E.-J, Hou L.-Y. 2021;Characterization of the complete chloroplast genome sequence of Daphne retusa Hemsl. (Thymelaeaceae), a rare alpine plant species in northwestern China. Mitochondrial DNA Part B Resources 6:2139–2141.
Yoo S.-C, Oh S.-H, Park J. 2021;Phylogenetic position of Daphne genkwa (Thymelaeaceae) inferred from complete chloroplast data. Korean Journal of Plant Taxonomy 51:171–175.
Yun N, Park J, Oh S.-H. 2019;The complete chloroplast genome of the traditional medicinal plant Stellera chamaejasme L. (Thymelaeaceae). Mitochondrial DNA Part B Resources 4:1796–1797.
Zerbino DR, Birney E. 2008;Velvet: algorithms for de novo short read assembly using de Bruijn graphs. Genome Research 18:821–829.
Zhao Q.-Y, Wang Y, Kong Y.-M, Luo D, Li X, Hao P. 2011;Optimizing de novo transcriptome assembly from short-read RNA-Seq data: A comparative study. BMC Bioinformatics 12(Suppl 14):S2.

Article information Continued

Fig. 1.

Circular map, simple sequence repeat analysis, and nucleotide diversity analysis of Diarthron linifolium complete chloroplast genome. A. The genes located outside of the circle are transcribed clockwise, while those located inside are transcribed counterclockwise. The dark gray plot in the inner circle corresponds to GC content. Large single copy (LSC), small single copy (SSC), and inverted repeat (IR) are indicated with LSC, SSC, and IR (IRA and IRB), respectively. B. X-axis presents simple sequence repeat (SSR) types based on motif length and Y-axis shows frequency of SSRs. Based on SSR types, different colors were used along with normal SSRs, potential SSRs, and extended SSRs. Each bar contains two parts: genic SSRs displayed in the bottom of bars and intergenic SSRs occupied in the upper part of each bar. C. X-axis shows chloroplast coordination and Y-axis presented nucleotide diversity calculated based on the sliding window method (See “Materials and Methods”). Blue graph indicates nucleotide diversity along with chloroplast genomic coordination and black labels on the peak indicate position of peaks based on gene names around them. Blue-colored arrows indicate LSC and SSC regions and orange-colored arrows mean IR regions.

Fig. 2.

Phylogenetic tree inferred from 31 chloroplast genomes in Thymelaeaceae. Maximum-Likelihood tree is presented. The numbers above branches indicate bootstrap support values of maximum-likelihood (1,000 bootstrap repeats) and Neighbor-joining methods (10,000 bootstrap repeats), and posterior probabilities from Bayesian inference.