Abstract:
Angelica dahurica is a common kind of medicine and food homology, which is not only a common clinical traditional Chinese medicine, but also a spice, with a wide range of uses. In order to obtain the whole genome sequence information of A. dahurica, this study used A. dahurica var. formosana leaf DNA as material, and the Nanopore sequencing technology was used to establish its nucleotide sequences database, then genome assembly, function annotation and evolution analysis were carried out by bioinformatic methods. The results were as follows: (1) 662 Gb the third-generation data were obtained after the original sequencing data, with the Read N50 about 32 932 bp. The assembled A. dahurica var. formosana genome size was 5.6 Gb, contig N50 being about 806 638 bp. (2) 66.47% of the genes from the assembled sequence got gene annotation after being compared with functional databases such as NR, KOG and KEGG. The result of KOG gene annotation was that the protein function of A. dahurica var. formosana concentrated in the general functional prediction only, posttranslational modification, protein turnover, chaperones signal transduction mechanisms. GO functional classification indicated that the genes of A. dahurica var. formosana concentrated on cell biological processes and components. KEGG analysis found that the A. dahurica var. formosana genes mostly involved in metabolic pathways. (3) 45 genes of BGLU family were identified in A. dahurica var. formosana. In this study, the whole genome of A. dahurica var. formosana is resolved by the third-generation sequencing technology for the first time, which lay a foundation for the systematic biological study of A. dahurica var. formosana , and is conducive to the further development and utilization. At the same time, the BGLU family genes were preliminarily analyzed, it also provides an important theoretical basis for the further study of the function of BGLU in the growth and development of A. dahurica var. formosana .