Whole-genome sequencing and SNP analysis of Thai Cannabis sativa cultivar ‘Hang Kra Rog Phu Phan’ (Cannabaceae)
Abstract
Abstract. Kamoltham T, Luangpirom N, Kuamsab N, Kummalue T, Chaiphongpachara T. 2025. Whole-genome sequencing and SNP analysis of Thai Cannabis sativa cultivar ‘Hang Kra Rog Phu Phan’ (Cannabaceae). Biodiversitas 26: 4946-4953. Cannabis (Cannabis sativa) is a herbaceous plant valued for its medicinal and therapeutic uses. In Thailand, the indigenous cultivar ‘Hang Kra Rog Phu Phan’ has long been used in traditional medicine and is recognized for its high tetrahydrocannabinol (THC) content. This study provides the first genomic characterization of this traditional Thai cultivar using whole-genome sequencing (WGS). Fresh leaf samples were collected from a licensed cultivation site, and paired-end libraries were prepared with the Illumina TruSeq DNA PCR-Free Kit. Sequencing on the NovaSeq 6000 platform produced 63.8 million raw reads (9.57 Gb), of which 63.3 million high-quality reads (9.50 Gb) remained after trimming. The clean reads aligned to the ‘Pink Pepper’ reference genome at a rate of 94.43%, with an average depth of 11.39×. Variant calling identified 23.0 million genomic variants, comprising 18.5 million SNPs and 4.5 million indels; 6.04 million high-confidence SNPs were retained after stringent filtering. Phylogenetic and principal component analyses revealed unexpected genomic proximity between ‘Hang Kra Rog Phu Phan’ and the CBD-dominant cultivar ‘CBD Shark’, highlighting its distinct lineage among high-THC cultivars. These findings provide genomic resources for cultivar authentication, marker-assisted breeding, conservation of native Thai germplasm, and functional genomics aimed at advancing cannabis-based therapeutics.
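The sequencing and variant counts in the abstract can be cross-checked with simple arithmetic. The sketch below is illustrative only: the input figures are copied from the abstract, while the constant names, the `summarize` helper, and the derived ratios are assumptions introduced here and are not values reported by the authors.

```python
# Back-of-envelope consistency check of the figures quoted in the abstract.
# Inputs are taken from the abstract; every derived value is an
# illustrative calculation, not a result reported by the authors.

RAW_READS = 63.8e6        # raw reads
RAW_BASES = 9.57e9        # raw yield in bases (9.57 Gb)
CLEAN_READS = 63.3e6      # reads remaining after trimming
CLEAN_BASES = 9.50e9      # clean yield in bases (9.50 Gb)
SNPS = 18.5e6             # SNPs called
INDELS = 4.5e6            # indels called
TOTAL_VARIANTS = 23.0e6   # total variants called
HIGH_CONF_SNPS = 6.04e6   # SNPs retained after filtering


def summarize():
    """Return derived ratios (hypothetical helper, not from the paper)."""
    return {
        # Mean read length implied by yield divided by read count.
        "raw_read_length_bp": RAW_BASES / RAW_READS,
        "clean_read_length_bp": CLEAN_BASES / CLEAN_READS,
        # Fraction of reads surviving quality trimming.
        "read_retention": CLEAN_READS / RAW_READS,
        # Fraction of called SNPs that pass the stringent filter.
        "snp_retention": HIGH_CONF_SNPS / SNPS,
        # Should be zero if SNPs and indels add up to the stated total.
        "variant_sum_gap": SNPS + INDELS - TOTAL_VARIANTS,
    }


summary = summarize()
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

Under these inputs, the implied mean read length is about 150 bp, roughly 99% of reads survive trimming, and about one third of the called SNPs pass the high-confidence filter. The rounded figures in the abstract limit the precision of these ratios.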
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.