Expanding the cattle reference graph genome
Abstract
Recent studies have highlighted several key advantages of graph genomes over standard linear reference genomes, including improved read mapping rates at divergent loci and more accurate calling of structural variants. However, the availability of graph genomes that represent the extensive diversity of livestock species remains limited. To address this limitation, we have incorporated 15 cattle genomes into an expanded cattle graph genome, including 8 novel high-quality assemblies from 4 divergent breeds (Holstein-Friesian, N'Dama, Boran and Nelore), each with high contiguity (N50 > 10 Mb). This graph genome incorporates over 250 Mb (9.5%) of novel sequence across the primary chromosomes, providing a better reference representation of the bovine pangenome and a key resource for the livestock community.
Citation
Talenti, A., Powell, J., Wragg, D., Paxton, E., Chepkwony, M., Miyunga, A., Njeru, R., Hemmink, J.D., Fisch, A., Ferreira, B.R., Hammond, J.A., Archibald, A.L., Toye, P., Connelley, T., Morrison, L. and Prendergast, J. 2023. Expanding the cattle reference graph genome. IN: Veerkamp, R.F. and Haas, Y. de. (eds), Proceedings of the 12th World Congress on Genetics Applied to Livestock Production (WCGALP): Technical and species orientated innovations in animal breeding, and contribution of genetics to solving societal challenges. Wageningen, the Netherlands: Wageningen Academic Publishers: 1737-1740.