Published in Genome Biology, this study marks a major milestone from the Dog10K international consortium, whose goal is to generate high-quality genomic data from thousands of dogs and wild canids. The dataset integrates 20× whole-genome coverage from 1611 dogs across 321 breeds, 309 village dogs, 63 wolves, and four coyotes, providing a dense map of genomic variation across domestic and wild lineages.
Researchers identified more than 48 million single-nucleotide, indel, and structural variants across the dog genome, spanning autosomes, the X chromosome, and mitochondrial DNA. More than 75% of variation present in 239 sampled breeds was newly discovered, filling critical gaps in breed-level genomic knowledge.
Allele-sharing analyses revealed that 94.9% of dog breeds form monophyletic clusters, organizing into 25 major clades. Notably, German Shepherd Dogs exhibited unusually high allele sharing with independent breeds from multiple clades, reflecting their global influence and genetic diversity.
Structural variation was also extensive: compared with the reference genome (UU_Cfam_GSD_1.0), each breed dog had on average 26,960 deletions and 14,034 insertions greater than 50 bp. Wolves carried approximately 14% more structural variants, highlighting their broader genomic diversity. The team additionally identified retrogene insertions from 926 parent genes, expanding understanding of genomic innovation in canids.
To support functional interpretation, variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Highly constrained positions—those evolutionarily conserved across mammals—showed lower allele frequencies, aiding researchers in pinpointing mutations likely to affect gene function, behavior, and disease susceptibility.
Finally, the Dog10K dataset proved to be a powerful imputation reference panel, enabling accurate genotype prediction across multiple platforms, even for breeds not directly represented in the dataset. This opens the door to high-resolution genomic studies in working dogs, village populations, and heritage breeds alike.
Source: Meadows, J., Kidd, J., et al. (2023). Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. Genome Biology. Published August 15, 2023.







