Dog10K: A Genomic Breakthrough in Canid Diversity

Research Study Chiang Mai, Thailand, December 29, 2025Meadows et al. (2023) presented genome sequencing of 1,987 canids, offering a transformative resource for studying demography, genome function, architecture, and canine domestication.

Published in Genome Biology, this study marks a major milestone from the Dog10K international consortium, whose goal is to generate high-quality genomic data from thousands of dogs and wild canids. The dataset integrates 20× whole-genome coverage from 1611 dogs across 321 breeds, 309 village dogs, 63 wolves, and four coyotes, providing a dense map of genomic variation across domestic and wild lineages.

Researchers identified more than 48 million single-nucleotide, indel, and structural variants across the dog genome, spanning autosomes, the X chromosome, and mitochondrial DNA. More than 75% of variation present in 239 sampled breeds was newly discovered, filling critical gaps in breed-level genomic knowledge.

Allele-sharing analyses revealed that 94.9% of dog breeds form monophyletic clusters, organizing into 25 major clades. Notably, German Shepherd Dogs exhibited unusually high allele sharing with independent breeds from multiple clades, reflecting their global influence and genetic diversity.

Structural variation was also extensive: compared with the reference genome (UU_Cfam_GSD_1.0), each breed dog had on average 26,960 deletions and 14,034 insertions greater than 50 bp. Wolves carried approximately 14% more structural variants, highlighting their broader genomic diversity. The team additionally identified retrogene insertions from 926 parent genes, expanding understanding of genomic innovation in canids.

To support functional interpretation, variants were annotated with SnpEff and Zoonomia phyloP constraint scores. Highly constrained positions—those evolutionarily conserved across mammals—showed lower allele frequencies, aiding researchers in pinpointing mutations likely to affect gene function, behavior, and disease susceptibility.

Finally, the Dog10K dataset proved to be a powerful imputation reference panel, enabling accurate genotype prediction across multiple platforms, even for breeds not directly represented in the dataset. This opens the door to high-resolution genomic studies in working dogs, village populations, and heritage breeds alike.

Source: Meadows, J., Kidd, J., et al. (2023). Genome sequencing of 2000 canids by the Dog10K consortium advances the understanding of demography, genome function and architecture. Genome Biology. Published August 15, 2023.

zoeta-dogsoul-logo

Contact

50130 Chiang Mai
Thailand

Trainer Knowledge Base
Email-Contact

App Roadmap

Connect

Google-Reviews

📄 Published whitepaper: The Invisible Leash, Aggression in Multiple Dog Households, Instinct Interrupted & Boredom–Frustration–Aggression Pipeline, NeuroBond Method

DOI DOIDOI DOI DOI

Subscribe

Join our email list to receive the latest updates.

AI Knowledge Hub: Behavior Framework Source

Dogsoul AI Assistant
Chat
Ask Zoeta Dogsoul