Genotype imputation in the domestic dog

S. G. Friedenberg, K. M. Meurs

Research output: Contribution to journalArticlepeer-review

38 Scopus citations

Abstract

Application of imputation methods to accurately predict a dense array of SNP genotypes in the dog could provide an important supplement to current analyses of array-based genotyping data. Here, we developed a reference panel of 4,885,283 SNPs in 83 dogs across 15 breeds using whole genome sequencing. We used this panel to predict the genotypes of 268 dogs across three breeds with 84,193 SNP array-derived genotypes as inputs. We then (1) performed breed clustering of the actual and imputed data; (2) evaluated several reference panel breed combinations to determine an optimal reference panel composition; and (3) compared the accuracy of two commonly used software algorithms (Beagle and IMPUTE2). Breed clustering was well preserved in the imputation process across eigenvalues representing 75 % of the variation in the imputed data. Using Beagle with a target panel from a single breed, genotype concordance was highest using a multi-breed reference panel (92.4 %) compared to a breed-specific reference panel (87.0 %) or a reference panel containing no breeds overlapping with the target panel (74.9 %). This finding was confirmed using target panels derived from two other breeds. Additionally, using the multi-breed reference panel, genotype concordance was slightly higher with IMPUTE2 (94.1 %) compared to Beagle; Pearson correlation coefficients were slightly higher for both software packages (0.946 for Beagle, 0.961 for IMPUTE2). Our findings demonstrate that genotype imputation from SNP array-derived data to whole genome-level genotypes is both feasible and accurate in the dog with appropriate breed overlap between the target and reference panels.

Original languageEnglish (US)
Pages (from-to)485-494
Number of pages10
JournalMammalian Genome
Volume27
Issue number9-10
DOIs
StatePublished - Oct 1 2016

Bibliographical note

Funding Information:
SGF is supported by a National Institutes of Health T32 training award (5T32OD011130-07). Funding for whole genome sequencing was provided in part by the Poodle Club of America Foundation and the American Kennel Club Canine Health Foundation. Some whole genome sequencing data were graciously contributed by Drs. Leigh Anne Clark (13 dogs), Natasha J. Olby and Theirry Olivry (11 dogs), and Joshua A. Stern (2 dogs).

Publisher Copyright:
© 2016, Springer Science+Business Media New York.

Fingerprint

Dive into the research topics of 'Genotype imputation in the domestic dog'. Together they form a unique fingerprint.

Cite this