Investigating the performance of different methods in the selection of causative SNP markers on the breed differentiation of horses
The present study was conducted in order to select effective markers in breed discrimination and compare the performance of SNP marker selection methods with the data of 304 animals from 14 different breeds that were genotyped using the Illumina SNP50K marker panel. Knowledge of genetic structure are very important for better understanding of genetic changes in genomic studies. The information content of each marker is used as an index for selecting markers in reducing the size of marker panels. To estimate the information content of each marker, the following selection methods were used: Fst (pairwise & global), Theta, Delta, D, Gst, G'st, G"st and Principal Component Analysis. In this study, the logarithm of the likelihood ratio was used to select markers. According to the results, all selection methods for identifying markers had similar performance. The number of common markers between the methods was at least 42 markers and at most 499 SNP markers. In general, the F_ST statistical method required a smaller number of markers to achieve a successful assignment. G'st and G"st statistics showed poor performance with more than 350 markers to achieve 95% correct assignment. It should be noted that with only the top 60 selected markers, it is possible to achieve a success rate of more than 70%. According to the results, Wright's paired Fst had better performance than other SNP selection methods. The obtained results lead to the creation of exclusive panels to identify various breeds, which have great economic importance.