TY - JOUR
T1 - Similarity-Based Analysis of Allele Frequency Distribution among Multiple Populations Identifies Adaptive Genomic Structural Variants
AU - Saitou, Marie
AU - Masuda, Naoki
AU - Gokcumen, Omer
N1 - Publisher Copyright:
© 2022 The Author(s). Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
PY - 2022/3/1
Y1 - 2022/3/1
N2 - Structural variants have a considerable impact on human genomic diversity. However, their evolutionary history remains mostly unexplored. Here, we developed a new method to identify potentially adaptive structural variants based on a similarity-based analysis that incorporates genotype frequency data from 26 populations simultaneously. Using this method, we analyzed 57,629 structural variants and identified 576 structural variants that show unusual population differentiation. Of these putatively adaptive structural variants, we further showed that 24 variants are multiallelic and overlap with coding sequences, and 20 variants are significantly associated with GWAS traits. Closer inspection of the haplotypic variation associated with these putatively adaptive and functional structural variants reveals deviations from neutral expectations due to: 1) population differentiation of rapidly evolving multiallelic variants, 2) incomplete sweeps, and 3) recent population-specific negative selection. Overall, our study provides new methodological insights, documents hundreds of putatively adaptive variants, and introduces evolutionary models that may better explain the complex evolution of structural variants.
AB - Structural variants have a considerable impact on human genomic diversity. However, their evolutionary history remains mostly unexplored. Here, we developed a new method to identify potentially adaptive structural variants based on a similarity-based analysis that incorporates genotype frequency data from 26 populations simultaneously. Using this method, we analyzed 57,629 structural variants and identified 576 structural variants that show unusual population differentiation. Of these putatively adaptive structural variants, we further showed that 24 variants are multiallelic and overlap with coding sequences, and 20 variants are significantly associated with GWAS traits. Closer inspection of the haplotypic variation associated with these putatively adaptive and functional structural variants reveals deviations from neutral expectations due to: 1) population differentiation of rapidly evolving multiallelic variants, 2) incomplete sweeps, and 3) recent population-specific negative selection. Overall, our study provides new methodological insights, documents hundreds of putatively adaptive variants, and introduces evolutionary models that may better explain the complex evolution of structural variants.
KW - Complex traits
KW - Copy number variation
KW - Denisovan
KW - Neutrality test
KW - Population genetics
UR - http://www.scopus.com/inward/record.url?scp=85125882656&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85125882656&partnerID=8YFLogxK
U2 - 10.1093/molbev/msab313
DO - 10.1093/molbev/msab313
M3 - Article
C2 - 34718708
AN - SCOPUS:85125882656
SN - 0737-4038
VL - 39
JO - Molecular Biology and Evolution
JF - Molecular Biology and Evolution
IS - 3
M1 - msab313
ER -