TY - JOUR
T1 - A review
T2 - simulation tools for genome-wide interaction studies
AU - Shang, Junliang
AU - Xu, Anqi
AU - Bi, Mingyuan
AU - Zhang, Yuanyuan
AU - Li, Feng
AU - Liu, Jin Xing
N1 - Publisher Copyright:
© The Author(s) 2024. Published by Oxford University Press. All rights reserved.
PY - 2024/11/1
Y1 - 2024/11/1
N2 - Genome-wide association study (GWAS) is essential for investigating the genetic basis of complex diseases; nevertheless, it usually ignores the interaction of multiple single nucleotide polymorphisms (SNPs). Genome-wide interaction studies provide crucial means for exploring complex genetic interactions that GWAS may miss. Although many interaction methods have been proposed, challenges still persist, including the lack of epistasis models and the inconsistency of benchmark datasets. SNP data simulation is a pivotal intermediary between interaction methods and real applications. Therefore, it is important to obtain epistasis models and benchmark datasets by simulation tools, which is helpful for further improving interaction methods. At present, many simulation tools have been widely employed in the field of population genetics. According to their basic principles, these existing tools can be divided into four categories: coalescent simulation, forward-time simulation, resampling simulation, and other simulation frameworks. In this paper, their basic principles and representative simulation tools are compared and analyzed in detail. Additionally, this paper provides a discussion and summary of the advantages and disadvantages of these frameworks and tools, offering technical insights for the design of new methods, and serving as valuable reference tools for researchers to comprehensively understand GWAS and genome-wide interaction studies.
AB - Genome-wide association study (GWAS) is essential for investigating the genetic basis of complex diseases; nevertheless, it usually ignores the interaction of multiple single nucleotide polymorphisms (SNPs). Genome-wide interaction studies provide crucial means for exploring complex genetic interactions that GWAS may miss. Although many interaction methods have been proposed, challenges still persist, including the lack of epistasis models and the inconsistency of benchmark datasets. SNP data simulation is a pivotal intermediary between interaction methods and real applications. Therefore, it is important to obtain epistasis models and benchmark datasets by simulation tools, which is helpful for further improving interaction methods. At present, many simulation tools have been widely employed in the field of population genetics. According to their basic principles, these existing tools can be divided into four categories: coalescent simulation, forward-time simulation, resampling simulation, and other simulation frameworks. In this paper, their basic principles and representative simulation tools are compared and analyzed in detail. Additionally, this paper provides a discussion and summary of the advantages and disadvantages of these frameworks and tools, offering technical insights for the design of new methods, and serving as valuable reference tools for researchers to comprehensively understand GWAS and genome-wide interaction studies.
KW - coalescent simulation
KW - forward-time simulation
KW - genome-wide association study
KW - resampling simulation
KW - simulation tools
KW - single nucleotide polymorphism
UR - https://www.scopus.com/pages/publications/85212457107
U2 - 10.1093/bfgp/elae034
DO - 10.1093/bfgp/elae034
M3 - 文献综述
C2 - 39173096
AN - SCOPUS:85212457107
SN - 2041-2649
VL - 23
SP - 745
EP - 753
JO - Briefings in Functional Genomics
JF - Briefings in Functional Genomics
IS - 6
ER -