TY - JOUR
T1 - High performance computation of landscape genomic models including local indicators of spatial association
AU - The NEXTGEN Consortium
AU - Stucki, S.
AU - Orozco-terWengel, P.
AU - Forester, B. R.
AU - Duruz, S.
AU - Colli, L.
AU - Masembe, C.
AU - Negrini, R.
AU - Landguth, E.
AU - Jones, M. R.
AU - Bruford, M. W.
AU - Taberlet, P.
AU - Joost, S.
N1 - Publisher Copyright:
© 2016 The Authors. Molecular Ecology Resources Published by John Wiley & Sons Ltd.
PY - 2017/9
Y1 - 2017/9
N2 - With the increasing availability of both molecular and topo-climatic data, the main challenges facing landscape genomics – that is the combination of landscape ecology with population genomics – include processing large numbers of models and distinguishing between selection and demographic processes (e.g. population structure). Several methods address the latter, either by estimating a null model of population history or by simultaneously inferring environmental and demographic effects. Here we present samβada, an approach designed to study signatures of local adaptation, with special emphasis on high performance computing of large-scale genetic and environmental data sets. samβada identifies candidate loci using genotype–environment associations while also incorporating multivariate analyses to assess the effect of many environmental predictor variables. This enables the inclusion of explanatory variables representing population structure into the models to lower the occurrences of spurious genotype–environment associations. In addition, samβada calculates local indicators of spatial association for candidate loci to provide information on whether similar genotypes tend to cluster in space, which constitutes a useful indication of the possible kinship between individuals. To test the usefulness of this approach, we carried out a simulation study and analysed a data set from Ugandan cattle to detect signatures of local adaptation with samβada, bayenv, lfmm and an FST outlier method (FDIST approach in arlequin) and compare their results. samβada – an open source software for Windows, Linux and Mac OS X available at http://lasig.epfl.ch/sambada – outperforms other approaches and better suits whole-genome sequence data processing.
AB - With the increasing availability of both molecular and topo-climatic data, the main challenges facing landscape genomics – that is the combination of landscape ecology with population genomics – include processing large numbers of models and distinguishing between selection and demographic processes (e.g. population structure). Several methods address the latter, either by estimating a null model of population history or by simultaneously inferring environmental and demographic effects. Here we present samβada, an approach designed to study signatures of local adaptation, with special emphasis on high performance computing of large-scale genetic and environmental data sets. samβada identifies candidate loci using genotype–environment associations while also incorporating multivariate analyses to assess the effect of many environmental predictor variables. This enables the inclusion of explanatory variables representing population structure into the models to lower the occurrences of spurious genotype–environment associations. In addition, samβada calculates local indicators of spatial association for candidate loci to provide information on whether similar genotypes tend to cluster in space, which constitutes a useful indication of the possible kinship between individuals. To test the usefulness of this approach, we carried out a simulation study and analysed a data set from Ugandan cattle to detect signatures of local adaptation with samβada, bayenv, lfmm and an FST outlier method (FDIST approach in arlequin) and compare their results. samβada – an open source software for Windows, Linux and Mac OS X available at http://lasig.epfl.ch/sambada – outperforms other approaches and better suits whole-genome sequence data processing.
KW - environmental correlations
KW - genome scans
KW - high performance computing
KW - landscape genomics
KW - local adaptation
KW - spatial autocorrelation
UR - http://www.scopus.com/inward/record.url?scp=85006098450&partnerID=8YFLogxK
U2 - 10.1111/1755-0998.12629
DO - 10.1111/1755-0998.12629
M3 - Article
C2 - 27801969
AN - SCOPUS:85006098450
SN - 1755-098X
VL - 17
SP - 1072
EP - 1089
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
IS - 5
ER -