Olivier Francois

BCM - Computational and Mathematical Biology

TESS / POPS
sNMF
LFMM
LEA

Olivier François, PhD

Computer programs

TESS is a computer program that implements a Bayesian clustering algorithm for spatial population genetics. Is it particularly useful for seeking genetic barriers or genetic discontinuities in continuous populations. The method is based on a hierarchical mixture model where the prior distribution on cluster labels is defined as a Hidden Markov Random Field. Given individual geographical locations, the program seeks population structure from multilocus genotypes without assuming predefined populations. TESS takes input data files in a format compatible to existing non-spatial Bayesian algorithms (e.g. STRUCTURE). It returns graphical displays of cluster membership probabilities and geographical cluster assignments from its Graphical User Interface.

POPS The POPS program performs inference of ancestry distribution models. It uses a TESS-like interface to compute individual cluster membership and admixture proportions based on multilocus genotype data and their correlation with environmental and geographical variables. Similarly to species distribution models, POPS provides routines to project cluster memberships and admixture proportions under scenarios of environmental change. Typical uses of POPS are for evaluating how the population genetic structure of a species could be modified by climate change, or testing hypotheses about local adaptation and ecological speciation.

LFMM: Adaptation to local environments often occurs through natural selection acting on large number of alleles, each having a weak phenotypic effect. One way to detect those alleles is by identifying genetic polymorphisms that exhibit high correlation with some environmental gradient or with the variables used as proxies for ecological pressures. The LFM computer program includes an integrated framework based on population genetics, ecological modeling and statistical techniques for screening genomes for signatures of local adaptation. It implements fast algorithms using latent factor mixed models based on a variant of Bayesian principal component analysis in which residual population structure is introduced via unobserved factors.

sNMF: Inference of population structure using sparse non-negative matrix factorization algorithms.

abc is an R contributed package. Approximate Bayesian computation (ABC) is devoted to the analysis of complex models. The R package abc that implements several ABC algorithms for performing parameter estimation and model selection.

apTreeshape (Windows package) is an R contributed package dedicated to simulation and analysis of phylogenetic tree topologies using statistical indices. It is a companion library of the 'ape' package. It provides additional functions for reading, plotting, manipulating phylogenetic trees. It also offers convenient web-access to public databases, and enables testing null models of macroevolution using corrected test statistics. Trees of class "phylo" from the 'ape' package can be converted easily into a 'treeshape' format. Also contains new diversification rate shift tests.

LEA is an R package dedicated to landscape genomics and ecological association tests. LEA can run analyses of population structure and genome scans for local adaptation. It includes statistical methods for estimating ancestry coefficients from large genotypic matrices and evaluating the number of ancestral populations (snmf, pca); and identifying genetic polymorphisms that exhibit high correlation with some environmental gradient or with the variables used as proxies for ecological pressures (lfmm), and controlling the false discovery rate. LEA is mainly based on optimized C programs that can scale with the dimension of very large data sets.