Genomic signatures of selection

Species evolve over time as individuals are born, reproduce and die. In the process, genomes are passed on from parents to offspring, modified by mutations and shuffled through recombination.

Constraints on the ability of individuals to survive in their environment, to choose their mates, or to produce surviving offspring shape the way genomes change over time.

In turn, patterns of diversity observed in present-day genomes carry information on these constraints, and therefore on the past evolutionary history of populations and on the biological processes that affect the way genomes are transmitted.

My work in population genomics aims to develop new ways of reconstructing how selection has affected the diversity observed in the genomes of (mostly) livestock species. We use modeling to interpret genetic data recorded on whole genomes in order to gain insight into the past history of populations.

Population differentiation

A figure with different panels: Genetic diversity of a region of chromosome 6 in sheep populations from France. This figure illustrates the loss of diversity, most likely due to selection, in a large panel of populations (top left) from the North (blue) and South (red) of France. Many populations have very low diversity in this part of their genome, with all individuals sharing the same ancestral chromosome (*e.g.* in blue on the top right panel). We developed methods to identify such regions, which create large signals along the genome (bottom).

As illustrated above, one way to identify genomic regions under selection is to look for regions where populations carry very different genetic backgrounds, more different than expected from their average, genome-wide patterns. Building upon the work of Maxime Bonhomme (Bonhomme et al. 2010), we extended this approach to work on haplotypes rather than point mutations (Fariello et al. 2013), which increased interpretability and statistical power. We later looked specifically for signatures associated with selection on particular phenotypes of local pig breeds from Europe (Poklukar et al. 2023). These methods, including Maxime’s FLK, are implemented in the hapFLK software. We are currently extending the hapFLK method to improve statistical inference on the dating of past selection events.
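To make the single-marker idea concrete, here is a minimal sketch of an FLK-style statistic, written from our reading of Bonhomme et al. (2010): allele frequencies in each population are compared with an estimated ancestral frequency, and deviations are scaled by a population kinship matrix that captures the genome-wide structure. This is not the hapFLK code. The function names and the toy numbers are hypothetical, and the kinship matrix is assumed to be known and symmetric, whereas in practice it has to be estimated from genome-wide data.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - tail) / m[r][r]
    return x


def flk_statistic(freqs, kinship):
    """Return (statistic, ancestral frequency estimate) for one marker.

    freqs   : allele frequency of the marker in each population
    kinship : symmetric population kinship matrix F (assumed known here)
    """
    ones = [1.0] * len(freqs)
    w = solve(kinship, ones)  # F^-1 1
    # Weighted estimate of the ancestral frequency: (1' F^-1 p) / (1' F^-1 1).
    p0 = sum(wi * pi for wi, pi in zip(w, freqs)) / sum(w)
    dev = [pi - p0 for pi in freqs]
    z = solve(kinship, dev)  # F^-1 (p - p0 1)
    # Quadratic form (p - p0 1)' [p0 (1 - p0) F]^-1 (p - p0 1).
    stat = sum(d * zi for d, zi in zip(dev, z)) / (p0 * (1.0 - p0))
    return stat, p0


# Toy example: populations 1 and 2 share ancestry, population 3 is more distant.
toy_kinship = [
    [0.10, 0.05, 0.00],
    [0.05, 0.10, 0.00],
    [0.00, 0.00, 0.15],
]
toy_freqs = [0.85, 0.30, 0.35]
stat, p0 = flk_statistic(toy_freqs, toy_kinship)
print(f"FLK-style statistic: {stat:.2f}, ancestral frequency: {p0:.2f}")
```

Under neutrality, a statistic of this kind is approximately chi-square distributed with one degree of freedom fewer than the number of populations, so unusually large values point to markers that are more differentiated than the genome-wide structure predicts. The haplotype-based extension replaces single-marker frequencies with frequencies of local haplotype clusters, which this sketch does not attempt.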

Genetic Time series

A figure whose top panel shows the trajectory of an allele frequency through time and whose bottom panel shows the dependency graph of an HMM: Hidden Markov Model for inference on genetic time series

With the increased availability of dense genetic data, a new kind of information can be taken into account: time. From ancient DNA studies, experimental evolution or continuous genetic surveys of natural and domestic populations, we now have access to the evolution of the genetic composition of groups of individuals over time, that is, genetic time series.

We have worked on one particular class of statistical models to analyse such data: Hidden Markov Models (HMMs). The objective of these analyses is to estimate, for each allele along the genome, a selection coefficient that explains the trajectory of its frequency over time. In (Paris, Servin, and Boitard 2019), we compared different models to approximate the transition kernels in such HMMs and found that the Beta-with-Spikes approximation performed best. Since then, we have been working on using HMMs to improve inference on both the effective population size and selection coefficients. These methods are implemented in the SelNeTime software (Uhl et al. 2025).
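The structure of such an HMM can be sketched in a few lines: the hidden state is the allele count in the population, the transition is one generation of Wright-Fisher reproduction with selection, and the emission is binomial sampling of the individuals genotyped at each time point. The sketch below makes several simplifying assumptions that are ours for illustration only: it uses the exact Wright-Fisher transition matrix for a small population rather than the Beta-with-Spikes approximation, a genic selection model, a known population size, a uniform prior on the initial count, and a plain grid search over the selection coefficient. It is not the SelNeTime API, and all names and data are hypothetical.

```python
import math


def binom_pmf(k, n, p):
    """Binomial probability of k successes out of n, safe at p = 0 and p = 1."""
    if p <= 0.0:
        return 1.0 if k == 0 else 0.0
    if p >= 1.0:
        return 1.0 if k == n else 0.0
    return math.comb(n, k) * p**k * (1.0 - p) ** (n - k)


def wf_transition(n_chrom, s):
    """One-generation Wright-Fisher transition matrix over counts 0..n_chrom.

    Assumes genic selection: frequency x becomes x(1+s) / (1+s*x) before
    binomial resampling of n_chrom chromosomes.
    """
    rows = []
    for i in range(n_chrom + 1):
        x = i / n_chrom
        x_sel = x * (1.0 + s) / (1.0 + s * x)
        rows.append([binom_pmf(j, n_chrom, x_sel) for j in range(n_chrom + 1)])
    return rows


def log_likelihood(samples, n_chrom, s):
    """Forward-algorithm log-likelihood of the observed time series.

    samples : list of (generation, allele_count, sample_size), sorted by time
    """
    trans = wf_transition(n_chrom, s)
    n_states = n_chrom + 1
    alpha = [1.0 / n_states] * n_states  # uniform prior (an assumption)
    loglik = 0.0
    prev_gen = samples[0][0]
    for gen, k, n in samples:
        # Propagate the hidden allele count through the elapsed generations.
        for _ in range(gen - prev_gen):
            alpha = [
                sum(alpha[i] * trans[i][j] for i in range(n_states))
                for j in range(n_states)
            ]
        prev_gen = gen
        # Weight each state by the probability of the observed sample.
        alpha = [a * binom_pmf(k, n, j / n_chrom) for j, a in enumerate(alpha)]
        total = sum(alpha)
        loglik += math.log(total)
        alpha = [a / total for a in alpha]  # rescale to avoid underflow
    return loglik


def estimate_s(samples, n_chrom, grid):
    """Return the selection coefficient on the grid with the highest likelihood."""
    return max(grid, key=lambda s: log_likelihood(samples, n_chrom, s))


# Made-up data: an allele sampled at generations 0, 10 and 20 (20 chromosomes each).
toy_samples = [(0, 2, 20), (10, 8, 20), (20, 16, 20)]
toy_grid = [round(-0.2 + 0.05 * i, 2) for i in range(13)]  # -0.2 to 0.4
print("grid estimate of s:", estimate_s(toy_samples, 50, toy_grid))
```

The exact transition matrix grows quadratically with the population size and has to be applied once per generation, which is why parametric approximations of the transition kernel matter for realistic population sizes and genome-wide data.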

References

Bonhomme, Maxime, Claude Chevalet, Bertrand Servin, Simon Boitard, Jihad Abdallah, Sarah Blott, and Magali SanCristobal. 2010. “Detecting Selection in Population Trees: The Lewontin and Krakauer Test Extended.” Genetics 186 (1): 241–62. https://doi.org/10.1534/genetics.110.117275.

Fariello, Maria Inés, Simon Boitard, Hugo Naya, Magali SanCristobal, and Bertrand Servin. 2013. “Detecting Signatures of Selection Through Haplotype Differentiation Among Hierarchically Structured Populations.” Genetics 193 (3): 929–41.

Paris, Cyriel, Bertrand Servin, and Simon Boitard. 2019. “Inference of Selection from Genetic Time Series Using Various Parametric Approximations to the Wright-Fisher Model.” G3: Genes, Genomes, Genetics 9 (12): 4073–86. https://doi.org/10.1534/g3.119.400778.

Poklukar, Klavdija, Camille Mestre, Martin Škrlep, Marjeta Čandek-Potokar, Cristina Ovilo, Luca Fontanesi, Juliette Riquet, et al. 2023. “A Meta-Analysis of Genetic and Phenotypic Diversity of European Local Pig Breeds Reveals Genomic Regions Associated with Breed Differentiation for Production Traits.” Genetics Selection Evolution 55 (1): 88. https://doi.org/10.1186/s12711-023-00858-3.

Uhl, Mathieu, Paul Bunel, Miguel de Navascués, Simon Boitard, and Bertrand Servin. 2025. “SelNeTime: A Python Package Inferring Effective Population Size and Selection Intensity from Genomic Time Series Data.” bioRxiv, 2024.11.06.622284. https://doi.org/10.1101/2024.11.06.622284.