Arlequin

Written by

in

Detecting Loci Under Selection: Methods Implemented in Arlequin

Understanding the genetic basis of adaptation requires identifying specific regions of the genome that have been targeted by natural selection. As populations adapt to new environments, advantageous alleles increase in frequency, leaving a signature of selection—such as reduced genetic diversity or elevated population differentiation—that contrasts with the background of neutral genetic drift.

⁠Arlequin (version 3.0) has long been a staple in population genetics for analyzing genetic structure and detecting these signatures. Specifically, Arlequin 3.5 introduced enhanced capabilities for identifying loci under selection based on ⁠hierarchical FSTcap F sub cap S cap T end-sub

statistics, providing a powerful, simulation-based approach for both model and non-model organisms. The Core Approach: Outlier Detection

Arlequin focuses on identifying outlier loci—genetic markers that show significantly higher differentiation between populations than would be expected under neutral evolution. 1. The Hierarchical Island Model

The primary method implemented in Arlequin 3.5 for detecting selection is based on the approach proposed by Excoffier et al. (2009). This method uses a hierarchical island model to simulate the distribution of FSTcap F sub cap S cap T end-sub

Hierarchical Structure: It accounts for complex population structures, such as populations grouped within different regions, rather than assuming a simple, flat landscape.

Simulation-Based: Arlequin performs extensive coalescent simulations to generate a “neutral” distribution of FSTcap F sub cap S cap T end-sub

values, given the observed genetic diversity and demographic history (e.g., migration rates, population size changes). Identifying Outliers: By comparing the observed FSTcap F sub cap S cap T end-sub

of each locus against this simulated neutral distribution, the software identifies markers with significantly high differentiation, flagging them as potential targets of divergent selection. 2. Advantages of the Arlequin Approach

Robust to Demographic History: By simulating the neutral distribution based on the data’s specific demographic parameters, this approach reduces the rate of false positives often caused by population structure or bottlenecks, which can mimic selective sweeps.

Applicability: The method can be applied to various types of genetic data, including SNPs, microsatellites, and DNA sequence data, making it versatile for ⁠RADseq data.

Software Integration: Arlequin seamlessly integrates these selection tests with other analyses like AMOVA (Analysis of Molecular Variance), allowing researchers to validate their findings. Practical Application: Running the Analysis

Detecting selection in Arlequin involves setting up a specific population structure and running the simulation to find candidates.

Data Preparation: Convert genetic data (e.g., VCF format) into Arlequin’s .arp format using tools like PGDSpider.

Model Configuration: Define the hierarchy of populations (e.g., subpopulations within regions) within the settings.

Simulation & Analysis: Run the hierarchical analysis. Arlequin computes pairwise FSTcap F sub cap S cap T end-sub and runs simulations.

Identifying Outliers: Check the output file for loci that deviate significantly from the simulated distribution. Conclusion

Detecting loci under selection is crucial for evolutionary biology. By leveraging hierarchical FSTcap F sub cap S cap T end-sub

simulations, Arlequin offers a robust, well-established method for distinguishing natural selection from demographic history. As high-throughput sequencing data becomes more common, tools that accurately handle complex population structures, like Arlequin, remain essential for distinguishing neutral genetic drift from adaptive evolution.

Disclaimer: Ensure your input data is clean, and the population hierarchy is correctly defined to prevent, as discussed in this Biostars forum, invalid migration rate errors.

If you can tell me the type of markers you’re using (e.g., SNPs, microsatellites) and your population structure, I can give you more specific advice on setting up your Arlequin input file. Arlequin 3.5 – What’s new – Population Genetics

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *