WinPCA: a package for windowed principal component analysis

L. Moritz Blumer, Jeffrey M. Good, Richard Durbin

Research output: Contribution to journalArticlepeer-review

Abstract

Summary: With chromosomal reference genomes and population-scale whole genome-sequencing becoming increasingly accessible, contemporary studies often include characterizations of the genomic landscape as it varies along chromosomes, commonly termed genome scans. While traditional summary statistics like FST and dXY between pre-assigned populations remain integral to characterizing the genomic divergence profile, PCA differs by providing single-sample resolution, thereby supporting the identification of polymorphic inversions, introgression and other types of divergent sequence that may not be fully aligned with global population structure. Here, we introduce WinPCA, a user-friendly package to compute, polarize and visualize genetic principal components in windows along the genome. To accommodate low-coverage whole genome-sequencing datasets, WinPCA can optionally make use of PCAngsd methods to compute principal components in a genotype likelihood framework. WinPCA accepts variant data in either VCF or BEAGLE format and can generate rich plots for interactive data exploration and downstream presentation.

Original languageEnglish
Article numberbtaf529
JournalBioinformatics
Volume41
Issue number10
Early online dateSep 22 2025
DOIs
StatePublished - Oct 2025

Keywords

  • Software
  • Principal Component Analysis/methods
  • Genomics/methods
  • Humans
  • Whole Genome Sequencing

Fingerprint

Dive into the research topics of 'WinPCA: a package for windowed principal component analysis'. Together they form a unique fingerprint.

Cite this