Overview
clam is a command-line tool for population genomics that provides an efficient workflow for calculating accurate population genetic statistics. Instead of requiring an all-sites VCF (which can be prohibitively large), clam uses a two-step process:
- First, identify genomic regions with sufficient sequencing depth to be considered "callable" using
clam loci
- Then, calculate population genetic statistics using these callable regions and a VCF containing variants with
clam stat
This approach eliminates the need for generating all-sites VCF files while still producing accurate diversity estimates, making it particularly suitable for large population genomics datasets.