What Is Beta Diversity?
Beta diversity measures how different biological communities are from one another. If alpha diversity describes diversity within one site, beta diversity describes species turnover between sites. In ecology, conservation biology, microbiome science, and environmental monitoring, beta diversity is used to quantify changes in composition across habitats, treatments, gradients, or time.
When beta diversity is low, communities share many of the same species and often similar relative abundances. When beta diversity is high, communities are compositionally distinct. This distinction makes beta diversity one of the most practical tools for answering questions such as: Which treatment changed the microbiome the most? Which habitat transition has the sharpest species replacement? Which sampling sites are most ecologically unique?
Core Beta Diversity Formulas
1) Jaccard Dissimilarity (presence-absence)
Define:
A= number of shared speciesB= species present only in Community 1C= species present only in Community 2
Jaccard similarity is A / (A + B + C). Jaccard dissimilarity is:
βjac = 1 - A/(A+B+C)
This metric ignores abundance and only tracks shared versus non-shared species.
2) Sørensen Dissimilarity (presence-absence)
Sørensen gives more weight to shared species in its similarity form:
Sørensen similarity = 2A / (2A + B + C)
So dissimilarity becomes:
βsor = 1 - 2A/(2A+B+C)
Compared with Jaccard, it can be slightly less harsh when overlap exists.
3) Bray-Curtis Dissimilarity (abundance-based)
For abundance vectors x and y over species i:
BC = Σ|xᵢ - yᵢ| / Σ(xᵢ + yᵢ)
Bray-Curtis captures compositional shifts in counts, not just presence-absence. It is widely used in community ecology and microbiome analysis.
4) Whittaker Beta Diversity (richness partitioning)
For two communities:
βW = γ / ᾱ - 1
γ= total richness across combined communitiesᾱ= mean alpha richness of communities
This is useful when discussing turnover as richness differentiation among sites.
Step-by-Step: How to Calculate Beta Diversity
- Build a species-by-site table with consistent taxonomic labels.
- Select two communities (or many pairwise combinations).
- Choose whether your analysis is incidence-based or abundance-based.
- For incidence metrics, convert positive abundance to presence
1and zero to absence0. - Compute shared and unique species counts (
A,B,C) for Jaccard or Sørensen. - For abundance-aware analysis, compute Bray-Curtis from raw or transformed abundance.
- Interpret values on a 0–1 scale: values near 1 indicate greater turnover.
Worked Example
Suppose two forest plots have the following abundances for five species:
- Plot A:
12, 0, 5, 2, 9 - Plot B:
7, 3, 0, 2, 8
Presence-absence representation:
- A present: species 1, 3, 4, 5 (richness S1 = 4)
- B present: species 1, 2, 4, 5 (richness S2 = 4)
Shared species (A) = 3 (species 1, 4, 5). Unique to A (B) = 1 (species 3). Unique to B (C) = 1 (species 2).
- Jaccard dissimilarity =
1 - 3/(3+1+1) = 0.4 - Sørensen dissimilarity =
1 - 2*3/(2*3+1+1) = 0.25
For Bray-Curtis:
Σ|xᵢ-yᵢ| = |12-7|+|0-3|+|5-0|+|2-2|+|9-8| = 14Σ(xᵢ+yᵢ) = 17+3+5+4+17 = 46BC = 14/46 = 0.3043
Whittaker beta:
γ = 5total species across both plotsᾱ = (4+4)/2 = 4βW = 5/4 - 1 = 0.25
Together, these values suggest modest compositional turnover between plots, with moderate incidence turnover but stronger overlap in common taxa.
How to Choose the Right Beta Diversity Metric
Use Jaccard or Sørensen when:
- You care about species replacement regardless of abundance.
- Detection is sparse and abundance estimates are noisy.
- You want easy interpretation from presence-absence data.
Use Bray-Curtis when:
- Abundance differences are biologically meaningful.
- You need sensitivity to dominance and shifts in relative counts.
- You are comparing treatment effects on community structure, not only membership.
Use Whittaker beta when:
- You are framing diversity partitioning across local and regional scales.
- You want a richness-based turnover summary.
Common Mistakes and Quality Checks
- Mismatched species order: vectors must align by species index or label.
- Mixing transformed and raw counts: keep preprocessing consistent across samples.
- Ignoring sampling depth: unequal sequencing depth can inflate apparent dissimilarity.
- Over-interpreting single metrics: compare multiple indices for robust conclusions.
- No uncertainty analysis: bootstrap or permutation procedures improve inference reliability.
In microbiome applications, normalization choices (rarefaction, relative abundance, CLR transforms, or compositional methods) can strongly alter Bray-Curtis outcomes. In vegetation ecology, detection probability and seasonal timing can alter presence-absence overlap. Always report preprocessing steps with your beta diversity statistics.
Reporting Beta Diversity in Scientific Writing
A clear reporting format includes:
- Data type (incidence or abundance)
- Index used and formula citation
- Preprocessing workflow
- Pairwise summary statistics (mean, median, range)
- Statistical test for group differences (e.g., PERMANOVA)
- Visualization (ordination, heatmap, distance matrix)
Example statement: “Community turnover between restored and unrestored sites was high (mean Bray-Curtis dissimilarity = 0.67), with significant compositional separation by treatment (PERMANOVA, p < 0.01).”
FAQ: How to Calculate Beta Diversity
Is beta diversity a similarity or dissimilarity measure?
Most commonly it is reported as dissimilarity (0 to 1), where higher means more different. Some indices begin as similarity and are converted by subtracting from 1.
What is a good beta diversity value?
There is no universal “good” value. Interpretation depends on ecosystem heterogeneity, scale, sampling method, and ecological question.
Can I compare Jaccard and Bray-Curtis directly?
You can compare trends, but they quantify different aspects of structure. Jaccard ignores abundance; Bray-Curtis includes it.
Do I need all species in both communities?
No. Species absent from one and present in the other are precisely what drives turnover.
How many sites are required?
For a single beta value, two sites are enough. For ecosystem-level inference, multiple sites and pairwise matrices are preferred.
Use the calculator above whenever you need a quick, transparent computation of beta diversity metrics. For publication-grade analyses, pair these calculations with robust sampling design, multivariate statistics, and reproducible preprocessing protocols.