Complex exchanges among plasmids and clonal expansion of lineages shape the population structure and virulence of Borrelia burgdorferi.
In the United States, Borrelia burgdorferi (Bb) is the principal etiologic agent of Lyme disease. The complex structure of Bb genomes has posed challenges for genomic studies because homology among the bacterium's many plasmids, which account for ~40% of the genome by length, has made them difficult to sequence and assemble. We used long-read sequencing to generate near-complete assemblies of 62 human-derived Bb isolates and collected public genomes with plasmid sequences. We characterized genetic diversity and population structure in the resulting set of 82 plasmid-complete Borrelia burgdorferi sensu stricto genomes. The Bb core genome is encoded by a chromosome and the conserved plasmids cp26, lp54, and lp17; the accessory genome is encoded by all other plasmids and the distal arm of the chromosome. Near-complete genomes reveal that the most granular Bb genotypes are clonal expansions of complex rearrangements among accessory genome elements. Ribosomal spacer types (RST) represent multiple collections of such genotypes, whereas OspC types are usually clonal. Structural rearrangements are non-randomly distributed throughout the genome, with cp32 plasmids undergoing dense exchanges and most linear plasmids, except lp54, sharing blocks among themselves and with the distal arm of the chromosome. OspC type A strains, known to possess greater virulence in humans, are distinguished by the presence of lp28-1 and lp56. Rearrangements among plasmids tended to preserve gene content, suggesting functional constraints among gene networks. Using k-partite graph decompositions, we identified gene sets with correlation patterns suggestive of conserved functional modules.
Long-read assemblies reveal that Bb population genetic structure results from clonal expansion of lineages that have undergone complex rearrangements among plasmid-encoded accessory genome elements. Genetic structure is preserved among genes even when plasmid rearrangements occur, suggesting that selection among epistatic loci maintains functional genetic networks. The analysis of near-complete genomes assembled using long-read sequencing methods advances our understanding of Bb biology and Lyme disease pathogenesis by providing the first detailed view of population variation in previously inaccessible areas of the Bb genome.