posted on 2019-09-04, 14:12authored byJack Aidley, Joseph J. Wanford, Luke R. Green, Samuel K. Sheppard, Christopher D. Bayliss
Hypermutable simple sequence repeats (SSRs) are drivers of phase variation (PV) whose stochastic, high-frequency, reversible switches in gene expression are a common feature of several pathogenic bacterial species, including the human pathogen Campylobacter jejuni. Here we examine the distribution and conservation of known and putative SSR-driven phase variable genes - the phasome - in the genus Campylobacter. PhasomeIt, a new program, was specifically designed for rapid identification of SSR-mediated PV. This program detects the location, type and repeat number of every SSR. Each SSR is linked to a specific gene and its putative expression state. Other outputs include conservation of SSR-driven phase-variable genes and the 'core phasome' - the minimal set of PV genes in a phylogenetic grouping. Analysis of 77 complete Campylobacter genome sequences detected a 'core phasome' of conserved PV genes in each species and a large number of rare PV genes with few, or no, homologues in other genome sequences. Analysis of a set of partial genome sequences, with food-chain-associated metadata, detected evidence of a weak link between phasome and source host for disease-causing isolates of sequence type (ST)-828 but not the ST-21 or ST-45 complexes. Investigation of the phasomes in the genus Campylobacter provided evidence of overlapping but distinctive mechanisms of PV-mediated adaptation to specific niches. This suggests that the phasome could be involved in host adaptation and spread of campylobacters. Finally, this tool is malleable and will have utility for studying the distribution and genic effects of other repetitive elements in diverse bacterial species.
Funding
JA was funded by the BBSRC through the MIBTP Doctoral Training Programme under grant number BB/J014532/1. JW was funded by the MRC IMPACT scheme. LRG was funded by the MRC through grant number MR/M020193/1. The funders had no role in the design of the study, collection, analysis and interpretation of the data, writing of the manuscript, or the decision to publish.
History
Citation
Microbial Genomics, 2018, 4 (11)
Author affiliation
/Organisation/COLLEGE OF LIFE SCIENCES/Biological Sciences/Genetics and Genome Biology
All supporting data, code and protocols have been provided within the article or through supplementary data files. Five supplementary tables and three supplementary figures are available with the online version of this article. Data Summary
1. The PhasomeIt program has been deposited in Github (url – https://github.com/JackAidley/PhasomeIt/).
2. The complete dataset for the genus Campylobacter has been deposited in Figshare: DOI: 10.6084/m9.figshare.7066475.v1 (url –https://figshare.com/articles/Campylobacter_phasome_analysis_generated_by_PhasomeIt/7066475).
3. The complete dataset for the Campylobacter isolates with meta-data has been deposited in Figshare: DOI: 10.6084/m9.figshare.7066484.v1 (url – https://figshare.com/articles/Phasome_analysis_of_host_specified_Campylobacter_isolate_generated_by_PhasomeIt/7066484).
4. The genome sequence files for the Campylobacter isolates with metadata have been deposited in Figshare: DOI: 10.6084/m9.figshare.7066487.v1 (url – https://figshare.com/articles/Host_specified_Campylobacter_genomes/7066487).