University of Leicester
Browse

High-throughput phenotype-to-genotype testing of meningococcal carriage and disease isolates detects genetic determinants of disease-relevant phenotypic traits

Download (2.11 MB)
journal contribution
posted on 2024-11-18, 16:24 authored by Robeena Farzand, Mercy W Kimani, Evangelos Mourkas, Abdullahi Jama, Jack L Clark, Megan De Ste Croix, William M Monteith, Jay Lucidarme, Neil J Oldfield, David PJ Turner, Ray Borrow, Luisa Martinez-Pomares, Samuel K Sheppard, Christopher BaylissChristopher Bayliss

Genome-wide association studies (GWAS) with binary or single phenotype data have successfully identified disease-associated genotypes and determinants of antimicrobial resistance. We describe a novel phenotype-to-genotype approach for a major bacterial pathogen that involves simultaneously testing for associations among multiple disease-related phenotypes and linkages between phenotypic variation and genetic determinants. High-throughput assays quantified variation among 163 Neisseria meningitidis serogroup W ST-11 clonal complex isolates for 11 phenotypic traits. A comparison of carriage and two disease subgroups detected significant differences between groups for eight phenotypic traits. Candidate genotypic testing indicated that indels in csw , a capsular biosynthesis gene, were associated with reduced survival in antibody-depleted heat-inactivated serum. GWAS testing detected 341 significant genetic variants (3 single-nucleotide polymorphisms and 338 unitigs) across all traits except serum bactericidal antibody-depleted assays. Growth traits were associated with variants of capsular biosynthesis genes, carbonic anhydrase, and an iron-uptake system while adhesion-linked variation was in pilC2 , marR, and mutS . Multiple phase variation states or combinatorial phasotypes were associated with significant differences in multiple phenotypes. Controlling for group effects through regression and recursive random forest approaches detected group-independent effects for nalP with biofilm formation and fetA with a growth trait. Through random forest testing, nine phenotypes were weakly predictive of MenW:cc11 sub-lineage, original or 2013, for disease isolates while three characteristics separated carriage and disease isolates with >80% accuracy. This study demonstrates the power of combining high-throughput phenotypic testing of pathogenically relevant isolate collections with genomics for identifying genetic determinants of specific disease-relevant phenotypes and the pathobiology of microbial pathogens. IMPORTANCE Next-generation sequencing technologies have led to the creation of extensive microbial genome sequence databases for several bacterial pathogens. Mining of these databases is now imperative for unlocking the maximum benefits of these resources. We describe a high-throughput methodology for detecting associations between phenotypic variation in multiple disease-relevant traits and a range of genetic determinants for Neisseria meningitidis , a major causative agent of meningitis and septicemia. Phenotypic variation in 11 disease-related traits was determined for 163 isolates of the hypervirulent ST-11 lineage and linked to specific single-nucleotide polymorphisms, short sequence variants, and phase variation states. Application of machine learning algorithms to our data outputs identified combinatorial phenotypic traits and genetic variants predictive of a disease association. This approach overcomes the limitations of generic meta-data, such as disease versus carriage, and provides an avenue to explore the multi-faceted nature of bacterial disease, carriage, and transmissibility traits.

History

Author affiliation

College of Life Sciences Genetics, Genome Biology & Cancer Sciences

Version

  • AM (Accepted Manuscript)

Published in

mBio

Pagination

e0305924

Publisher

American Society for Microbiology

issn

2150-7511

eissn

2150-7511

Copyright date

2024

Available date

2024-11-18

Editors

Rappuoli R

Spatial coverage

United States

Language

en

Deposited by

Professor Christopher Bayliss

Deposit date

2024-11-04

Data Access Statement

All data are either included within the manuscript or are openly available at the University of Leicester Research Repository, accessible via (66).

Rights Retention Statement

  • Yes

Usage metrics

    University of Leicester Publications

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC