University of Leicester
Browse

Perspective: on the importance of extensive, high-quality and reliable deposition of biomolecular NMR data in the age of artificial intelligence

Download (679.23 kB)
journal contribution
posted on 2024-11-29, 10:11 authored by Victoria HigmanVictoria Higman, Eliza Ploskon-ArthurEliza Ploskon-Arthur, Gary S Thompson, Geerten VuisterGeerten Vuister

Artificial intelligence (AI) models are revolutionising scientific data analysis but are reliant on large training data sets. While artificial training data can be used in the context of NMR processing and data analysis methods, relating NMR parameters back to protein sequence and structure requires experimental data. In this perspective we examine what the biological NMR community needs to do, in order to store and share its data better so that we can make effective use of AI methods to further our understanding of biological molecules. We argue, first, that the community should be depositing much more of its experimental data. In particular, we should be depositing more spectra and dynamics data. Second, the NMR data deposited needs to capture the full information content required to be able to use and validate it adequately. The NMR Exchange Format (NEF) was designed several years ago to do this. The widespread adoption of NEF combined with a new proposal for dynamics data specifications come at the right time for the community to expand its deposition of data. Third, we highlight the importance of expanding and safeguarding our experimental data repository, the Biological Magnetic Resonance Data Bank (BMRB), not only in the interests of NMR spectroscopists, but biological scientists more widely. With this article we invite others in the biological NMR community to champion increased (possibly mandatory) data deposition, to get involved in designing new NEF specifications, and to advocate on behalf of the BMRB within the wider scientific community.

Funding

The Collaborative Computational Project for NMR (CCPN): Supporting biomolecular NMR and community driven NMR software development.

Medical Research Council

Find out more...

The Collaborative Computational Project for NMR (CCPN): data analysis, integration and partnerships for NMR in the biosciences

Medical Research Council

Find out more...

The collaborative computational project for NMR (CCPN): supporting biological NMR for the biosciences, medicine and industry.

Medical Research Council

Find out more...

History

Author affiliation

College of Life Sciences Molecular & Cell Biology

Version

  • VoR (Version of Record)

Published in

Journal of Biomolecular NMR

Publisher

Springer Science and Business Media LLC

issn

0925-2738

eissn

1573-5001

Copyright date

2024

Available date

2024-11-29

Language

en

Deposited by

Dr Vicky Higman

Deposit date

2024-10-21

Data Access Statement

No datasets were generated or analysed during the current study.

Rights Retention Statement

  • Yes

Usage metrics

    University of Leicester Publications

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC