University of Leicester
Browse

Selection of computational environments for PSP processing on scientific gateways.

Download (1.84 MB)
journal contribution
posted on 2018-08-13, 15:33 authored by Edvard Martins de Oliveira, Júlio Cézar Estrella, Alexandre Cláudio Botazzo Delbem, Luiz Henrique Nunes, Henrique Yoshikazu Shishido, Stephan Reiff-Marganiec
Science Gateways have been widely accepted as an important tool in academic research, due to their flexibility, simple use and extension. However, such systems may yield performance traps that delay work progress and cause waste of resources or generation of poor scientific results. This paper addresses an investigation on some of the failures in a Galaxy system and analyses of their impacts. The use case is based on protein structure prediction experiments performed. A novel science gateway component is proposed towards the definition of the relation between general parameters and capacity of machines. The machine-learning strategies used appoint the best machine setup in a heterogeneous environment and the results show a complete overview of Galaxy, a diverse platform organization, and the workload behavior. A Support Vector Regression (SVR) model generated and based on a historic data-set provided an excellent learning module and proved a varied platform configuration is valuable as infrastructure in a science gateway. The results revealed the advantages of investing in local cluster infrastructures as a base for scientific experiments.

Funding

This work was supported by the Brazilian National Council for Scientific and Technological Development (CNPQ) (165009/2015-2).

History

Citation

Heliyon, 2018, 4 (7), e00690

Author affiliation

/Organisation/COLLEGE OF SCIENCE AND ENGINEERING/Department of Informatics

Version

  • VoR (Version of Record)

Published in

Heliyon

Publisher

Elsevier

issn

2405-8440

Acceptance date

2018-07-05

Copyright date

2018

Available date

2018-08-13

Publisher version

https://www.sciencedirect.com/science/article/pii/S2405844017333303

Notes

Data associated with this study has been deposited at http://bit.ly/protein-algos

Language

en

Usage metrics

    University of Leicester Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC