University of Leicester
Browse

A Q-learning-based memetic algorithm for multi-objective dynamic software project scheduling

Download (1.77 MB)
journal contribution
posted on 2017-11-21, 12:06 authored by Xiao-Ning Shen, Leandro L. Minku, Naresh Marturi, Yi-Nan Guo, Ying Han
Software project scheduling is the problem of allocating employees to tasks in a software project. Due to the large scale of current software projects, many studies have investigated the use of optimization algorithms to find good software project schedules. However, despite the importance of human factors to the success of software projects, existing work has considered only a limited number of human properties when formulating software project scheduling as an optimization problem. Moreover, the changing environments of software companies mean that software project scheduling is a dynamic optimization problem. However, there is a lack of effective dynamic scheduling approaches to solve this problem. This work proposes a more realistic mathematical model for the dynamic software project scheduling problem. This model considers that skill proficiency can improve over time and, different from previous work, it considers that such improvement is affected by the employees’ properties of motivation and learning ability, and by the skill difficulty. It also defines the objective of employees’ satisfaction with the allocation. It is considered together with the objectives of project duration, cost, robustness and stability under a variety of practical constraints. To adapt schedules to the dynamically changing software project environments, a multi-objective two-archive memetic algorithm based on Q-learning (MOTAMAQ) is proposed to solve the problem in a proactive-rescheduling way. Different from previous work, MOTAMAQ learns the most appropriate global and local search methods to be used for different software project environment states by using Q-learning. Extensive experiments on 18 dynamic benchmark instances and 3 instances derived from real-world software projects were performed. A comparison with seven other meta-heuristic algorithms shows that the strategies used by our novel approach are very effective in improving its convergence performance in dynamic environments, while maintaining a good distribution and spread of solutions. The Q-learning-based learning mechanism can choose appropriate search operators for the different scheduling environments. We also show how different trade-offs among the five objectives can provide software managers with a deeper insight into various compromises among many objectives, and enabling them to make informed decisions.

Funding

This work is supported by the National Natural Science Foundation of China (NSFC) under Grant No. 61502239, No. 61573361, and No. 61503191, and Natural Science Foundation of Jiangsu Province of China under Grant No. BK20150924 and No. BK20150933. We are grateful to Weineng Chen and Jun Zhang for providing the data of the three real-world PSP instances.

History

Citation

Information Sciences, 2018, 428, pp. 1-29

Author affiliation

/Organisation/COLLEGE OF SCIENCE AND ENGINEERING/Department of Informatics

Version

  • AM (Accepted Manuscript)

Published in

Information Sciences

Publisher

Elsevier

issn

0020-0255

eissn

1872-6291

Acceptance date

2017-10-21

Copyright date

2017

Available date

2018-10-24

Publisher version

http://www.sciencedirect.com/science/article/pii/S0020025517310472

Notes

The file associated with this record is under embargo until 12 months after publication, in accordance with the publisher's self-archiving policy. The full text may be available through the publisher links provided above.

Language

en

Usage metrics

    University of Leicester Publications

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC