University of Leicester
Browse
2_NAIR-An Efficient Distributed Deep Learning Architecture for Resource Constrained IoT System(1).pdf (3.74 MB)

NAIR: An Efficient Distributed Deep Learning Architecture for Resource Constrained IoT System

Download (3.74 MB)
journal contribution
posted on 2024-04-15, 11:52 authored by Yucong Xiao, Daobing Zhang, Yunsheng Wang, Xuewu Dai, Zhipei Huang, Wuxiong Zhang, Yang Yang, Ashiq AnjumAshiq Anjum, Fei Qin
The distributed deep learning architecture can support the front-deployment of deep learning systems in resource constrained IoT devices and is attracting increasing interest. However, most ready-to-use deep models are designed for centralized deployment without considering the transmission loss of the intermediate representation inside the distributed architecture. This oversight significantly affects the inference performance of distributed deployed deep models. To alleviate this problem, a state-of-the-art work chooses to retrain the original model to form an intermediate representation with ordered importance and yields better inference accuracy under constrained transmission bandwidth. This paper first reveals that this solution is essentially a pruning-like solution, where unimportant information is adaptively pruned to fit within the limited bandwidth. With this understanding, a novel scheme named Naturally Aggregated Intermediate Representation (NAIR) has been proposed, which aims to naturally amplify the difference of importance embedded in the intermediate representation from a mature deep model and reassemble the intermediate representation into a hierarchy of importance from high-to-low to accommodate the transmission loss. As a result, this method shows further improved performance in various scenarios, avoids compromising the overall inference performance of the system, and saves astronomical retraining and storage costs. The effectiveness of NAIR has been validated through extensive experiments, achieving a 112% improvement in performance compared to the state-of-the-art work.

Funding

10.13039/501100012166-National Key Research and Development Program of China (Grant Number: 2021YFC3090204 and 2022YFA100390)

10.13039/100000001-National Science Foundation (Grant Number: CNS 2128346)

Self-Learning Digital Twins for Sustainable Land Management

Engineering and Physical Sciences Research Council

Find out more...

Self-learning AI-based digital twins for accelerating clinical care in respiratory emergency admissions (SLAIDER)

Engineering and Physical Sciences Research Council

Find out more...

10.13039/501100001809-National Natural Science Foundation of China (Grant Number: 62071450 and U21B2002)

10.13039/100007219-Natural Science Foundation of Shanghai Municipality (Grant Number: 19ZR1454100)

10.13039/501100012226-Fundamental Research Funds for the Central Universities

History

Author affiliation

College of Science & Engineering/Comp' & Math' Sciences

Version

  • AM (Accepted Manuscript)

Published in

IEEE Internet of Things Journal

Publisher

Institute of Electrical and Electronics Engineers (IEEE)

eissn

2327-4662

Copyright date

2024

Available date

2024-04-15

Language

en

Deposited by

Professor Ashiq Anjum

Deposit date

2024-04-14

Usage metrics

    University of Leicester Publications

    Categories

    No categories selected

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC