Distributed Deep Convolutional Neural Networks for the Internet-of-Things

Reading time: 3 minute
...

📝 Original Paper Info

- Title: Distributed Deep Convolutional Neural Networks for the Internet-of-Things
- ArXiv ID: 1908.01656
- Date: 2021-07-30
- Authors: Simone Disabato, Manuel Roveri, Cesare Alippi

📝 Abstract

Severe constraints on memory and computation characterizing the Internet-of-Things (IoT) units may prevent the execution of Deep Learning (DL)-based solutions, which typically demand large memory and high processing load. In order to support a real-time execution of the considered DL model at the IoT unit level, DL solutions must be designed having in mind constraints on memory and processing capability exposed by the chosen IoT technology. In this paper, we introduce a design methodology aiming at allocating the execution of Convolutional Neural Networks (CNNs) on a distributed IoT application. Such a methodology is formalized as an optimization problem where the latency between the data-gathering phase and the subsequent decision-making one is minimized, within the given constraints on memory and processing load at the units level. The methodology supports multiple sources of data as well as multiple CNNs in execution on the same IoT system allowing the design of CNN-based applications demanding autonomy, low decision-latency, and high Quality-of-Service.

💡 Summary & Analysis

This paper addresses the challenges of deploying deep learning (DL) solutions in Internet-of-Things (IoT) units that are constrained by limited memory and computational power. These constraints often prevent the execution of typical DL models, which require substantial memory and processing capabilities. The authors propose a design methodology for distributing Deep Convolutional Neural Networks (CNNs) across IoT systems to support real-time data processing and decision-making while respecting these constraints.

The core technology introduced in this paper formalizes the deployment strategy as an optimization problem aimed at minimizing latency between data collection and decision-making phases, within given memory and processing load limits. The methodology supports multiple data sources and multiple CNN models operating simultaneously on a single IoT system, which is crucial for applications requiring autonomy, low-latency decisions, and high-quality service.

The results demonstrate the effectiveness of this approach in various experimental settings, particularly with a 5-layer CNN example. By successfully managing memory and processing loads efficiently, the methodology enables real-time execution and decision-making processes that are essential for IoT applications.

This research is significant as it paves the way for more efficient deployment of DL models on constrained IoT devices, potentially expanding their applicability across various industries where real-time data analysis and quick decisions are critical.

📄 Full Paper Content (ArXiv Source)

@m0pt@C0.4cmC2.25cmC2.25cmC2.25cmcC1.4cmC1.4cmC1.4cm@m0pt@ &&&&&
&L&$`t_t`$&$`t_p`$&$`t=t_t+t_p`$&&$`\eta_R`$&$`\eta_O`$&$`\eta_B`$&
&1&$`361.85\pm60.77`$&$`12059.62\pm430.32`$&$`12421.47\pm406.82`$& &$`4.92\pm1.84`$&$`4.05\pm1.84`$&$`0.03\pm0.17`$&
&2&$`249.47\pm69.22`$&$`11723.26\pm319.91`$&$`11972.72\pm313.71`$& & $`3.59\pm0.90`$&$`1.49\pm0.95`$&$`0.01\pm0.10`$&
&3&$`183.34\pm87.14`$&$`11645.13\pm259.90`$&$`11828.47\pm267.93`$& &$`2.80\pm0.55`$&$`0.55\pm0.86`$&$`0.00\pm0.06`$&
&4&$`138.99\pm85.02`$&$`11620.58\pm236.18`$&$`11759.57\pm252.58`$& &$`2.14\pm0.50`$&$`0.88\pm0.55`$&$`0.00\pm0.07`$&
&C&$`127.85\pm94.81`$&$`11602.91\pm215.07`$&$`11730.76\pm237.14`$& &$`0.99\pm0.11`$&$`0.08\pm0.34`$&$`0.00\pm0.00`$&

The methodology is applied to a 5-layer CNN (i.e., C = 1).

A Note of Gratitude

The copyright of this content belongs to the respective researchers. We deeply appreciate their hard work and contribution to the advancement of human civilization.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut