A3D3 all-hands: High-Throughput AI Methods and Infrastructure Workshop

Name: A3D3 all-hands: High-Throughput AI Methods and Infrastructure Workshop
Start: 2023-07-10T01:30:00-07:00
End: 2023-07-14T12:00:00-07:00
Location: University of Washington

10–14 Jul 2023

University of Washington

US/Pacific timezone

MCUNetV1 & V2: On-Device Inference of Tiny Deep Learning on IoT Devices

10 Jul 2023, 19:00

Oak Hall Denny Room

Poster Working dinner

Dr Wei-Chen Wang (MIT)

Machine learning on tiny IoT devices based on microcontroller units (MCU) is appealing but challenging: the memory of microcontrollers is 2-3 orders of magnitude smaller even than mobile phones. We propose MCUNet, a framework that jointly designs the efficient neural architecture (TinyNAS) and the lightweight inference engine (TinyEngine), enabling ImageNet-scale inference on microcontrollers. TinyNAS adopts a two-stage neural architecture search approach that first optimizes the search space to fit the resource constraints, then specializes the network architecture in the optimized search space. TinyNAS can automatically handle diverse constraints (i.e. device, latency, energy, memory) under low search costs. TinyNAS is co-designed with TinyEngine, a memory-efficient inference library to expand the search space and fit a larger model. TinyEngine adapts the memory scheduling according to the overall network topology rather than layer-wise optimization, reducing the memory usage by 3.4x, and accelerating the inference by 1.7-3.3x compared to TF-Lite Micro and CMSIS-NN. MCUNet is the first to achieve>70% ImageNet top1 accuracy on an off-the-shelf commercial microcontroller, using 3.5x less SRAM and 5.7x less Flash compared to quantized MobileNetV2 and ResNet-18. On visual&audio wake words tasks, MCUNet achieves state-of-the-art accuracy and runs 2.4-3.4x faster than MobileNetV2 and ProxylessNAS-based solutions with 3.7-4.1x smaller peak SRAM. Our study suggests that the era of always-on tiny machine learning on IoT devices has arrived.

Ji Lin (MIT) Wei-Ming Chen (MIT) Yujun Lin (MIT) Han Cai (MIT) Dr Wei-Chen Wang (MIT) John Cohn (MIT-IBM Watson AI Lab) Chuang Gan (MIT-IBM Watson AI Lab) Song Han (MIT)

[A3D3 2023] MCUNetV1 & V2_Poster_36x48.pdf

A3D3 all-hands: High-Throughput AI Methods and Infrastructure Workshop

MCUNetV1 & V2: On-Device Inference of Tiny Deep Learning on IoT Devices

Oak Hall Denny Room

Speaker

Description

Authors

Presentation materials

Choose timezone

A3D3 all-hands: High-Throughput AI Methods and Infrastructure Workshop

Speaker

Description

Authors

Presentation materials