Ditto: Accelerating Diffusion Model via Temporal Value Similarity

Reading time: 2 minute
...

📝 Original Info

  • Title: Ditto: Accelerating Diffusion Model via Temporal Value Similarity
  • ArXiv ID: 2501.11211
  • Date: 2025-01-20
  • Authors: ** 제공되지 않음 (논문에 저자 정보가 명시되지 않았습니다.) **

📝 Abstract

Diffusion models achieve superior performance in image generation tasks. However, it incurs significant computation overheads due to its iterative structure. To address these overheads, we analyze this iterative structure and observe that adjacent time steps in diffusion models exhibit high value similarity, leading to narrower differences between consecutive time steps. We adapt these characteristics to a quantized diffusion model and reveal that the majority of these differences can be represented with reduced bit-width, and even zero. Based on our observations, we propose the Ditto algorithm, a difference processing algorithm that leverages temporal similarity with quantization to enhance the efficiency of diffusion models. By exploiting the narrower differences and the distributive property of layer operations, it performs full bit-width operations for the initial time step and processes subsequent steps with temporal differences. In addition, Ditto execution flow optimization is designed to mitigate the memory overhead of temporal difference processing, further boosting the efficiency of the Ditto algorithm. We also design the Ditto hardware, a specialized hardware accelerator, fully exploiting the dynamic characteristics of the proposed algorithm. As a result, the Ditto hardware achieves up to 1.5x speedup and 17.74% energy saving compared to other accelerators.

💡 Deep Analysis

Figure 1

📄 Full Content

📸 Image Gallery

Similairy_graph.png Value_range_graph.png Value_range_picture.png compute_unit.png dag.png delta_processing.png diffusion_layer.png encoding_logic.png eval_camd.png eval_dag_change_layer_accuracy.png eval_dag_ideal.png eval_defo.png eval_ditto.png eval_figure1.png eval_memory_access.png hardware_overview.png motive_bit_req.png motive_memory_access.png open-research-objects.png potential_speedup.png research-objects-reviewed.png results-reproduced.png speedup_layer.png

Reference

This content is AI-processed based on open access ArXiv data.

Start searching

Enter keywords to search articles

↑↓
ESC
⌘K Shortcut