LION-DG: Layer-Informed Initialization with Deep Gradient Protocols for Accelerated Neural Network Training


๐Ÿ“ Original Info

  • Title: LION-DG: Layer-Informed Initialization with Deep Gradient Protocols for Accelerated Neural Network Training
  • ArXiv ID: 2601.02105
  • Date: 2026-01-05
  • Authors: Hyunjun Kim

๐Ÿ“ Abstract

Weight initialization remains decisive for neural network optimization, yet existing methods are largely layer-agnostic. We study initialization for deeply-supervised architectures with auxiliary classifiers, where untrained auxiliary heads can destabilize early training through gradient interference. We propose LION-DG, a layer-informed initialization that zero-initializes auxiliary classifier heads while applying standard He initialization to the backbone. We prove that this implements Gradient Awakening: auxiliary gradients are exactly zero at initialization, then phase in naturally as the weights grow, providing an implicit warmup without hyperparameters. Experiments on CIFAR-10 and CIFAR-100 with DenseNet-DS and ResNet-DS architectures demonstrate:

  • DenseNet-DS: +8.3% faster convergence on CIFAR-10 with comparable accuracy
  • Hybrid approach: combining LSUV with LION-DG achieves the best accuracy (81.92% on CIFAR-10)
  • ResNet-DS: positive speedup on CIFAR-100 (+11.3%) with a side-tap auxiliary design

We identify architecture-specific trade-offs and provide clear guidelines for practitioners. LION-DG is simple, requires zero hyperparameters, and adds no computational overhead.
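
The recipe described in the abstract can be sketched in a few lines of PyTorch. This is a minimal illustration based only on the abstract, not the paper's released code: the module layout (`backbone`, `aux_heads`) and the assumption that each auxiliary head ends in a single linear classifier are hypothetical.

```python
# Minimal sketch of LION-DG-style initialization, assuming each auxiliary
# head ends in a linear classifier. Module names are illustrative only.
import torch.nn as nn


def lion_dg_init(backbone: nn.Module, aux_heads: nn.ModuleList) -> None:
    # Standard He (Kaiming) initialization for the backbone.
    for m in backbone.modules():
        if isinstance(m, (nn.Conv2d, nn.Linear)):
            nn.init.kaiming_normal_(m.weight, nonlinearity="relu")
            if m.bias is not None:
                nn.init.zeros_(m.bias)
        elif isinstance(m, nn.BatchNorm2d):
            nn.init.ones_(m.weight)
            nn.init.zeros_(m.bias)

    # Zero-initialize each auxiliary classifier head: with zero weights and
    # zero bias its logits are zero, so the gradient it sends back into the
    # backbone is exactly zero at step 0.
    for head in aux_heads:
        for m in head.modules():
            if isinstance(m, nn.Linear):
                nn.init.zeros_(m.weight)
                if m.bias is not None:
                    nn.init.zeros_(m.bias)
```

The "Gradient Awakening" behaviour follows directly from this setup: for a zeroed head with logits z = Wx + b, the signal flowing back into the backbone is Wᵀ ∂L/∂z = 0, while ∂L/∂W = (∂L/∂z) xᵀ is generally nonzero, so W grows during training and the auxiliary gradient phases in without any explicit warmup schedule.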

Full Content

...(๋ณธ๋ฌธ ๋‚ด์šฉ์ด ๊ธธ์–ด ์ƒ๋žต๋˜์—ˆ์Šต๋‹ˆ๋‹ค. ์‚ฌ์ดํŠธ์—์„œ ์ „๋ฌธ์„ ํ™•์ธํ•ด ์ฃผ์„ธ์š”.)
