Computer Science / Computer Vision Computer Science / Machine Learning Computer Science / Neural Computing

자기용 신경망을 이용한 컨볼루션 신경망의 가위질

2026년 02월 10일

읽는 시간: 3 분

...

📝 원문 정보

Title: Pruning Convolutional Neural Networks with Self-Supervision
ArXiv ID: 2001.03554
발행일: 2020-01-13
저자: Mathilde Caron, Ari Morcos, Piotr Bojanowski, Julien Mairal and Armand Joulin

📝 초록 (Abstract)

감독 없이 훈련된 합성곱 신경망은 감독된 사전 훈련과 거의 동등한 성능을 보여주지만 때로는 더 많은 매개변수를 가질 수 있습니다. 이러한 큰 무감독 합성곱 신경망에서 성능이 유지되는 하위 네트워크를 추출하는 것은 계산량을 줄이는 데 중요합니다. 전통적인 속기 방법은 특정 작업에 대한 훈련 중에 손상된 네트워크의 성능을 유지하려고 시도합니다. 그러나 자가 감독 학습에서는 훈련 목표가 하위 작업에 대한 표현력 이전성과 무관하기 때문에, 이러한 목표를 보존하는 것이 속기된 하위 네트워크가 여전히 하위 작업을 해결하는 데 효과적일 것을 보장하지 않습니다. 본 연구에서는 주로 감독 학습을 위해 개발된 표준 속기 방법을 레이블 없이 훈련된 네트워크(즉, 자가 감독 과제)에 적용하는 것을 조사하였습니다. 우리는 레이블과 함께 또는 없이 얻은 속기 마스크가 다시 레이블로 재훈련되었을 때 유사한 성능을 달성한다는 것을 보여주었습니다. 이는 자가 감독 학습과 감독 학습에서 속기가 동일하게 작동함을 시사합니다. 또한, 우리는 속기 과정이 자가 감독 하위 네트워크 표현의 전달 성능을 유지한다는 것을 발견하였습니다.

💡 논문 핵심 해설 (Deep Analysis)

This paper explores the application of pruning techniques to convolutional neural networks (CNNs) trained without supervision, aiming to extract subnetworks that maintain performance while reducing computational costs. The research addresses a significant challenge in deep learning where unsupervised pre-trained models often come with high parameter counts, making them computationally expensive.

The core contribution is the adaptation of established pruning methods, originally designed for supervised learning, to networks trained through self-supervision. The authors use magnitude-based unstructured iterative pruning, which involves iteratively removing weights based on their size while maintaining network performance on a specific task. This method was tested with two prominent self-supervised tasks: rotation classification and the Exemplar approach.

The results indicate that pruned masks obtained both with and without labels achieve comparable performance when retrained with labels. This suggests that pruning operates similarly for supervised and unsupervised learning, preserving transferability to downstream tasks. The study also highlights several limitations, such as sensitivity to late resetting parameters and the need for further exploration of structured pruning methods.

The significance lies in providing a pathway to efficiently utilize over-parameterized models by reducing their complexity without sacrificing performance. This can lead to more efficient deployment in real-world applications where computational resources are limited.

📄 논문 본문 발췌 (Translation)

...

[[IMG_PROTECT_1]] … 이 논문의 목표는 감독 학습을 위해 주로 개발된 표준 속기 방법을 레이블 없이 훈련된 네트워크(즉, 자가 감독 과제)에 어떻게 적용할 수 있는지를 연구하는 것입니다. 우리는 구조화되지 않은 무작위 기반 속기와 자가 감독 학습의 잘 알려진 방법론을 사용하여 이를 수행합니다. 특히, Han 등이 제안한 가중치 크기에 따른 반복적인 속기 과정을 사용하여 과도하게 매개변수화된 네트워크에서 희소 하위 네트워크를 추출합니다.

…

[[IMG_PROTECT_2]] … 우리의 실험은 ImageNet 데이터셋 위주로 진행되었으며, CIFAR-10에 대한 결과도 일부 보고하였습니다. ResNet-50을 ImageNet에서 사용하고, CIFAR-10에는 ResNet-18을 사용하였습니다.

…

📄 ArXiv 원문 PDF 보기

자기용 신경망을 이용한 컨볼루션 신경망의 가위질

📝 원문 정보

📝 초록 (Abstract)

💡 논문 핵심 해설 (Deep Analysis)

📄 논문 본문 발췌 (Translation)

Reference

목차

목차

📝 원문 정보

📝 초록 (Abstract)

💡 논문 핵심 해설 (Deep Analysis)

📄 논문 본문 발췌 (Translation)

Reference

검색 시작

검색 결과 없음