A multi-branch convolutional neural network for detecting double JPEG compression

February 23, 2026

Reading time: 5 minute

...

📝 Abstract

Detection of double JPEG compression is important to forensics analysis. A few methods were proposed based on convolutional neural networks (CNNs). These methods only accept inputs from pre-processed data, such as histogram features and/or decompressed images. In this paper, we present a CNN solution by using raw DCT (discrete cosine transformation) coefficients from JPEG images as input. Considering the DCT sub-band nature in JPEG, a multiple-branch CNN structure has been designed to reveal whether a JPEG format image has been doubly compressed. Comparing with previous methods, the proposed method provides end-to-end detection capability. Extensive experiments have been carried out to demonstrate the effectiveness of the proposed network.

💡 Analysis

🇰🇷 한글로 읽기

📄 Content

1 A multi-branch convolutional neural network for detecting double JPEG compression

Bin Li, Hu Luo, Haoxin Zhang, Shunquan Tan, Zhongzhou Ji Shenzhen Key Lab of Media Information Security, Shenzhen University, P. R. China ABSTRACT Detecting double JPEG compression is important to forensics analysis. A few methods were proposed based on convolutional neural networks (CNNs). These methods only accept inputs from pre-processed data, such as histogram features and/or decompressed images. In this paper, we present a CNN solution by using raw DCT (discrete cosine transformation) coefficients from JPEG images as input. Considering the DCT sub-band nature in JPEG, a multiple-branch CNN structure has been designed to reveal whether a JPEG format image has been doubly compressed. Comparing with previous methods, the proposed method provides end-to-end detection capability. Extensive experiments have been carried out to demonstrate the effectiveness of the proposed network.

Keywords: convolutional neural network; discrete cosine transformation; double JPEG compression; image forensics; multi-branch structure

INTRODUCTION With the tremendous evolution of Internet and online social media, multimedia data (especially images) have been pervasively used due to their abilities to show contents intuitively, exchange information conveniently, and entertain the public ubiquitously. Meanwhile, common people can edit images or videos with handy software; while human eyes can hardly notice the changes. Consequently, increasing concerns are drawn on the security issues regarding the originality and the authenticity of multimedia data, which is the main task of digital forensics.
JPEG (Wallace, 1992) images are the most common ones generated from cameras and widely spread on the Internet. Therefore, there is a high probability that a forger would choose JPEG images to edit, such as performing copy-move or splicing operations, and afterwards performing JPEG recompression for distribution. Detection of double JPEG compression may provide useful information to reveal the traces of image edition or fabricated manipulation.
Various kinds of approaches have been proposed to perform double JPEG detection based on extracting handcrafted features (Lukas et al., 2003; Popescu & Farid, 2004; Fu et al., 2007; Li et al., 2008; Amerini et al. 2014; Pasquini et al., 2014; Taimori et al., 2016). It is generally assumed that the AC (alternating current) DCT (discrete cosine transform) coefficients of a JPEG image follow Laplacian distribution, and their first significant digits conform Benford’s law. When the image has been JPEG compressed twice with a different quantization matrix, the 2 distribution of DCT coefficients is deviated from the statistical model. As a result, handcrafted features are often extracted based on the statistics of DCT coefficients. (Lukas et al., 2003) discovered that the histogram of DCT coefficients in a doubly compressed JPEG image fluctuate periodically. The phenomenon, called double quantization (DQ) effect, has been mathematically verified by (Popescu &Farid, 2004). Based on the observation in (Fu et al., 2007) that the DCT coefficients of singly JPEG compressed images follows the Benford’s law, while that of doubly compressed images violating it, (Li et al., 2008) designed mode-based first digit features (MBFDF) to effectively detect double JPEG compression. They extracted the first significant digits (0~9) of DCT coefficients in the first 20 AC sub-bands, and concatenated them as a 180-D feature vector. (Amerini et al. 2014) took the first digits of 2, 5, and 7 in the first 9 AC sub-bands to reduce the feature dimension to 27-D. (Pasquini et al., 2014) proposed a statistical model based on Benford-Fourier coefficients. (Taimori et al., 2016) developed a quantization table unaware method for double JPEG compression detection based on Benford’s law by using all non-zero and zero AC coefficients. It is tempting that data-driven approaches, such as those based on deep learned features (LeCun et al., 2015; Simonyan & Zisserman, 2014; Szegedy et al., 2015; He et al., 2016), may explore hierarchy data representation, and automatically expose statistical deviation without knowing much of the details of image manipulation. (Wang & Zhang, 2016; Amerini et al., 2017) have proposed effective CNNs for double JPEG compression. However, as shown in Fig. 1 with the overview of network architecture and detailedly described in the next section, the input of their CNNs is preprocessed by extracting the histogram of DCT coefficients, instead of directly using the raw DCT coefficients. As we know, histogram is a kind of first-order statistics. Such a preprocessing stage has already provided some handcrafted information for double JPEG compression, as implicitly shown in (Lukas et al., 2003). As a result, these CNNs may lack of end-to-end feature extract

View Original ArXiv

This content is AI-processed based on ArXiv data.

A multi-branch convolutional neural network for detecting double JPEG compression

📝 Abstract

💡 Analysis

📄 Content

Table of Contents

Table of Contents

📝 Abstract

💡 Analysis

📄 Content

Start searching

No results found