Computer Science / Computer Vision Computer Science / Digital Libraries

Image Registration for the Alignment of Digitized Historical Documents

February 23, 2026

Reading time: 5 minute

...

#Computer Vision #Computer Science #Digital Libraries

📝 Original Info

Title: Image Registration for the Alignment of Digitized Historical Documents
ArXiv ID: 1712.04482
Date: 2017-12-14
Authors: Researchers from original ArXiv paper

📝 Abstract

In this work, we conducted a survey on different registration algorithms and investigated their suitability for hyperspectral historical image registration applications. After the evaluation of different algorithms, we choose an intensity based registration algorithm with a curved transformation model. For the transformation model, we select cubic B-splines since they should be capable to cope with all non-rigid deformations in our hyperspectral images. From a number of similarity measures, we found that residual complexity and localized mutual information are well suited for the task at hand. In our evaluation, both measures show an acceptable performance in handling all difficulties, e.g., capture range, non-stationary and spatially varying intensity distortions or multi-modality that occur in our application.

💡 Deep Analysis

Deep Dive into Image Registration for the Alignment of Digitized Historical Documents.

📄 Full Content

Image Registration for the Alignment of Digitized Historical Documents

AmirAbbas Davari1*, Tobias Lindenberger1*, Armin Häberle2, Vincent Christlein1, Andreas Maier1, Christian Riess1

Pattern Recognition Lab, Computer Science Department, Friedrich-Alexander Universität Erlangen-Nürnberg, Erlangen, Germany [amir.davari, tobias.lintob.lindenberger, vincent.christlein, andreas.maier, christian.riess] @fau.de
Bibliotheca Hertziana - Max-Planck-Institut für Kunstgeschichte, Rome, Italy Haeberle@biblhertz.it

Both authors contributed equally

1 Introduction Novel imaging and image processing techniques provide technical tools to art historians for a better understanding of the creation of an artwork. The approach of a “work process analysis” of an artwork (e.g. a drawing or painting) by art historians aims at segmenting and differentiating the unique steps of production and thus, following the artist’s path from the starting point to his final image. About 70-75% of all old master drawings consist of multiple materials, such as chalks of distinct colors, graphite and/or ink, which mostly have been applied in a step by step manner. This fact opens the opportunity for a chronological reconstruction of the genesis of the work. To this end, material decomposition of drawn layers is oftentimes the most accurate way to follow the artistic workflow. One way to perform layer separation is by spectroscopy. However, this approach is oftentimes destructive to the examined material. To allow for an examination of old master drawings while preserving the drawing to the best extend possible, it is also possible to acquire a multi- or hyperspectral image of the drawing, and separate the layers within the range of visible wavelengths. To obtain a reliable and consistent separation of artwork layers as a basis for art historical interpretation, a number of technical challenges have yet to be solved [Dav17]. First of all, there is the need for a pixel-wise “ground truth” map to objectively compare competing approaches. One possibility to get such a ground truth is to mimic the creation of a step-by-step layered artwork, and to image it after completing each work step. A map for a layer can then be obtained by subtracting two subsequent layers. However, one substantial issue lies in the fact that the acquisitions of two subsequent layers do not exactly match onto each other, and therefore have to be aligned. There are many possible reasons for the mismatch, among which are distortions, mechanical motion, and spherical and chromatic aberration of the optical devices. An example mismatch is illustrated in Fig. 1.1. Similarly, a mismatch must be compensated when the output of an algorithm for layer separation should be mapped 2

to the computed ground truth. This compensation must be a pixel-wise alignment. This is done by a process that is, in the field of image processing, referred to as “image registration”. In this work, we investigate different classical image registration methods for the purpose of creating an accurate ground truth map for hyperspectral historical document processing. We first narrow down the number of possibilities for solving this task by considering problem-specific constraints. Then, we quantitatively and qualitatively compare the two most promising approaches on a phantom document. This paper is organized as follows: in the main document, we state the key findings of this study. A more technical presentation and justification of the intermediate choices is provided in the appendix.

(a) (b) (c) Figure 1.1: Importance of image registration for layer separation in old Master Drawings using image processing is depicted here. (a) image of a phantom data that is acquired by a board scanner, (b) sample channel of hyperspectral image from the same phantom data, (c) false color overlapping image of (a) and (b). As it can be observed, the two images that are acquired by the board scanner and the hyperspectral camera are not pixel-wise aligned. Therefore, the output of layer separation algorithm on the hyperspectral image cannot be numerically evaluated. Image registration would solve this problem.

2 Methods

2.1 Hyperspectral Image Acquisition Hyperspectral imaging combines normal spatial imaging with spectroscopy. The spatial and spectral information of a target is stored in a stack of grayscale images. Each individual image in the stack represents the target recorded at a different 3

wavelength. The stack can be used for further image analysis in the spatial and spectral domain [Mid16c]. For this project, we used a hyperspectral push-broom camera. This camera type features a two dimensional detector array that is combined with a spectrograph. The target is illuminated along one of the spatial axes of the detector. This line contains the full spectrum of the camera’s spectral axis. In push-broom scanning

…(Full text truncated)…

📄 Read Full PDF on ArXiv

📸 Image Gallery

Reference

This content is AI-processed based on ArXiv data.

Image Registration for the Alignment of Digitized Historical Documents

📝 Original Info

📝 Abstract

💡 Deep Analysis

📄 Full Content

📸 Image Gallery

Reference

Table of Contents

Table of Contents

📝 Original Info

📝 Abstract

💡 Deep Analysis

📄 Full Content

📸 Image Gallery

Reference

Related Posts

'Tom' pet robot applied to urban autism

A Datamining Approach to the Short Title Catalogue Flanders: the Case of Early Modern Quiring Practices

A Deep Convolutional Auto-Encoder with Pooling - Unpooling Layers in Caffe

Start searching

No results found