1. Introduction
Autonomous mobile robotic systems are required, among other things, to navigate real-world environments and to recognize and identify objects of interest with which the robotic system has to interact. Computer vision allows machines to obtain a large amount of information from the environment, which has a major impact on their behavior. The surrounding environment often contains numerous static objects (walls, columns, doors, and production machines) as well as moving objects (people, cars, and handling trucks).
Objects that are used to refine navigation, called landmarks, can be artificial (usually added by a human) or natural (occurring naturally in the environment) [1,2].
Robotic systems have to work in a real environment and must be able to recognize, for example, people [3], cars [4], product parameters for the purpose of quality control, or objects which are to be handled.
The use of QR (Quick Response) codes (two-dimensional matrix codes) in interaction with robots can be seen in the following areas:
in the field of navigation, as artificial landmarks (analogues of traffic signs) that control the movement of the robot in a given area (no entry, driving direction, alternate route, and permitted hours of operation) or as information boards providing context-specific information or instructions (such as identification of a floor, room, pallet, or working place)
in the area of object identification, where 2D codes are often used to mark products and goods; their recognition then provides information about the type of goods (warehouses), the destination of a shipment (sorting lines), or supports control and tracking during track-and-trace.
QR Codes
QR Codes are classified among 2D matrix codes (similar to Data Matrix codes). QR Codes (Model 1 and Model 2) are square-shaped 2D matrices of dark and light squares—so-called modules. Each module represents a binary 1 or 0. Each QR Code has fixed parts (such as Finder Patterns and Timing Patterns) that are common to every QR Code and variable parts that differ according to the data encoded by the QR Code. The Finder Patterns, which are located in three corners of a QR Code, are important for determining the position and rotation of the QR Code. The size of a QR Code is determined by the number of modules and can vary from 21 × 21 modules (Version 1) to 177 × 177 modules (Version 40). Each higher version number comprises four additional modules per side. Figure 1 shows a sample of a Version 1 QR Code.
A QR Code has error correction capability to restore data if the code is partially damaged. Four error correction levels are available (L–Low, M–Medium, Q–Quartile, and H–High). The error correction level determines how much of the QR Code can be corrupted for the data to still be recoverable (L–7%, M–15%, Q–25%, and H–30%) [5]. The QR Code error correction feature is implemented by adding a Reed–Solomon code to the original data. The higher the error correction level, the less storage capacity is available for data.
Each QR Code symbol version has a maximum data capacity that depends on the amount of data, the character type, and the error correction level. The data capacity ranges from 10 alphanumeric (or 17 numeric) characters for the smallest QR Code up to 1852 alphanumeric (or 3057 numeric) characters for the largest QR Code at the highest error correction level [5]. QR Codes support four encoding modes—numeric, alphanumeric, Kanji, and binary—to store data efficiently.
The QR Code was designed in 1994 in Japan for the automotive industry, but it currently has a much wider use. QR Codes are used to mark a variety of objects (goods, posters, monuments, locations, and business cards) and allow additional information, often in the form of a URL of a web page, to be attached to them. The QR Code is an ISO standard (ISO/IEC 18004:2015) and is freely available without license fees.
In addition to the traditional QR Code Models 1 and 2, there are also variants such as the Micro QR Code (a smaller version of the QR Code standard for applications where the symbol size is limited) or the iQR Code (which can hold a greater amount of information than a traditional QR Code and also supports rectangular shapes) (Figure 2).
2. Related Work
Prior published approaches for recognizing QR Codes in images can be divided into Finder Pattern based location methods [6,7,8,9,10,11,12] and QR Code region based location methods [13,14,15,16,17,18]. The first group locates a QR Code based on the location of its typical Finder Patterns that are present in its three corners. The second group locates the area of a QR Code in the image based on its irregular checkerboard-like structure (a QR Code consists of many small light and dark squares which alternate irregularly and are relatively close to each other).
The shape of the Finder Pattern (Figure 3) was deliberately chosen by the authors of the QR Code, because “it was the pattern least likely to appear on various business forms and the like” [6]. They found out that black and white areas that alternate in a 1:1:3:1:1 ratio are the least common on printed materials.
In [7] (Lin and Fuh), all points matching the 1:1:3:1:1 ratio, horizontally and vertically, are collected. Collected points belonging to one Finder Pattern are merged. Inappropriate points are filtered out according to the angle between three of them.
In [8] (Li et al.), first, a minimal containing region is established by analyzing five runs in labeled connected components, which are compacted using run-length coding. Second, the coordinates of the central Finder Pattern in a QR Code are calculated from the run-length coding using a modified Knuth–Morris–Pratt algorithm.
In [9] (Belussi and Hirata), a two-stage detection approach is proposed. In the first stage, Finder Patterns (located at three corners of a QR Code) are detected using a cascaded classifier trained according to the rapid object detection method (the Viola–Jones framework). In the second stage, geometrical restrictions among the detected components are verified to decide whether subsets of three of them correspond to a QR Code or not.
In [10] (Bodnár and Nyúl), Finder Pattern candidate localization is based on a cascade of boosted weak classifiers using Haar-like features, while the decision whether a Finder Pattern candidate is kept or dropped is made by a geometrical constraint on distances and angles with respect to other probable Finder Patterns. In addition to Haar-like features, local binary pattern (LBP) and histogram of oriented gradients (HOG) based classifiers are trained on Finder Patterns as well as on whole code areas.
In [11] (Tribak and Zaz), successive horizontal and vertical scans are launched to obtain segments whose structure complies with the ratio 1:1:3:1:1. The intersection of a horizontal and a vertical segment represents the central pixel of the extracted pattern. All the extracted patterns are passed to a filtering process based on principal component analysis, which is used as a pattern feature.
In [12] (Tribak and Zaz), the seven Hu invariant moments are applied to the Finder Pattern candidates obtained by an initial scan of the image, and they are compared, using Euclidean metrics, with the Hu moments of reference samples. If the difference is less than an experimentally determined threshold, the candidate is accepted.
In [13] (Sun et al.), the authors introduce an algorithm which aims to locate the QR Code area by detecting the four corners of the 2D barcode. They combine the Canny edge detector with an external contour finding algorithm.
In [14] (Ciążyński and Fabijańska), histogram correlation between a reference image of a QR Code and an input image divided into blocks of size 30 × 30 is used. Candidate blocks are then joined into regions, and morphological erosion and dilation are applied to remove small regions.
In [15] (Gaur and Tiwari), an approach is proposed which uses Canny edge detection followed by morphological dilation and erosion to connect broken edges in a QR Code into a bigger connected component. The QR Code is expected to be the biggest connected component in the image.
In [16,17] (Szentandrási et al.), the property of 2D barcodes of having a regular distribution of edge gradients is exploited. A high-resolution image is split into tiles, and for each tile a histogram of oriented gradients (HOG) is constructed from the orientations of the edge points. Two dominant peaks, roughly 90° apart, are then selected in the histogram. For each tile, a feature vector is computed which contains the normalized histogram, the angles of the two main gradient directions, the number of edge pixels, and an estimate of the probability score of a chessboard-like structure.
In [18] (Sörös and Flörkemeier), areas with a high concentration of edge structures as well as areas with a high concentration of corner structures are combined to obtain QR Code regions.
3. The Proposed Method
Our method is primarily based on searching for Finder Patterns and utilizes their characteristic feature: the 1:1:3:1:1 black and white point ratio in any scanning direction. The basic steps are indicated in the flowchart in Figure 4.
Before searching for a QR Code, the original image (possibly colored) is converted to a gray-scale image using Equation (1), because the color information does not carry any significant additional information that might help in QR Code recognition.
where I stands for the gray level and R, G, B for the red, green, and blue color intensities of individual pixels in the RGB model, respectively. This RGB to gray-scale conversion is an integer approximation of the widely used luminance calculation defined in Recommendation ITU-R BT.601-7:
Y = 0.299 R + 0.587 G + 0.114 B  (2)
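As an illustration only (the authors' implementation was written in Free Pascal), the following sketch performs an integer RGB-to-gray conversion with the BT.601 weights; the scaling by 1000 is an assumption chosen for this example, not necessarily the exact form of Equation (1).

```python
import numpy as np

def rgb_to_gray(rgb: np.ndarray) -> np.ndarray:
    """Integer approximation of the ITU-R BT.601 luminance.

    rgb: uint8 array of shape (H, W, 3). The weights 299/587/114 follow
    BT.601; dividing by 1000 keeps the computation in integer arithmetic.
    """
    r = rgb[..., 0].astype(np.uint32)
    g = rgb[..., 1].astype(np.uint32)
    b = rgb[..., 2].astype(np.uint32)
    return ((299 * r + 587 * g + 114 * b) // 1000).astype(np.uint8)
```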
Next, the gray-scale image is converted to a binary image using modified adaptive thresholding with a window size of 35 (we choose the window size to be at least five times the expected size of a QRC module) [19]. We expect that black points which belong to a QRC will become foreground points.
We use a modification of the well-known adaptive thresholding technique (Equation (3)), which calculates an individual threshold for every point in the image. This threshold is calculated from the average intensity of the points under a sliding window. To speed up the thresholding, we pre-calculate the integral sum image, and we also use a global threshold value (points with an intensity above 180 are always considered background points). Adaptive thresholding can successfully threshold even unevenly illuminated images.
where I is the gray-scale (input) image, B is the binary (output) image, T is the threshold value (individual for each pixel at coordinates x, y), and m is the average of the pixel intensities under a sliding window of size 35 × 35 pixels.
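A minimal sketch of integral-image based adaptive thresholding in the spirit of Equation (3). The 35 × 35 window and the global cut-off at 180 are taken from the text; comparing a pixel directly against the plain window mean (without an extra offset) is an assumption made for brevity.

```python
import numpy as np

def adaptive_threshold(gray: np.ndarray, window: int = 35, global_bg: int = 180) -> np.ndarray:
    """Binarize a gray-scale image; returns True where a pixel is foreground (black).

    The threshold for each pixel is the mean intensity under a window x window
    neighbourhood, obtained from an integral (summed-area) image.
    Pixels brighter than global_bg are always treated as background.
    """
    h, w = gray.shape
    integral = np.zeros((h + 1, w + 1), dtype=np.int64)
    integral[1:, 1:] = np.cumsum(np.cumsum(gray.astype(np.int64), axis=0), axis=1)

    r = window // 2
    ys, xs = np.mgrid[0:h, 0:w]
    y0, y1 = np.clip(ys - r, 0, h), np.clip(ys + r + 1, 0, h)
    x0, x1 = np.clip(xs - r, 0, w), np.clip(xs + r + 1, 0, w)
    window_sum = integral[y1, x1] - integral[y0, x1] - integral[y1, x0] + integral[y0, x0]
    mean = window_sum / ((y1 - y0) * (x1 - x0))

    return (gray < mean) & (gray <= global_bg)
```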
In order to improve the adaptive thresholding results, image pre-processing techniques such as histogram equalization, contrast stretching, or deblurring are worth considering.
3.1. Searching for Finder Patterns
First, the binary image is scanned from top to bottom and from left to right, and we look for successive sequences of black and white points in a row matching the ratios 1:1:3:1:1 (W1:W2:W3:W4:W5, where W1, W3, W5 indicate the numbers of consecutive black points, which are alternated by W2, W4 white points) with a small tolerance (the tolerance is necessary because, due to imperfect thresholding and noise in the Finder Pattern area, the black and white points in a line do not alternate in the ideal 1:1:3:1:1 ratios):
For each match in a row, the coordinates of the Centroid (C) and the Width (W = W1 + W2 + W3 + W4 + W5) of the sequence (of black and white points) are stored in a list of Finder Pattern candidates (Figure 5).
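The following sketch illustrates the row scan described above. The way the tolerance is expressed here (each run within half a module of its expected width) is an assumption; the paper's exact tolerance criterion is not reproduced.

```python
def find_row_candidates(row, tol=0.5):
    """Scan one binary row (True = black) for black/white runs in a 1:1:3:1:1 ratio.

    Returns a list of (centroid_x, total_width) tuples, one per match.
    """
    # Run-length encode the row: (value, length, start_index) for each run.
    runs, start = [], 0
    for i in range(1, len(row) + 1):
        if i == len(row) or row[i] != row[start]:
            runs.append((row[start], i - start, start))
            start = i

    candidates = []
    for k in range(len(runs) - 4):
        values = [runs[k + j][0] for j in range(5)]
        widths = [runs[k + j][1] for j in range(5)]
        if values != [True, False, True, False, True]:
            continue                      # must be black-white-black-white-black
        module = sum(widths) / 7.0        # estimated width of a single module
        expected = [1, 1, 3, 1, 1]
        if all(abs(widths[j] - expected[j] * module) <= tol * module for j in range(5)):
            total = sum(widths)
            candidates.append((runs[k][2] + total / 2.0, total))
    return candidates
```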
Then, Finder Pattern candidates (from the list of candidates) that satisfy the following criteria are grouped:
their centroids C are at most 3/7 W points vertically and at most 3 points horizontally away from each other,
their widths W do not differ by more than 2 points.
We expect that the Finder Pattern candidates in one group belong to the same Finder Pattern, and therefore we set the new centroid C and width W of the group as the average of the x, y coordinates and widths of the nearby Finder Pattern candidates (Figure 6a).
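A simplified sketch of grouping the per-row matches into Finder Pattern candidates using the two criteria above; the greedy "attach to the first compatible group" strategy and the running group averages are assumptions made for brevity.

```python
def group_candidates(matches):
    """matches: per-row matches as (cx, cy, w), in scan order (top to bottom).

    Two matches belong to the same group when their centroids are close
    (<= 3 px horizontally, <= 3/7 * W vertically) and their widths differ
    by no more than 2 px. Returns one averaged (cx, cy, w) per group.
    """
    groups = []                              # each group is a list of (cx, cy, w)
    for cx, cy, w in matches:
        for g in groups:
            gx = sum(p[0] for p in g) / len(g)
            gy = sum(p[1] for p in g) / len(g)
            gw = sum(p[2] for p in g) / len(g)
            if abs(cx - gx) <= 3 and abs(cy - gy) <= 3.0 / 7.0 * gw and abs(w - gw) <= 2:
                g.append((cx, cy, w))
                break
        else:
            groups.append([(cx, cy, w)])
    return [tuple(sum(p[i] for p in g) / len(g) for i in range(3)) for g in groups]
```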
After grouping the Finder Patterns, it must be verified whether there are sequences of black and white points, alternating in the ratio 1:1:3:1:1, also in the vertical direction (Figure 6b). A bounding box around the Finder Pattern candidate, in which the vertical sequences are looked for, is defined as
where C (Cx, Cy) is the centroid and W is the width of the Finder Pattern candidate. We work with a slightly larger bounding box in case the Finder Pattern is stretched vertically. Candidates where no vertical match is found or where the ratio H/W < 0.7 are rejected. For candidates where a vertical match is found, the y coordinate of the centroid C (Cy) is updated as the average of the y coordinates of the centers of the vertical sequences.
3.2. Verification of Finder Patterns
Each Finder Pattern consists of a central black square with a side of 3 units (R1), surrounded by a white frame with a width of 1 unit (R2), surrounded by a black frame with a width of 1 unit (R3). In Figure 7, the regions R1, R2, and R3 are colored in red, blue, and green, respectively. For each Finder Pattern candidate, the Flood Fill algorithm is applied, starting from the centroid C (which lies in the region R1) and continuing through the white frame (region R2) to the black frame (region R3). As the continuous black and white regions are filled, the following region descriptors are incrementally computed:
area (A = M00)
centroid (Cx = M10/M00, Cy = M01/M00), where M00, M10, M01 are raw image moments
bounding box (Top, Left, Right, Bottom)
A Finder Pattern candidate which does not meet all of the following conditions is rejected.
Area(R1) < Area(R2) < Area(R3) and
1.1 < Area(R2)/Area(R1) < 3.4 and
1.8 < Area(R3)/Area(R1) < 3.9
0.7 < AspectRatio(R2) < 1.5 and
0.7 < AspectRatio(R3) < 1.5
|Centroid(R1), Centroid(R2)| < 3.7 and
|Centroid(R1), Centroid(R3)| < 4.2
Note: the criteria were set to be invariant to the rotation of the Finder Pattern, and the acceptance ranges were determined experimentally. In an ideal undistorted Finder Pattern, the criteria are met as follows:
Area(R2)/Area(R1) = 16/9 = 1.8 and Area(R3)/Area(R1) = 24/9 = 2.7
AspectRatio(R1) = 1 and AspectRatio(R2) = 1 and AspectRatio(R3) = 1
Centroid(R1) = Centroid(R2) = Centroid(R3)
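A sketch of the acceptance test using the region descriptors computed during the flood fill; the Region structure and the definition of the aspect ratio as bounding-box width over height are assumptions (the numeric ranges are those listed above).

```python
import math
from dataclasses import dataclass

@dataclass
class Region:
    area: float              # A = M00
    cx: float                # M10 / M00
    cy: float                # M01 / M00
    left: float
    top: float
    right: float
    bottom: float

    def aspect_ratio(self) -> float:
        return (self.right - self.left) / max(self.bottom - self.top, 1e-9)

def centroid_distance(a: Region, b: Region) -> float:
    return math.hypot(a.cx - b.cx, a.cy - b.cy)

def is_valid_finder_pattern(r1: Region, r2: Region, r3: Region) -> bool:
    """Acceptance test for a Finder Pattern candidate (ranges from Section 3.2)."""
    return (
        r1.area < r2.area < r3.area
        and 1.1 < r2.area / r1.area < 3.4
        and 1.8 < r3.area / r1.area < 3.9
        and 0.7 < r2.aspect_ratio() < 1.5
        and 0.7 < r3.aspect_ratio() < 1.5
        and centroid_distance(r1, r2) < 3.7
        and centroid_distance(r1, r3) < 4.2
    )
```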
In real environments there can be damaged Finder Patterns. The inner black region R1 can be joined with the outer black region R3 (Figure 8a), or the outer black region can be interrupted or incomplete (Figure 8b). In the first case, the bounding box of the region R2 is completely contained in the bounding box of the region R1, and in the second case, the bounding box of the region R3 is contained in the bounding box of the region R2. These cases are handled individually. If the first case is detected, then the region R1 is proportionally divided into R1 and R3, and if the second case is detected, then the region R2 is instantiated using the regions R1 and R3.
The Centroid (C) and Module Width (MW) of the Finder Pattern candidate are updated using the region descriptors as follows:
3.3. Grouping of Finder Patterns
3.3.1. Grouping Triplets of Finder Patterns
In the previous steps, Finder Patterns in the image were identified, and now triplets (from the list of all Finder Patterns) that can represent the 3 corners of a QR Code must be selected. A matrix of the distances between the centroids of all Finder Patterns is built, and all 3-element combinations of Finder Patterns are examined. For each triplet, it is checked whether it is possible to construct a right-angled triangle from it so that the following conditions are met:
the size of each triangle side must be in a predefined range
the difference in sizes of two legs must be less than 21
the difference in size of the real and theoretical hypotenuse must be less than 12
In this way, a list of QR Code candidates (each defined by a triplet FP1, FP2, FP3) is built. However, such a QR Code candidate is selected only on the basis of the mutual positions of the 3 FP candidates. As shown in Figure 9, not all QR Code candidates are valid (the dotted red FP3′-FP3″-FP2″ is a false positive). These false positive QR Code candidates will be eliminated in the next steps.
Finally, the Bottom-Left and Top-Right Finder Patterns of the triplet (FP1, FP2, FP3) are determined by using the formula: if (FP3.x − FP2.x)(FP1.y − FP2.y) − (FP3.y − FP2.y)(FP1.x − FP2.x) < 0, then Bottom-Left is FP3 and Top-Right is FP1; otherwise vice versa.
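An illustrative sketch of the triplet test and of the cross-product rule above. The leg and hypotenuse tolerances (21 and 12) are taken from the text; the side range and the choice of the corner Finder Pattern as the vertex opposite the longest side are assumptions of this example.

```python
import math

def dist(a, b):
    return math.hypot(a[0] - b[0], a[1] - b[1])

def try_triplet(fps, side_range=(20.0, 1000.0), leg_tol=21.0, hyp_tol=12.0):
    """fps: three Finder Pattern centroids (x, y), in image coordinates (y axis down).

    Returns (corner, bottom_left, top_right) if the triplet can form the
    right-angled triangle of a QR Code, otherwise None.
    """
    a, b, c = fps
    # The corner Finder Pattern (FP2) is taken as the vertex opposite the longest side.
    options = [(dist(b, c), a, b, c), (dist(a, c), b, a, c), (dist(a, b), c, a, b)]
    hyp, fp2, fp1, fp3 = max(options, key=lambda o: o[0])
    leg1, leg2 = dist(fp2, fp1), dist(fp2, fp3)

    if not all(side_range[0] <= s <= side_range[1] for s in (leg1, leg2, hyp)):
        return None
    if abs(leg1 - leg2) >= leg_tol:
        return None
    if abs(hyp - math.hypot(leg1, leg2)) >= hyp_tol:   # real vs. theoretical hypotenuse
        return None

    # Cross product from Section 3.3.1 decides which leg end is Bottom-Left.
    cross = (fp3[0] - fp2[0]) * (fp1[1] - fp2[1]) - (fp3[1] - fp2[1]) * (fp1[0] - fp2[0])
    bottom_left, top_right = (fp3, fp1) if cross < 0 else (fp1, fp3)
    return fp2, bottom_left, top_right
```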
3.3.2. Grouping Pairs of Finder Patterns
If one of the 3 Finder Patterns of a QR Code is significantly damaged, then this Finder Pattern might not be identified, and two Finder Patterns remain in the Finder Pattern list that were not selected (as the vertices of a right-angled triangle) in the previous step (Figure 10a). The goal of this step is to identify these pairs and determine the position of the third, missing Finder Pattern. A square shape of the QR Code is assumed.
All two-element combinations of the remaining Finder Patterns whose distance is in a predefined interval are evaluated. A pair of Finder Patterns can represent Finder Patterns that are in adjacent corners of the QR Code square (Figure 10a) or in opposite corners. If they are adjacent corners, then there are two possible positions of the QR Code (in Figure 10b depicted as a red square and a green square). If they are opposite corners, then there are another two possible positions of the QR Code.
All four possible positions of the potential QR Code are evaluated against the following criteria:
Is there a Quiet Zone around the bounding square at least 1 MW wide?
Is the density of white points inside the bounding square in the interval (0.4; 0.65)?
Is the density of edge points inside the bounding square in the interval (0.4; 0.6)?
The density of edge points is computed as the number of edge points divided by (area × 2/MW).
The region of a QR Code is expected to have a relatively balanced density of white and black points and a relatively balanced ratio of edges to area.
A square region that meets all the above conditions is considered a candidate for a QR Code. There are two possible corners of the QR Code bounding square where the 3rd Finder Pattern can be located (Figure 10c). For both possible corners, a Finder Pattern match score is computed and the one with the better score is selected (in other words, the question “In which corner is the structure that more closely resembles the ideal Finder Pattern?” must be answered). The match score is computed as
where MS is the match score (lower is better), OS is the overall pattern match score, BS is the black module match score, and WS is the white module match score. BS stores matches only for the expected black points and WS only for the expected white points between the mask and the image. BS and WS were introduced to handle situations when a black or white spot covers part of the Finder Pattern area, which would cause a poor match score if only a simple pattern matching technique were used.
The match score is computed for several Finder Pattern mask positions by moving the mask in a spiral from its initial position up to a radius of MW with a step of MW/2 (to handle small geometric deformations of the QR Code).
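A rough sketch of the corner evaluation: an ideal 7 × 7-module Finder Pattern mask is compared with the binarized image at offsets arranged around the expected corner (approximated here by two rings of offsets with radii MW/2 and MW, rather than a true spiral). The paper's decomposition into OS, BS, and WS is not reproduced; a simple agreement fraction (higher is better) is used instead.

```python
import numpy as np

def ideal_finder_mask(mw: int) -> np.ndarray:
    """Ideal 7x7-module Finder Pattern scaled to mw pixels per module (True = black)."""
    modules = np.zeros((7, 7), dtype=np.uint8)
    modules[0, :] = modules[-1, :] = modules[:, 0] = modules[:, -1] = 1   # outer black ring
    modules[2:5, 2:5] = 1                                                 # 3x3 black centre
    return np.kron(modules, np.ones((mw, mw), dtype=np.uint8)).astype(bool)

def candidate_offsets(mw: float):
    """Offsets around the initial position: centre plus two rings (radius MW/2 and MW)."""
    offsets = [(0.0, 0.0)]
    for ring in (1, 2):
        radius = ring * mw / 2.0
        for k in range(8 * ring):
            angle = 2.0 * np.pi * k / (8 * ring)
            offsets.append((radius * np.cos(angle), radius * np.sin(angle)))
    return offsets

def corner_match_score(binary: np.ndarray, corner_xy, mw: int) -> float:
    """Best agreement between the ideal mask and the image over all tested offsets."""
    mask = ideal_finder_mask(mw)
    size = mask.shape[0]
    best = 0.0
    for dx, dy in candidate_offsets(mw):
        x = int(round(corner_xy[0] - size / 2 + dx))
        y = int(round(corner_xy[1] - size / 2 + dy))
        if x < 0 or y < 0 or y + size > binary.shape[0] or x + size > binary.shape[1]:
            continue
        window = binary[y:y + size, x:x + size]
        best = max(best, float(np.mean(window == mask)))
    return best
```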
3.4. Verification of Quiet Zone
According to ISO standard a Quiet Zone is defined as “a region 4X wide which shall be free of all other markings, surrounding the symbol on all four sides”. So, it must be checked if there are only white points in the image in the rectangular areas wide 1
MW which is parallel to line segments defined by FP
1–FP
2 and FP
2–FP
3 (
Figure 11). For fast scanning of the rectangle points Bresenham’s line algorithm is utilized [
20].
QR Code candidates which do not have quiet zones around them are rejected. Also rejected are QR Code candidates whose outer (larger) bounding box contains the outer (smaller) bounding box of another QR Code candidate.
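A sketch of scanning one quiet zone line with Bresenham's algorithm. Rejecting the candidate as soon as any black pixel is found follows the "only white points" requirement; the option of tolerating a small fraction of black pixels (for noise) is an assumption of this example.

```python
def bresenham(x0, y0, x1, y1):
    """Integer points of the segment (x0, y0)-(x1, y1) using Bresenham's line algorithm."""
    points = []
    dx, dy = abs(x1 - x0), -abs(y1 - y0)
    sx = 1 if x0 < x1 else -1
    sy = 1 if y0 < y1 else -1
    err = dx + dy
    while True:
        points.append((x0, y0))
        if x0 == x1 and y0 == y1:
            break
        e2 = 2 * err
        if e2 >= dy:
            err += dy
            x0 += sx
        if e2 <= dx:
            err += dx
            y0 += sy
    return points

def quiet_zone_line_is_white(binary, p_start, p_end, max_black_fraction=0.0):
    """binary: 2D bool array (True = black). True when the scanned line is (almost) all white."""
    pts = bresenham(int(p_start[0]), int(p_start[1]), int(p_end[0]), int(p_end[1]))
    black = sum(1 for x, y in pts
                if 0 <= y < binary.shape[0] and 0 <= x < binary.shape[1] and binary[y, x])
    return black <= max_black_fraction * len(pts)
```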
3.5. QR Code Bounding Box
The centroids of the 3 Finder Patterns (which represent the QR Code candidate) are the vertices of the triangle FP1–FP2–FP3. This inner triangle must be expanded to the outer triangle P1–P2–P3, whose arms pass through the boundary modules of the QR Code (Figure 12). For instance, the shift of FP3 to P3 may be expressed as
where MW is the module width (Equation (6)).
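The expansion formula itself is not reproduced above. One plausible formulation, assuming that FP2 denotes the corner Finder Pattern, that each Finder Pattern centroid lies 3.5 modules from the two outer edges of the symbol, and that the arms of the outer triangle run along those outer edges, is

P_3 = FP_3 + 3.5 \cdot MW \cdot \left( \frac{FP_3 - FP_2}{\lVert FP_3 - FP_2 \rVert} + \frac{FP_2 - FP_1}{\lVert FP_2 - FP_1 \rVert} \right)

i.e., FP3 is shifted outward by 3.5 module widths along both directions pointing away from the code; if the arms instead pass through the centres of the boundary modules, the factor 3.5 would become 3.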
3.6. Perspective Distortion
For QR Codes without perspective distortion (only shifted, scaled, rotated, or sheared), it is sufficient to have only 3 points to set up the affine transformation from a square to the destination parallelogram. However, for perspective (projective) distorted QR Codes, 4 points are required to set up the perspective transformation from a square to the destination quadrilateral [21].
Some authors (for example [7,22]) search for the Alignment Pattern to obtain the 4th point. However, Version 1 QR Codes do not have an Alignment Pattern, so we have decided not to rely on Alignment Patterns.
Instead, we use an iterative approach to find the sides opposite to P2–P1 and P2–P3 which align with the QR Code borders.
We start from an initial estimate of P4 as the intersection of the lines L1 and L3, where L1 is parallel to P2–P3 and L3 is parallel to P2–P1 (Figure 13a).
We count the number of pixels which are common to the line L3 and the QR Code, separately for each third of the line L3 (Figure 13a).
We shift the line L3 by the width of one module away from the QR Code and again count the number of pixels which are common to the shifted line L3′ and the QR Code. The module width is estimated as MW (from Equation (6)) (Figure 13b).
We compare the overlaps of the line L3 from steps 2 and 3, and
if L3 was wholly inside the QR Code and the shifted L3′ is outside the QR Code, then the initial estimate of P4 is good and we end;
if L3 was wholly inside the QR Code and the 3rd third of the shifted L3′ is again inside the QR Code, then we continue with step 5;
if the 3rd third of L3 was in the quiet zone and the 2nd and 3rd thirds of the shifted L3′ are in the quiet zone, or if the 2nd third of L3 was in the quiet zone and the 1st and 2nd thirds of the shifted L3′ are in the quiet zone, then we continue with step 6.
We start to move the P4 end of the line segment P3–P4 away from the QR Code until the 3rd third of L3 touches the quiet zone (Figure 13c).
We start to move the P4 end of the line segment P3–P4 towards the QR Code until the 3rd third of L3 touches the QR Code.
We apply the same procedure to the line L1 as for the line L3.
The intersection of the shifted lines L1 and L3 is the new P4 position.
Figure 13.
Perspective distortion: (a) initial estimate of the point P4 and lines L1, L3; (b) first shift of the line L3; (c) second shift of the line L3.
Once the position of the 4th point, P4, is obtained, the perspective transformation from the source square representing the ideal QR Code to the destination quadrilateral representing the real QR Code in the image can be set up (Figure 14).
The perspective transformation maps the source coordinates (x, y) to the destination coordinates (u, v) as u = (a·x + b·y + c)/(g·x + h·y + 1) and v = (d·x + e·y + f)/(g·x + h·y + 1), where the transformation coefficients can be calculated from the coordinates of the points P1(u1, v1), P2(u2, v2), P3(u3, v3), and P4(u4, v4).
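A sketch of computing the eight coefficients by solving the standard linear system for the square-to-quadrilateral mapping; taking the unit square as the source and the correspondence of P1 to P4 with its corners are assumptions of this example (the paper maps from the ideal QR Code square, so only the scale and ordering of the source coordinates may differ).

```python
import numpy as np

def perspective_coeffs(dst_pts):
    """Coefficients (a, b, c, d, e, f, g, h) of the mapping

        u = (a*x + b*y + c) / (g*x + h*y + 1)
        v = (d*x + e*y + f) / (g*x + h*y + 1)

    that sends the unit square corners (0,0), (1,0), (1,1), (0,1) to the four
    destination points dst_pts = [(u1, v1), (u2, v2), (u3, v3), (u4, v4)].
    """
    src_pts = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
    rows, rhs = [], []
    for (x, y), (u, v) in zip(src_pts, dst_pts):
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        rhs.append(u)
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        rhs.append(v)
    return np.linalg.solve(np.array(rows, dtype=float), np.array(rhs, dtype=float))

def warp_point(coeffs, x, y):
    """Map a point of the ideal (unit square) QR Code into the image."""
    a, b, c, d, e, f, g, h = coeffs
    denom = g * x + h * y + 1.0
    return (a * x + b * y + c) / denom, (d * x + e * y + f) / denom
```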
It sometimes happens that the estimate of the P4 position is not quite accurate, so we move P4 in a spiral from its initial position (obtained in the previous step) and calculate the match score of the bottom-right Alignment Pattern (the Alignment Pattern exists only in QR Codes of Version 2 and above). For each shift, we recalculate the coefficients of the perspective transformation and also the match score. For the given version of the QR Code, we know the expected position and size of the bottom-right Alignment Pattern, so we can calculate the match between the expected and the real state. The final position of P4 is the one with the highest match score.
An alternative method of handling perspective distorted QR Codes and determining the position of the P4 point is based on the analysis of edge directions and edge projections [23].
3.7. Decoding of a QR Code
A QR Code is a 2D square matrix in which the dark and light squares (modules) represent the bits 1 and 0. In fact, each such module is, at the pixel level, usually made up of a cluster of adjacent pixels. In the QR Code decoding process, we have to build a 2D matrix whose elements have a value of 1 or 0 (Figure 15).
The pixels whose brightness is lower than the threshold are declared as 1 and the others are declared as 0. When analyzing such a cluster of pixels (a module), the central pixel of the module plays the decisive role. If the calculated position of the central pixel does not align with integer coordinates in the image, its brightness is determined by bilinear interpolation.
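A sketch of sampling the module centres with bilinear interpolation; the fixed threshold of 128 and the warp callable (for example, the perspective mapping sketched in Section 3.6) are assumptions of this example.

```python
import numpy as np

def bilinear(gray: np.ndarray, x: float, y: float) -> float:
    """Bilinearly interpolated brightness at a non-integer position (x, y)."""
    x0 = min(max(int(np.floor(x)), 0), gray.shape[1] - 1)
    y0 = min(max(int(np.floor(y)), 0), gray.shape[0] - 1)
    x1 = min(x0 + 1, gray.shape[1] - 1)
    y1 = min(y0 + 1, gray.shape[0] - 1)
    fx, fy = x - x0, y - y0
    top = (1 - fx) * gray[y0, x0] + fx * gray[y0, x1]
    bottom = (1 - fx) * gray[y1, x0] + fx * gray[y1, x1]
    return (1 - fy) * top + fy * bottom

def sample_modules(gray, warp, n_modules, threshold=128):
    """Build the n x n bit matrix of a QR Code.

    warp(x, y) maps a point of the ideal code (unit square) into image
    coordinates. A module is 1 (dark) when the interpolated brightness at
    its centre is below the threshold, otherwise 0.
    """
    bits = np.zeros((n_modules, n_modules), dtype=np.uint8)
    for row in range(n_modules):
        for col in range(n_modules):
            x = (col + 0.5) / n_modules     # module centre in ideal code coordinates
            y = (row + 0.5) / n_modules
            u, v = warp(x, y)
            bits[row, col] = 1 if bilinear(gray, u, v) < threshold else 0
    return bits
```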
Once the binary matrix of 1s and 0s is created, the open-source ZBar library [24] can be used for the final decoding of the binary matrix and to recover the original text encoded in the QR Code.
4. Results
We used a test dataset of 595 QR Code samples to verify the method described in this paper. The testing dataset contained 25 artificial QR Codes of different sizes and rotations, 90 QR Codes from Internet images, and 480 QR Codes from a specific industrial process. Several examples of the testing samples are shown in Figure 16.
In Table 1 and Figure 17, our results are compared to competing QR Code decoding solutions (commercial as well as open-source). The table shows the numbers of correctly decoded QR Codes out of the total of 595 QR Codes.
Our method successfully detected all QR Codes but failed to decode three samples. Two samples were QR Codes placed on a bottle, where perspective and cylindrical distortions were combined. How to deal with this type of combined distortion is a challenge for future research.
As the commercial solutions have closed source code, we performed the testing using the black-box method. We have compiled our own QR Code test dataset (published together with this article as “Supplementary Material”) to evaluate and compare our method, as a standardized universal dataset is not publicly available.
In Table 2, the computational complexity of our algorithm is compared to competing open-source solutions (the commercial solutions were tested online). Our algorithm was implemented in Free Pascal, and the tests were run on an Intel Core i5-4590 3.3 GHz CPU (Intel Corporation, Santa Clara, CA, USA).
We see the main contribution of our method in the way a broken Finder Pattern is dealt with (Section 3.3.2) and in the way the QR Code bounding box is determined (Section 3.6), especially for perspective distorted QR Codes. The presented method can still locate a QR Code if one of the three Finder Patterns is significantly damaged. Consecutive tests in the real manufacturing process showed that this situation occurs much more often than a situation where two or three Finder Patterns are damaged. In order to detect a QR Code with multiple damaged Finder Patterns, it will be necessary to combine Finder Pattern based localization with region based localization.
5. Conclusions
We have designed and tested a computationally efficient method for the precise localization of 2D QR Codes in arbitrary images under various illumination conditions. The proposed method is suitable for low-resolution images as well as for real-time processing. The designed Finder Pattern based localization method uses the three typical patterns of QR Codes to identify three corners of a QR Code in an image. We have also suggested a way to deal with the case where one of the three Finder Patterns is so damaged that it cannot be localized. The input image is binarized and scanned horizontally to localize the Finder Pattern candidates, which are subsequently verified in order to localize the raw QR Code region. For distorted QR Codes, the perspective transformation is set up by gradually approaching the boundary of the QR Code.
This method was validated on the testing dataset consisting of a wide variety of samples (synthetic, real world, and specific industrial samples) and it was compared to competing software. The experimental results show that our method has a high detection rate.
The application of QR Codes and their optical recognition has wide use in the identification, tracing, or monitoring of items in production [30], in storage and distribution processes, in aiding visually impaired and blind people, in letting autonomous robots [31] acquire context-relevant information, in supporting authorization during the log-in process, in supporting electronic payments, in increasing the industrial production surety factor [32], etc.
In cases where the 2D matrix codes have to be placed in a very small area, it may be preferable to use Data Matrix codes [33].