Keywords: Product identification, Keypoint alignment, Neural network, Data augmentation.
Abstract: Traditional shelf auditing is a manual audit. With the development of computer vision and deep learning
technology, it has become possible to use machine automatic image recognition instead of manual auditing.
Existing product identification is based on the use of two-dimensional code recognition and radio frequency
identification (RFID), which relies on hardware and is relatively expensive. The training data of product
identification is difficult to collect. This paper proposes a product identification method based on
convolutional neural network, and explores how to effectively obtain the product data sets. At the same time,
it introduces the unsupervised keypoint detection alignment method for the product detection part, and
proves that it can improve the correct rate of product identification.
1 INTRODUCTION
The initial product identification is achieved by
manual identification, but this method is labor
intensive(Merler M, 2007). The development of
technology has played an important role in
improving the efficiency of product identification.
Product detection and identification is an important
part of smart shelf auditing(Gül Varol, 2014).
Although the Product identification system has a
high recognition accuracy rate but relies on massive
data, there are many difficulties in collecting and
preparing data sets. Similarly, for product
identification, there is no uniform alignment method
for the goods. Most of the related tasks are not
aligned. The existing alignment methods are also
supervised to mark the keypoints first, but the
artificially labeled keypoints are different. The
merchandise is not robust, and the cost of manual
labeling is high. Face recognition is the recognition
after the alignment of the keypoints on the human
face. For the product identification, in the actual
scene, we need to identify the goods with the
rotation angle, but because of the labeling Such data
is very costly, and there is no alignment. However,
the accuracy of the product identification with the
rotation angle is less than the others. It makes sense
to align the items with stable keypoints and then
identify them. This paper proposes a product
identification method based on convolutional neural
network, explores how to effectively obtain product
datasets, compares the application of several data
augmentation methods in the augmentation of
product identification data, and finds an effective
method for data augmentation. Through experiments,
the application of several kinds of target
classification techniques in the field of product
identification was compared. At the same time, for
the product detection part, the unsupervised keypoint
detection alignment method is introduced to pre-
process, and the unsupervised keypoints obtained by
the inclined commodity utilization are aligned to
demonstrate the feasibility of improving the correct
rate of product identification.
2 RELATED RESEARCH
2.1 R-FCN
R-FCN(Jifeng Dai, 2016) solves the contradiction
between the location insensitivity of the
classification network and the sensitivity of
detecting network location. The RFCN proposes a
position sensitive score map (each position sensitive
score map represents a relative position of an object
class. For example, as long as a cat is detected in the
upper right corner of the image, a score map will be
Le, K.