Figure 1, where the skimmer moves from top to
bottom along the ladle. The overview of the proposed
system has two distinctive stages: an image
transformation stage which contains slag distribution
information, and a slag removal path estimation stage
based on deep learning. The main contributions of
this paper are three-fold:
1. propose a learning model that mimics the slag
removal path of a skilled human operators to
estimate its removal path.
2. create a slag distribution image structure that
contains slag distribution information from color
images.
3. estimate the slag removal path in real-time, and
the experimental results verify the performance of
the proposed method.
The structure of this paper is organized as
follows: First, the related work is discussed in Section
2. Section 3 describes the creation of the proposed
slag distribution image and how training data can be
extracted. Section 4 describes the structure and post-
processing of the deep learning model used to predict
the slag removal path. Section 5 discusses about the
working environment and quantitative and qualitative
accuracy analyses through a few experimental results.
Finally, we conclude in Section 6.
2 RELATED WORK
In this section, we briefly discuss previous studies
which are related to automated robots of desk
cleaning, rock excavation, slag removal, and route
prediction. (Kim. J. et al., 2018) proposed a desk
cleaning technique using the iCub humanoid Robot
for cleaning graffiti and lentils from a desk. For a
robot to clean the top of a desk automatically, it must
recognize the material on the desk and estimate the
path to clean it. In this study, a human instructor
teaches the robot how to perform cleaning tasks. Task
Parametrized Gaussian Mixture Model (TP-GMM) is
used to encode the demo variables and to properly
generalize the features. However, the
parameterization of TP-GMM is very difficult
because it requires partitioning and extracting
complex images of small tables. Therefore, while the
instructor demonstrates the cleaning task, a trained
deep neural network is used to extract parameters
from the robot camera image.
(Fukui et al., 2015) discussed about an Automated
Ore Excavator. To carry out autonomous excavation
of rocks, it is necessary to recognize the state of the
fragmented rock piles and plan the appropriate
excavation operation accordingly. They proposed an
imitation-based motion planning method and
developed a rock pile condition recognizer with an
excavation motion planner. To verify the proposed
method, they developed a 1/10 scale excavation
model and conducted excavation experiments.
Experimental results showed that rock piles could be
distinguished according to surface shape and particle
distribution, where the number and the variety of
training data proved important for realizing high
productivity excavation.
(Kim. J. S. et al., 2018) conducted a study to
remove slag using a de-slagging machine. In general,
de-slagging machines can only be controlled by
trained professionals. In their research, they proposed
a method for estimating the slag removal path
automatically using CNN. They trained their network
by extracting block regions based on the actions of an
experienced specialist. They performed backtracking
and curve fitting to properly estimate the removal
path and compared with the path of the experienced
expert.
(Minoura et al., 2018) proposed a path prediction
method that takes target object attributes and physical
environment information into account. Previous path
prediction methods using deep learning architecture
took into account the physical environment of a single
target, such as a pedestrian. However, they proposed
a route prediction method that could consider
multiple target types. The method represents the
attributes as one-hot vectors and encodes the physical
attributes through convolutional layers. Furthermore,
we used relative coordinates as the past motion
history of prediction targets. They verified the
proposed method using the Stanford drone dataset.
3
TRAINING DATA ACQUISITION
3.1 Slag Distribution Image Generation
In slag removal task, an area with high slag
distribution is removed first. The reason is that high
slag distribution means that the slag is concentrated
in the area. By removing dense slag areas, it results in
efficient removal task.
In this section, we discuss the design architecture
of the slag distribution image which is proposed to
train our deep learning network (Figure 2). It consists
of 3 channels: grayscale, morphology, and distance
transform images. In addition, it also contains
information that can be used to distinguish the inside
and outside of the ladle. In our method, we determine
that the slag exists only inside the inner part of the