Detection of Imaged Objects with Estimated Scales
Xuesong Li, Ngaiming Kwok, Jose E. Guivant, Karan Narula, Ruowei Li, Hongkun Wu
2019
Abstract
Dealing with multiple sizes of the object in the image has always been a challenge in object detection. Predefined multi-size anchors are usually adopted to address this issue, but they can only accommodate a limited number of object scales and aspect ratios. To cover a wider multi-size variation, we propose a detection method that utilizes depth information to estimate the size of anchors. To be more specific, a general 3D shape is selected, for each class of objects, that represents different sizes of 2D bounding boxes in the image according to the corresponding object depths. Given these 2D bounding boxes, a neural network is used to classify them into different categories and do the regression to obtain more accurate 2D bounding boxes. The KITTI benchmark dataset is used to validate the proposed approach. Compared with the detection method using pre-defined anchors, the proposed method has achieved a significant improvement in detection accuracy.
DownloadPaper Citation
in Harvard Style
Li X., Kwok N., Guivant J., Narula K., Li R. and Wu H. (2019). Detection of Imaged Objects with Estimated Scales. In Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP; ISBN 978-989-758-354-4, SciTePress, pages 39-47. DOI: 10.5220/0007353600390047
in Bibtex Style
@conference{visapp19,
author={Xuesong Li and Ngaiming Kwok and Jose E. Guivant and Karan Narula and Ruowei Li and Hongkun Wu},
title={Detection of Imaged Objects with Estimated Scales},
booktitle={Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP},
year={2019},
pages={39-47},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0007353600390047},
isbn={978-989-758-354-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the 14th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications (VISIGRAPP 2019) - Volume 5: VISAPP
TI - Detection of Imaged Objects with Estimated Scales
SN - 978-989-758-354-4
AU - Li X.
AU - Kwok N.
AU - Guivant J.
AU - Narula K.
AU - Li R.
AU - Wu H.
PY - 2019
SP - 39
EP - 47
DO - 10.5220/0007353600390047
PB - SciTePress