automatically taking photographs satisfying several
basic composition guidelines. Guided by the aesthetic
score, the drone moves gradually toward a view that
satisfies these guidelines. We adopt the downhill
simplex method to search heuristically for the optimal
view. Experiments on human portrait photography
demonstrate the efficiency of our method. Our device
can also photograph any other subject that has clearly
defined features, like the human face.
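The view search summarized above can be illustrated with a minimal sketch of the downhill simplex (Nelder-Mead) method. It is not our flight code: the view is assumed to be a 3D position (x, y, z), `toy_aesthetic_score` is a hypothetical stand-in for the image-based score with an arbitrary best view, and the coefficients (reflection 1, expansion 2, contraction 0.5) are the common textbook choices.

```python
from typing import Callable, List, Sequence


def toy_aesthetic_score(view: Sequence[float]) -> float:
    """Hypothetical score in (0, 1], peaking at an assumed best view."""
    target = (1.0, -0.5, 2.0)  # illustrative optimum, not a measured value
    dist2 = sum((v - t) ** 2 for v, t in zip(view, target))
    return 1.0 / (1.0 + dist2)


def downhill_simplex(cost: Callable[[Sequence[float]], float],
                     start: Sequence[float],
                     step: float = 0.5,
                     max_iter: int = 500,
                     tol: float = 1e-8) -> List[float]:
    """Minimise `cost` with a basic downhill simplex (Nelder-Mead) search."""
    n = len(start)
    # Initial simplex: the start point plus one point offset along each axis.
    simplex = [list(start)]
    for i in range(n):
        p = list(start)
        p[i] += step
        simplex.append(p)
    values = [cost(p) for p in simplex]

    for _ in range(max_iter):
        # Sort vertices from best (lowest cost) to worst.
        order = sorted(range(n + 1), key=lambda k: values[k])
        simplex = [simplex[k] for k in order]
        values = [values[k] for k in order]
        if abs(values[-1] - values[0]) < tol:
            break
        # Centroid of every vertex except the worst one.
        centroid = [sum(p[i] for p in simplex[:-1]) / n for i in range(n)]
        worst = simplex[-1]

        def along(t: float) -> List[float]:
            # t > 0 moves away from the worst vertex, t < 0 moves toward it.
            return [c + t * (c - w) for c, w in zip(centroid, worst)]

        reflected = along(1.0)
        f_r = cost(reflected)
        if f_r < values[0]:
            # Reflection beat the best vertex: try going further.
            expanded = along(2.0)
            f_e = cost(expanded)
            if f_e < f_r:
                simplex[-1], values[-1] = expanded, f_e
            else:
                simplex[-1], values[-1] = reflected, f_r
        elif f_r < values[-2]:
            # Better than the second-worst vertex: accept the reflection.
            simplex[-1], values[-1] = reflected, f_r
        else:
            contracted = along(-0.5)
            f_c = cost(contracted)
            if f_c < values[-1]:
                simplex[-1], values[-1] = contracted, f_c
            else:
                # Nothing helped: shrink every vertex toward the best one.
                best = simplex[0]
                simplex = [best] + [
                    [b + 0.5 * (x - b) for b, x in zip(best, p)]
                    for p in simplex[1:]
                ]
                values = [values[0]] + [cost(p) for p in simplex[1:]]

    best_index = min(range(n + 1), key=lambda k: values[k])
    return simplex[best_index]


def find_best_view(start: Sequence[float]) -> List[float]:
    # Maximising the score is the same as minimising its negative.
    return downhill_simplex(lambda v: -toy_aesthetic_score(v), start)
```

Calling `find_best_view([0.0, 0.0, 0.0])` moves the simplex toward the assumed optimum. On a real drone each cost evaluation requires flying to a vertex and scoring a captured frame, which is why a derivative-free method with few evaluations per step is attractive.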
Subject detection is a prerequisite for our method
to work well. In human portrait photography, face
detection fails if the subject turns their head away
from the camera. The aesthetic score then drops to 0,
and the drone stops its current movement and goes
back to find a higher score. If face detection still
fails, the drone abandons the current search and
starts a new one.
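The recovery behavior described above can be sketched as a small decision rule. This is an illustrative reading of the text rather than our controller: the `Action` names, the `already_backtracked` flag, and the one-score-per-step trace are assumptions introduced for the example.

```python
from enum import Enum
from typing import Iterable, List


class Action(Enum):
    CONTINUE = "continue the current search"
    BACKTRACK = "stop moving and go back toward a higher score"
    RESTART = "abandon the current search and start a new one"


def recovery_action(score: float, already_backtracked: bool) -> Action:
    """Choose the next action from the latest aesthetic score.

    A score of 0 is taken to mean that face detection failed.
    """
    if score > 0.0:
        return Action.CONTINUE
    if not already_backtracked:
        return Action.BACKTRACK
    # Detection still fails after going back: give up on this search.
    return Action.RESTART


def trace_actions(scores: Iterable[float]) -> List[Action]:
    """Replay a sequence of scores and record the action chosen at each step."""
    actions: List[Action] = []
    backtracked = False
    for score in scores:
        action = recovery_action(score, backtracked)
        actions.append(action)
        # Only a backtrack immediately before a failure triggers a restart.
        backtracked = action is Action.BACKTRACK
    return actions
```

For example, the score sequence `[0.6, 0.0, 0.0]` yields continue, backtrack, restart: the first failure sends the drone back, and a second consecutive failure starts a new search.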
We are exploring more rules and cues from practical
photography, such as color, illumination, and
geometry, to make our automatic photographer more
intelligent. Meanwhile, we note that rule-based
aesthetic evaluation is not general enough to capture
the diversity of possible photographs, and many rules
are difficult to quantify. We are trying to overcome
these problems with data-driven methods.
Automatic View Finding for Drone Photography based on Image Aesthetic Evaluation