GPU Cost Estimation for Load Balancing in Parallel Ray Tracing
Biagio Cosenza, Carsten Dachsbacher, Ugo Erra
2013
Abstract
Interactive ray tracing has seen enormous progress in recent years. However, advanced rendering techniques requiring many million rays per second are still not feasible at interactive speed, and are only possible by means of highly parallel ray tracing. When using compute clusters, good load balancing is crucial in order to fully exploit the available computational power, and to not suffer from the overhead involved by synchronization barriers. In this paper, we present a novel GPU method to compute a cost map: a per-pixel cost estimate of the ray tracing rendering process. We show that the cost map is a powerful tool to improve load balancing in parallel ray tracing, and it can be used for adaptive task partitioning and enhanced dynamic load balancing. Its effectiveness has been proven in a parallel ray tracer implementation tailored for a cluster of workstations.
References
- Baxter, III, W. V., Sud, A., Govindaraju, N. K., and Manocha, D. (2002). GigaWalk: Interactive Walkthrough of Complex Environments. In Eurographics workshop on Rendering, EGRW, pages 203-214.
- Benthin, C., Wald, I., and Slusallek, P. (2003). A Scalable Approach to Interactive Global Illumination. Computer Graphics Forum, 22(3):621-630.
- Bigler, J., Stephens, A., and Parker, S. (2006). Design for Parallel Interactive Ray Tracing Systems. In IEEE Symposium on Interactive Ray Tracing, pages 187 - 196.
- Blumofe, R. D. and Leiserson, C. E. (1999). Scheduling multithreaded computations by work stealing. Journal of ACM, 46(5):720-748.
- Budge, B., Bernardin, T., Stuart, J., Sengupta, S., Joy, K., and Owens, J. (2009). Out-of-core Data Management for Path Tracing on Hybrid Resources. In Eurographics.
- Carpenter, L. (1984). The A-buffer, an antialiased hidden surface method. In ACM SIGGRAPH, pages 103-108.
- Chalmers, A., Debattista, K., Sundstedt, V., Longhurst, P., and Gillibrand, R. (2006). Rendering on Demand. In EGPGV, pages 9-17.
- Chalmers, A. and Reinhard, E. (2002). Pratical Parallel Rendering. AKPeters.
- Cosenza, B. (2008). A Survey on Exploiting Grids for Ray Tracing. In Eurographics Italian Chapter Conference, pages 89-96.
- Cosenza, B., Cordasco, G., De Chiara, R., Erra, U., and Scarano, V. (2008). Load Balancing in Mesh-like Computations using Prediction Binary Trees. In Symposium on Parallel and Distributed Computing (ISPDC), pages 139-146.
- Crow, F. C. (1984). Summed-area Tables for Texture Mapping. In 11th annual Conference on Computer Graphics and Interactive Techniques, SIGGRAPH, pages 207-212.
- DeMarle, D. E., Gribble, C. P., Boulos, S., and Parker, S. G. (2005). Memory Sharing for Interactive Ray Tracing on Clusters. Parallel Comput., 31(2):221-242.
- DeMarle, D. E., Gribble, C. P., and Parker, S. G. (2004). Memory-Savvy Distributed Interactive Ray Tracing. In EGPGV, pages 93-100.
- DeMarle, D. E., Parker, S., Hartner, M., Gribble, C., and Hansen, C. (2003). Distributed Interactive Ray Tracing for Large Volume Visualization. In IEEE Symposium on Parallel and Large-Data Visualization and Graphics, PVG, pages 12-.
- Dietrich, A., Stephens, A., and Wald, I. (2007). Exploring a Boeing 777: Ray Tracing Large-Scale CAD Data. IEEE Comput. Graph. Appl., 27(6):36-46.
- Garanzha, K. and Loop, C. T. (2010). Fast Ray Sorting and Breadth-First Packet Traversal for GPU Ray Tracing. Computer Graphics Forum, pages 289-298.
- Georgiev, I. and Slusallek, P. (2008). RTfact: Generic concepts for flexible and high performance ray tracing. In Interactive Ray Tracing, 2008. RT 2008. IEEE Symposium on, pages 115 -122.
- Gillibrand, R., Longhurst, P., Debattista, K., and Chalmers, A. (2006). Cost prediction for global illumination using a fast rasterised scene preview. In AFRIGRAPH, pages 41-48.
- Glassner, A. S. (1989). An Introduction to Ray Tracing. Morgan Kaufmann.
- Grama, A., Karypis, G., Kumar, V., and A., G. (2003). Introduction to Parallel Computing, 2nd edition. Pearson Addison Wesley.
- Hargreaves, S. (2004). Deferred shading. Game Developers Conference Talks.
- Heirich, A. and Arvo, J. (1998). A Competitive Analysis of Load Balancing Strategies for Parallel Ray Tracing. Journal of Supercomputing, 12(1-2):57-68.
- Hensley, J., Scheuermann, T., Coombe, G., Singh, M., and Lastra, A. (2005). Fast summed-area table generation and its applications. Computer Graphics Forum, 24(3):547-555.
- Ize, T., Brownlee, C., and Hansen, C. D. (2011). Real-Time Ray Tracer for Visualizing Massive Models on a Cluster. In EGPGV, pages 61-69.
- Lauterbach, C., Garland, M., Sengupta, S., Luebke, D., and Manocha, D. (2009). Fast bvh construction on gpus. Computer Graphics Forum, pages 375-384.
- Moloney, B., Weiskopf, D., Möller, T., and Strengert, M. (2007). Scalable Sort-First Parallel Direct Volume Rendering with Dynamic Load Balancing. In Symposium on Parallel Graphics and Visualization (EGPGV), pages 45-52.
- Mueller, C. (1995). The sort-first rendering architecture for high-performance graphics. In Symposium on Interactive 3D graphics, I3D, pages 75-ff.
- Muuss, M. J. (1995). Towards real-time ray-tracing of combinatorial solid geometric models. In BRL-CAD Symposium.
- Odom, C. N., Shetty, N. J., and Reiners, D. (2009). Ray Traced Virtual Reality. In ISVC, pages 1031-1042.
- Overbeck, R., Ramamoorthi, R., and Mark, W. (2008). Large ray packets for real-time whitted ray tracing. In Interactive Ray Tracing, 2008. RT 2008. IEEE Symposium on, pages 41 -48.
- Parker, S. G., Bigler, J., Dietrich, A., Friedrich, H., Hoberock, J., Luebke, D., McAllister, D., McGuire, M., Morley, K., Robison, A., and Stich, M. (2010). Optix: a general purpose ray tracing engine. ACM Trans. Graph., 29(4):66:1-66:13.
- Plachetka, T. (2002). Perfect load balancing for demanddriven parallel ray tracing. In International Euro-Par Conference on Parallel Processing, pages 410-419.
- Reinhard, E., Kok, A. J. F., and Chalmers, A. (1998). Cost distribution prediction for parallel ray tracing. In Eurographics Workshop on Parallel Graphics and Visualisation, pages 77-90.
- Reshetov, A., Soupikov, A., and Hurley, J. (2005). Multilevel ray tracing algorithm. In ACM SIGGRAPH, pages 1176-1185.
- Wald, I. (2004). Realtime Ray Tracing and Interactive Global Illumination. PhD thesis, Computer Graphics Group, Saarland University.
- Wald, I., Dietrich, A., and Slusallek, P. (2004). An Interactive Out-of-Core Rendering Framework for Visualizing Massively Complex Models. In Eurographics Symposium on Rendering.
- Wald, I., Slusallek, P., Benthin, C., and Wagner, M. (2001). Interactive Distributed Ray Tracing of Highly Complex Models. In Eurographics Workshop on Rendering Techniques, pages 277-288.
- Yang, J. C., Hensley, J., Grün, H., and Thibieroz, N. (2010). Real-Time Concurrent Linked List Construction on the GPU. Comput. Graph. Forum, 29(4):1297-1304.
- Zhou, K., Hou, Q., Wang, R., and Guo, B. (2008). Realtime KD-tree construction on graphics hardware. In ACM SIGGRAPH Asia, pages 126:1-126:11.
Paper Citation
in Harvard Style
Cosenza B., Dachsbacher C. and Erra U. (2013). GPU Cost Estimation for Load Balancing in Parallel Ray Tracing . In Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2013) ISBN 978-989-8565-46-4, pages 139-151. DOI: 10.5220/0004283401390151
in Bibtex Style
@conference{grapp13,
author={Biagio Cosenza and Carsten Dachsbacher and Ugo Erra},
title={GPU Cost Estimation for Load Balancing in Parallel Ray Tracing},
booktitle={Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2013)},
year={2013},
pages={139-151},
publisher={SciTePress},
organization={INSTICC},
doi={10.5220/0004283401390151},
isbn={978-989-8565-46-4},
}
in EndNote Style
TY - CONF
JO - Proceedings of the International Conference on Computer Graphics Theory and Applications and International Conference on Information Visualization Theory and Applications - Volume 1: GRAPP, (VISIGRAPP 2013)
TI - GPU Cost Estimation for Load Balancing in Parallel Ray Tracing
SN - 978-989-8565-46-4
AU - Cosenza B.
AU - Dachsbacher C.
AU - Erra U.
PY - 2013
SP - 139
EP - 151
DO - 10.5220/0004283401390151