Parallelized QSplat

Abstract Download Usage Different Threads Performance Further Work References

Abstract

Rusinkiewicz and Levoy present a multiresolution point-based rendering system for large meshes in their paper, QSplat. Their system favors parallelization because multiple processors can process different subtrees of a model�s sphere hierarchy at the same time. In this project, the rendering algorithm of the author's original source code was modified to take advantage of multiprocessing using threads. Benchmarks run on a Dual Pentium 4 Xeon 2.8Ghz using software rendering resulted in an increase of the original performance of QSplat for larger models.

Download

Usage

Download and uncompress .qs format models from the original QSplat homepage.
Download and uncompress the modified QSplat executable for Win32.
Open the modified QSplat executable.
Render using software: Under the Driver menu, choose Software, and then Z-Buffer.
- Currently, the models only get rendered in Z-Buffer software mode because there are threading issues with hardware rendering and other software rendering techniques.
Render Threaded: Under the Options menu, choose Render Threaded

Under the File menu, choose Open, then browse for a downloaded .qs file in step 1.
The model should be displayed.

A file named "output.txt" should be made in the same folder. This file contains the rendering times of the model.

Running Different Number of Threads

Per request, here are a number of images with Parallelized QSplat running with different number of threads rendering only part of the whole Buddha model.

1 Thread

2 Threads

3 Threads

Performance

Test setup

Hardware: Dual HyperThreaded Intel Xeon 2.8Ghz, 1.0 GB of RAM, NVidia Quadro4 980 XGL
Software: Modified QSplat v1.0 rendering models at 1024x768 full screen with software Z-buffer method and multithreading running on Windows XP SP2

Results

	Threading	Splats Rendered	Render time (sec.)	Splats per second	Speed up
Buddha	Non-Threaded	1,466,944	1.151	1,274,495	262%
Buddha	Threaded	1,281,384	0.384	3,336,937	262%
Bunny	Non-Threaded	110,488	0.384	287,729	92%
Bunny	Threaded	102,089	0.384	265,856	92%
Dragon	Non-Threaded	1,707,981	1.342	1,272,713	134%
Dragon	Threaded	1,470,871	0.864	1,702,396	134%
Lion	Non-Threaded	384,780	0.576	668,020	109%
Lion	Threaded	349540	0.480	728,208	109%
Lucy	Non-Threaded	2,101,164	1.534	1,369,728	133%
Lucy	Threaded	1,745,465	0.960	1,818,192	133%

Detailed numerical results can be found here.

Discussion

Looking at the perfomance chart, models with more vertices (i.e. Buddha, Dragon, and Lucy) benefit more from the threading of the QSplat rendering algorithm. Models with less vertices (i.e. Bunny and Lion) do not benefit as much. Rendering the Bunny model using the threaded algorithm actually decreases its performance.

An explanation for this behavior would be that the threaded algorithm has extra overhead when setting up threads to run the rendering. There exists a threshold of vertex points that should be met before an increase in performance is apparent.

Further Work

Currently, the naive implementation of Parallelized QSplat looks at the number of children of the root node of a model's sphere hierarchy and kicks off a thread for each child. This causes a lot of overheaded when using level-of-detail rendering because threads are made each time a model is refined. A better implementation of the parallelization (maybe using a static amount of threads or some other parallelization mechanism) would further increase performance.

Another optimization would consist of finding the number of points such that rendering a smaller model would not cause a decrease in performance. Once found, the rendering algorithm can be changed so that if the number of vertices in a model is less than the found threshold, then a non-threaded branch of the program would render the model.

References

"QSplat" at http://graphics.stanford.edu/software/qsplat. Last accessed 3/14/2005.
Rusinkiewicz, R. and Levoy, M. "QSplat: A Multiresolution Point Rendering System for Large Meshes," SIGGRAPH, 2000.

Kevin Le
kle@calpoly.edu
Last updated 3/14/2005