Changeset 252 for trunk/VUT/doc
- Timestamp:
- 08/26/05 14:56:09 (19 years ago)
- Location:
- trunk/VUT/doc/SciReport
- Files:
-
- 3 added
- 1 edited
Legend:
- Unmodified
- Added
- Removed
-
trunk/VUT/doc/SciReport/online.tex
r251 r252 42 42 connectivity, occlusion by few closest depth layers). 43 43 44 \begin{figure}%[htb] 45 %\centerline 46 \centering \footnotesize 47 \begin{tabular}{c} 48 \includegraphics[width=0.5\textwidth, draft=\DRAFTFIGS]{images/ogre_terrain} \\ 49 %\hline 50 \includegraphics[width=0.25\textwidth, draft=\DRAFTFIGS]{images/vis_viewfrustum} \hfill \includegraphics[width=0.25\textwidth, draft=\DRAFTFIGS]{images/vis_chc} \\ 51 \end{tabular} 52 \label{tab:averages} 53 \caption{Top row: The rendered scene. Bottom row: The visualizion of the rendering overdraw. 54 Using view frustum culling (left image) vs. occlusion queries (right image). 55 The yellow boxes show the actually rendered scene objects. The 56 red boxes depict the view frustum culled hierarchy nodes, the blue boxes depict the 57 occlusion query culled hierarchy nodes.} 58 \label{fig:online_culling_example} 59 \end{figure} 60 44 61 Recent graphics hardware natively supports an \emph{occlusion query} 45 62 to detect the visibility of an object against the current contents of the … … 58 75 existing real-time rendering packages on architectures supporting the 59 76 underlying occlusion query. 60 77 In figure~\ref{fig:online_culling_example}, the same scene (top row) is rendered using view frustum 78 culling (visualization in the bottom left image) versus online culling using occlusion queries (visualization 79 in the bottom right image). We can clearly see the large overdraw that happens for view frustum culling. 61 80 %Using spatial and assuming temporal coherence 62 81 … … 661 680 wait till the result of the query becomes available. 662 681 663 \subsection{Initial depth pass}664 \label{sec:initial}665 666 To achieve maximal performance on modern GPU's, one has to take care of a number of issues.667 First, it is very important to reduce material switching. Thus modern rendering engines sort the668 objects (or patches) by materials in order to eliminate the material switching as good as possible.669 670 Next, materials can be very costly, sometimes complicated shaders have to be evaluated several times per batch. Hence it should be671 avoided to render the full material for fragments which eventually turn out to be occluded. This can be achieved672 by rendering an initial depth pass (i.e., enabling only depth write to fill the depth673 buffer). Afterwards the geometry is rendered again, this time with full shading. Because the674 depth buffer is already established, invisible fragments will be discarded before any shading is done calculated.675 676 This approach can be naturally adapted for use with the CHC algorithm. Only an initial depth pass is rendered677 in front-to-back order using the CHC algorithm. The initial pass is sufficient to fill the depth buffer and determine the visible geometry. Then only the visible geometry is rendered again, exploiting the full optimization and material sorting capability of the rendering engine.678 679 \subsection{Batching multiple queries}680 682 \begin{figure}%[htb] 681 683 {\footnotesize … … 685 687 \label{fig:pseudocode_batching} 686 688 \end{figure} 689 690 \subsection{Initial depth pass} 691 \label{sec:initial} 692 693 To achieve maximal performance on modern GPU's, one has to take care of a number of issues. 694 First, it is very important to reduce material switching. Thus modern rendering engines sort the 695 objects (or patches) by materials in order to eliminate the material switching as good as possible. 696 697 Next, materials can be very costly, sometimes complicated shaders have to be evaluated several times per batch. Hence it should be avoided to render the full material for fragments which eventually turn out to be occluded. This can be achieved 698 by rendering an initial depth pass (i.e., enabling only depth write to fill the depth 699 buffer). Afterwards the geometry is rendered again, this time with full shading. Because the 700 depth buffer is already established, invisible fragments will be discarded before any shading is done calculated. 701 702 This approach can be naturally adapted for use with the CHC algorithm. Only an initial depth pass is rendered 703 in front-to-back order using the CHC algorithm. The initial pass is sufficient to fill the depth buffer and determine the visible geometry. Then only the visible geometry is rendered again, exploiting the full optimization and material sorting capability of the rendering engine. 704 705 If the materials requires several rendering passes, we can use a variant of the depth pass method. We render 706 only the first passes using the algorithm (e.g., the solid passes), determining the visibility of the patches, and 707 render all the other passes afterwards. This approach can be used when there are passes which require a special 708 kind of sorting to be rendered correctly (e.g., transparent passes, shadow passes). 709 710 711 712 \subsection{Batching multiple queries} 713 \label{sec:batching} 714 687 715 When occlusion queries are rendered interleaved with geometry, there is always a state change involved. To 688 716 reduce state changes, it is beneficial not to execute one query at a time, but multiple queries at once~\cite{Staneker:2004:EMO}. 689 717 Instead of immediately executing a query for a node when we fetch it from the traversal stack, we add it to the {\em pending queue}. If {\em n} of these queries are accumulated in the queue, we can execute them at once. 690 To optain an optimal value for {\em n}, several some heuristics can be applied, e.g., a fraction of the number of queries issued in the last frame. The new pseudo-code of the algorithmis given in Figure~\ref{fig:pseudocode_batching}.718 To optain an optimal value for {\em n}, several some heuristics can be applied, e.g., a fraction of the number of queries issued in the last frame. The pseudo-code of the algorithm including the batching is given in Figure~\ref{fig:pseudocode_batching}. 691 719 692 720
Note: See TracChangeset
for help on using the changeset viewer.