source: GTP/trunk/App/Demos/Illum/Standalone/DepthOfField [DirectX]/ReadMe.txt @ 1467

Revision 1467, 12.9 KB checked in by szirmay, 18 years ago (diff)
Line 
1Depth Of Field demo
2-------------------
3
4Run:
5-----
6Copy DepthOfField.exe (from the Debug or Release directory) to the solution'S directory and run.
7
8Build:
9------
10Simply run and build the solution (DepthOfField.sln)
11
12Controls:
13---------
14Mouse controlls the camera orientation.
15Focal Distance parameter controlls the focal depth of the simulated camera.
16
17The implemented method:
18-----------------------
19
20Depth of Field with Simulation of Circle of Confusion
21
22Multiple rays scattered from a given point on an object will pass through the camera lens, forming the cone of light. If the object is in focus, all rays will converge at a single point on the image plane. However, if a given point on an object in the scene is not near the focal distance, the cone of light rays will intersect the image plane in an area shaped like a conic section. Typically, the conic section is approximated by a circle called the circle of confusion. The circle of confusion diameter b depends on the distance of the plane of focus and lens aperture setting a (also known as f-stop). For a known focus distance and lens parameters, size of the circle of confusion can be calculated as: b = (D * f* (zfocus - z))/(zfocus*(z - f)) where D is a lens diameter D= f / a and f is the focal length of the lens. Any circle of confusion greater than the smallest point a human eye can resolve contributes to the blurriness of the image that we see as a depth of field.
23
24The methods presented in this section are all post-processing methods. It means that they consist of two main phases. In the first phase, the scene is rendered into an off-screen buffer once or more. In the second phase, the final image is computed from the off-screen image buffers and some type of depth controlled blurring.
25
26Phase 1:
27First, the whole scene is rendered by outputting depth and blurriness factor, which is used to describe how much each pixel should be blurred, in addition to the resulting scene rendering color. This can be accomplished by rendering the scene to the multiple buffers at one time. DirectX® 9 has a useful feature called Multiple Render Targets (MRT) that allows simultaneous shader output into the multiple renderable buffers. Using this feature gives us the ability to output all of the data channels (scene color, depth and blurriness factor) in our first pass. One of the MRT restrictions on some hardware is the requirement for all render surfaces to have the same bit depth, while allowing use of different surface formats. Guided by this requirement we can pick the D3DFMT_A8R8G8B8 format for the scene color output and the two-channel texture format D3DFMT_G16R16 format for depth and blurriness factor. Both formats are 32-bits per pixel, and provide us with enough space for the necessary information at the desired precision.
28
29Depth of Field with Simulation of Circle of Confusion
30
31Multiple rays scattered from a given point on an object will pass through the camera lens, forming the cone of light. If the object is in focus, all rays will converge at a single point on the image plane. However, if a given point on an object in the scene is not near the focal distance, the cone of light rays will intersect the image plane in an area shaped like a conic section. Typically, the conic section is approximated by a circle called the circle of confusion. The circle of confusion diameter b depends on the distance of the plane of focus and lens aperture setting a (also known as f-stop). For a known focus distance and lens parameters, size of the circle of confusion can be calculated as: b = (D * f* (zfocus - z))/(zfocus*(z - f)) where D is a lens diameter D= f / a and f is the focal length of the lens. Any circle of confusion greater than the smallest point a human eye can resolve contributes to the blurriness of the image that we see as a depth of field.
32
33The methods presented in this section are all post-processing methods. It means that they consist of two main phases. In the first phase, the scene is rendered into an off-screen buffer once or more. In the second phase, the final image is computed from the off-screen image buffers and some type of depth controlled blurring.
34
35Phase 1:
36First, the whole scene is rendered by outputting depth and blurriness factor, which is used to describe how much each pixel should be blurred, in addition to the resulting scene rendering color. This can be accomplished by rendering the scene to the multiple buffers at one time. DirectX® 9 has a useful feature called Multiple Render Targets (MRT) that allows simultaneous shader output into the multiple renderable buffers. Using this feature gives us the ability to output all of the data channels (scene color, depth and blurriness factor) in our first pass. One of the MRT restrictions on some hardware is the requirement for all render surfaces to have the same bit depth, while allowing use of different surface formats. Guided by this requirement we can pick the D3DFMT_A8R8G8B8 format for the scene color output and the two-channel texture format D3DFMT_G16R16 format for depth and blurriness factor. Both formats are 32-bits per pixel, and provide us with enough space for the necessary information at the desired precision.
37
38The implementation of the first phase: The pixel shader of the scene rendering pass needs to compute the blurriness factor and output it along with the scene depth and color. To abstract from the different display sizes and resolutions, the blurriness is defined to lie in the 0..1 range. A value of zero means the pixel is perfectly sharp, while a value of one corresponds to the pixel of the maximal circle of confusion size. The reason behind using 0..1 range is twofold. First, the blurriness is not expressed in terms of pixels and can scale with resolution during the post-processing step. Second, the values can be directly used as sample weights when eliminating “bleeding” artifacts.
39For each pixel of a scene, this shader computes the circle of confusion size based on the formula provided in the preceding discussion of the thin lens model. Later in the process, the size of the circle of confusion is scaled by the factor corresponding to size of the circle in pixels for a given resolution and display size. As a last step, the blurriness value is divided by maximal desired circle of confusion size in pixels (variable maxCoC) and clamped to the 0..1 range. Sometimes it might be necessary to limit the circle of confusion size (through the variable maxCoC) to reasonable values (i.e. 10 pixels) to avoid sampling artifacts caused by an insufficient number of filter taps.
40
41struct VS_INPUT {
42        float4 Position  : POSITION;
43        float3 Normal    : NORMAL;                                     
44        float3 Binormal  : BINORMAL;
45        float3 Tangent   : TANGENT;
46        float4 TexCoord0 : TEXCOORD0;
47};
48
49struct VS_OUTPUT {
50        float4 hPosition : POSITION;     // point in normalized device space before homogeneous division
51        float2 TexCoord  : TEXCOORD0;    // texture coordinates
52        float3 tView     : TEXCOORD1;    // tangent space view vector
53        float3 tLight    : TEXCOORD2;    // tangent space light vector
54        float  Depth    : TEXCOORD3;
55};
56
57struct PS_OUTPUT {
58        float4 Color : COLOR0;
59        float4 Depth : COLOR1;
60};
61
62VS_OUTPUT BumpVS(VS_INPUT IN)
63{
64        VS_OUTPUT output;
65   
66        // object-space tangent matrix
67        float3x3 Tan = float3x3(normalize(IN.Tangent), normalize(IN.Binormal), IN.Normal);
68        // position in view-space
69        float3 P = mul(IN.Position, WorldView);
70        // model-space view vector       
71        float3 mView = mCameraPos - IN.Position;
72        // model-space light vector
73        float3 mLight = mLightPos - IN.Position;
74        // tangent-space view vector
75        output.tView = mul(Tan, mView);
76        // tangent-space light vector
77        output.tLight =  mul(Tan, mLight);
78        // vertex position before homogenious division
79        output.hPosition = mul(IN.Position, WorldViewProj);
80        // tex coordinates passed to pixel shader
81        output.TexCoord = IN.TexCoord0;
82        output.Depth = output.hPosition.z;
83
84        return output;
85}
86
87
88PS_OUTPUT BumpPS(VS_OUTPUT IN)
89{
90        PS_OUTPUT output;
91        // needs normalization because of linear interpolation
92        float3 View = normalize( IN.tView );
93        // needs normalization because of linear interpolation   
94        float3 Light = normalize( IN.tLight );
95        // get tangent-space normal from normal map
96        float3 Normal = tex2D(BumpMapSampler, IN.TexCoord).rgb;
97        // illumination calculation
98        output.Color = Illumination(Light, Normal, View, IN.TexCoord, Attenuation(IN.tLight));
99   
100        float blur = saturate(abs(IN.Depth - focalDist) * focalRange);
101        output.Depth = float4(IN.Depth, blur, 0, 0);
102
103        return output;
104}
105
106
107Phase 2:
108During the post-processing phase, the results of the previous rendering are processed and the color image is blurred based on the blurriness factor computed in the first phase. Blurring is performed using a variable-sized filter representing the circle of confusion. To perform image filtering, a simple screen-aligned quadrilateral is drawn, textured with the results of the first phase.
109The filter kernel in the post-processing step has 13 samples - a center sample and 12 outer samples. The number of samples can vary but 12 represents the maximum number of samples that can be processed by a 2.0 pixel shader in a single pass.
110The post-processing pixel-shader computes filter smaple positions based on a 2D offset stored in filterTaps array, initialized by the following function:
111
112void SetupFilterKernel()
113{
114        FLOAT dx = 1.0f / (FLOAT)m_ScreenWidth;
115        FLOAT dy = 1.0f / (FLOAT)m_ScreenHeight;
116
117        D3DXVECTOR4 v[12];
118        v[0]  = D3DXVECTOR4(-0.326212f * dx, -0.405805f * dy, 0.0f, 0.0f);
119        v[1]  = D3DXVECTOR4(-0.840144f * dx, -0.07358f * dy, 0.0f, 0.0f);
120        v[2]  = D3DXVECTOR4(-0.695914f * dx, 0.457137f * dy, 0.0f, 0.0f);
121        v[3]  = D3DXVECTOR4(-0.203345f * dx, 0.620716f * dy, 0.0f, 0.0f);
122        v[4]  = D3DXVECTOR4(0.96234f * dx, -0.194983f * dy, 0.0f, 0.0f);
123        v[5]  = D3DXVECTOR4(0.473434f * dx, -0.480026f * dy, 0.0f, 0.0f);
124        v[6]  = D3DXVECTOR4(0.519456f * dx, 0.767022f * dy, 0.0f, 0.0f);
125        v[7]  = D3DXVECTOR4(0.185461f * dx, -0.893124f * dy, 0.0f, 0.0f);
126        v[8]  = D3DXVECTOR4(0.507431f * dx, 0.064425f * dy, 0.0f, 0.0f);
127        v[9]  = D3DXVECTOR4(0.89642f * dx, 0.412458f * dy, 0.0f, 0.0f);
128        v[10] = D3DXVECTOR4(-0.32194f * dx, -0.932615f * dy, 0.0f, 0.0f);
129        v[11] = D3DXVECTOR4(-0.791559f * dx, -0.597705f * dy, 0.0f, 0.0f);
130
131        g_pPostEffect->SetVectorArray("filterTaps", v, 12);
132}
133
134One of the problems with all post-filtering methods is leaking or "bleeding" of color from sharp objects onto the blurry backgrounds. This results in faint halos around sharp objects. The color leaking happens because the filter for the blurry background will sample color from the sharp object in the vicinity due to the large filter size. To solve this problem, we will discard the outer samples that can contribute to leaking according to the following criteria: if the outer sample is in focus and it is in front of the blurry center sample, it should not contribute to the blurred color. This can introduce a minor popping effect when objects go in or out of focus. To combat sample popping, the outer sample blurriness factor is used as a sample weight to fade out its contribution gradually.
135
136The vertex shader implementation is feeded with a fullscreen quad without texture coordinates. Those are calculated in the vertex shader and passed to the pixel shader.
137
138struct VS_INPUT {
139                float3 Position : POSITION;
140};
141
142struct VS_OUTPUT {
143        float4 hPosition : POSITION;     // point in normalized device space before homogeneous division
144        float2 TexCoord  : TEXCOORD0;
145};
146
147VS_OUTPUT DepthVS(VS_INPUT IN)
148{
149        VS_OUTPUT output;
150   
151    output.hPosition = half4(IN.Position, 1);
152   
153    output.TexCoord = IN.Position.xy * 0.5f + 0.5f;
154    output.TexCoord.y *= -1;
155
156    return output;
157}
158//------------------------------------------------------------------------------------
159//
160// DoF pixel shader
161//
162//------------------------------------------------------------------------------------
163const float maxCoC = 5;
164float4 DepthPS(VS_OUTPUT IN) : COLOR
165{
166        // Get center sample
167        float4 colorSum = tex2D(ColorMapSampler, IN.TexCoord);
168        float2 centerDepthBlur = tex2D(DepthMapSampler, IN.TexCoord);
169
170        // Compute CoC size based on blurriness
171        float sizeCoC = centerDepthBlur.y * maxCoC;
172
173        float totalContribution = 1.0f;
174
175        // Run through all taps
176        for (int i = 0; i < NUM_DOF_TAPS; i++)
177        {
178                // Compute tap coordinates
179                float2 tapCoord = IN.TexCoord + filterTaps[i] * sizeCoC;
180
181                // Fetch tap sample
182                float4 tapColor = tex2D(ColorMapSampler, tapCoord);
183                float2 tapDepthBlur = tex2D(DepthMapSampler, tapCoord);
184
185                // Compute tap contribution
186                float tapContribution = (tapDepthBlur.x > centerDepthBlur.x) ? 1.0f : tapDepthBlur.y;
187
188                // Accumulate color and contribution
189                colorSum += tapColor * tapContribution;
190                totalContribution += tapContribution;
191        }
192
193        // Normalize to get proper luminance
194        float4 finalColor = colorSum / totalContribution;
195
196        return finalColor;
197}
198
199
Note: See TracBrowser for help on using the repository browser.