Orthogonal Illumination Maps

Featured Articles/Orthogonal Illumination Maps

Home

Featured Articles

Recent News

About OpenGL

Apps & Hardware

Downloads

Developer Documentation

Coding Tutorials &
Techniques

License & Logos

Discussion Forums

Feedback/Support

Orthogonal Illumination Maps - Cass Everitt of Objectecture
Note: This paper was written for OpenGL.org in Aug '99. Copyright is to the author

Introduction

Put simply, orthogonal illumination mapping is a texture-based multi-pass rendering technique for performing hardware accelerated lighting calculations per-pixel rather than per-vertex. It has a number of advantages: it is simple, geometry-independent, and fast on today's commodity graphics cards. Rather than go directly into the technique, let me motivate the problem a bit.

Motivation

How many times have you rendered your terrain database at full geometric resolution and been stunned by the picture, but dismayed that it took 20 seconds per frame? It has happened to me a lot. :-) There are a number of excellent techniques for mesh simplification that seek to reduce the number of triangles we have to render, but this typically has a direct and immediate impact on the quality of the lighting in OpenGL. Even small variations in height can produce wonderful visual cues when illuminated (ala bump-mapping), but these small variations are typically the first to go in mesh simplification.

Since OpenGL lighting is performed per-vertex, we have a real problem reducing the geometry without negative impact on the lighting.

Well, that leaves us with a few alternatives:

live with 20 seconds per frame (with stunning image quality)
simplify the mesh to achieve an acceptable trade-off between image quality and frame rate
use orthogonal illumination mapping and simplify your mesh without impact on the lighting [hint: choose this one]

Note: This discussion will focus on terrain data, and illumination by a single infinite light source because that is what I have source code demonstrating. The technique is trivially applicable on other (easily texturable) geometries, but somewhat less trivially applicable on local light sources as some calculations must then be performed per-vertex rather than per-frame.

The Technique

Assuming you chose the third option above, you'll want to know how to go about implementing it. I guess we might as well get to some of the bad news at this point. This technique (as I have implemented it) requires 1 pass for ambient, 6 passes for diffuse, and 1 pass to blend in a color texture. Don't panic! Even with 8 passes, it's still a LOT faster than rendering all the geometry. In this new world order of multitexturing, many of these passes can be collapsed into a single pass. With the worst news safely behind us, let's see the technique. First with a simple picture equation, then in prose.

Composite

=

Ambient

+

X Component of Illumination

+

Y Component of Illumination

+

Z Component of Illumination

Note: The images illustrating the X, Y, and Z component of illumination actually include the ambient pass. This is because the components are signed -- darker than ambient implies a negative contribution.

So the basic idea is to perform diffuse illumination (L dot N) just like you would do per-vertex using OpenGL lighting, but instead, do it per pixel. The components of L are premultiplied with the light color and the material properties and specified as the current color, while the components of N are each turned into textures. The dot product is performed by using a TexEnv of MODULATE for the multiplicative operations and blending for the (signed) additive operations. This has the desired effect of moving the whole dot product calculation into the fragment processing portion of the OpenGL machine.

The trickiest part of this technique is the fact that the dot product is the sum of signed values, and textures and colors are unsigned in OpenGL. This requires careful formulation of the equation to avoid negative values. It is helpful to think first of how this technique would be implemented if signed colors, textures, and framebuffer contents were allowed, then work backwards to try to achieve the same result given our actual constraints. So if the burden of unsignedness were lifted, we would formulate this dot product by defining a signed luminance texture for each of the normal components, and the algorithm would look something like this:

If OpenGL colors, textures, and color buffer held signed values:

Disable BLEND, Disable LIGHTING, Enable TEXTURE_2D, set BlendFunc to (ONE, ONE)
set color to (Mdiffuse*Ldiffuse*Lx), set active texture to Nx, render geometry
set DepthFunc to EQUAL, DepthMask to FALSE, Enable BLEND
set color to (Mdiffuse*Ldiffuse*Ly), set active texture to Ny, render geometry
set color to (Mdiffuse*Ldiffuse*Lz), set active texture to Nz, render geometry
clamp the framebuffer contents to [0,1]
Disable TEXTURE_2D, set color to ambient color, render geometry
set DepthFunc to LESS, set DepthMask TRUE

So the technique is pretty straightforward for this hypothetical variation of OpenGL. The task now is to come up with a method for getting the same result with the real OpenGL. First, we know we cannot have signed textures, so instead of signed luminance textures of Nx, Ny, and Nz, we use unsigned luminance textures of NposX, NnegX, NposY, NnegY, NposZ, and NnegZ. Next, we cannot have negative color buffer contents, so we need to make certain to perform all operations that have an additive effect on the color buffer first and all operations that have a subtractive effect on the color buffer last. We also have to deal with the signedness of Lx, Ly, and Lz, but at this point, let's assume a simple case of Lx, Ly, Lz > 0, and see what the order of operations would be to generate the composite shown above.

If EXT_blend_subtract is supported:

Disable BLEND, Disable LIGHTING, Enable TEXTURE_2D, set BlendFunc to (ONE, ONE)
set color to (Mdiffuse*Ldiffuse*Lx), set active texture to NposX, render geometry
set DepthFunc to EQUAL, DepthMask to FALSE, Enable BLEND
set color to (Mdiffuse*Ldiffuse*Ly), set active texture to NposY, render geometry
set color to (Mdiffuse*Ldiffuse*Lz), set active texture to NposZ, render geometry
set BlendEquationEXT to FUNC_REVERSE_SUBTRACT_EXT
set color to (Mdiffuse*Ldiffuse*Lx), set active texture to NnegX, render geometry
set color to (Mdiffuse*Ldiffuse*Ly), set active texture to NnegY, render geometry
set color to (Mdiffuse*Ldiffuse*Lz), set active texture to NnegZ, render geometry
Disable TEXTURE_2D, set color to ambient color, render geometry
set BlendEquationEXT to FUNC_ADD_EXT, set DepthFunc to LESS, set DepthMask TRUE

If EXT_blend_subtract is NOT supported: (try to fake it)

Disable BLEND, Disable LIGHTING, Disable TEXTURE_2D, set BlendFunc to (ONE, ONE)
set color to ambient color, render geometry
set DepthFunc to EQUAL, DepthMask to FALSE, Enable BLEND, Enable TEXTURE_2D
set color to (Mdiffuse*Ldiffuse*Lx), set active texture to NposX, render geometry
set color to (Mdiffuse*Ldiffuse*Ly), set active texture to NposY, render geometry
set color to (Mdiffuse*Ldiffuse*Lz), set active texture to NposZ, render geometry
set BlendFunc to (ZERO, ONE_MINUS_SRC_COLOR)
set color to (Mdiffuse*Ldiffuse*Lx), set active texture to NnegX, render geometry
set color to (Mdiffuse*Ldiffuse*Ly), set active texture to NnegY, render geometry
set color to (Mdiffuse*Ldiffuse*Lz), set active texture to NnegZ, render geometry
set BlendFunc to (ONE, ONE), set DepthFunc to LESS, set DepthMask TRUE

That's really all there is to it! Well, for this simple case, anyway. So now what happens when Lx (or Ly or Lz) is negative? Well, you obviously have to set the color to (Mdiffuse*Ldiffuse*(-Lx)) to avoid specifying a negative color. The other important effect is that when Lx is negative, the product of Lx and NposX is negative, so it becomes a subtractive pass. Similarly, Lx * NnegX is then positive, and so becomes an additive pass.

It should also be noted that the ambient pass comes first when faking the subtractive blend. It is done to help minimize the error of this approximation.

Also, the technique here is not adapted for multitexuring, but such adaptations would not be difficult for a given set of functionality (ie number of texture units, tex_env extensions, and blending modes).

Results

All the illustrations here were generated using the NVIDIA drivers for linux on a Riva TNT. The demonstration code is GLUT and has been verified to work under Irix and Win32 as well. Because the Riva TNT does not support subtractive blend, all of the examples here were created using the hack mentioned earlier. If you do the math, you will discover that this hack causes the darker areas to be a little lighter than they would otherwise be. I have also verified this visually on SGI machines (which support subtractive blend) and by toggling the demo program between OpenGL lighting and orthogonal illumination mapping.

The demo program uses etopo5 (earth topography database, 5 minute resolution). The textures are generated at full resolution while the geometry is subsampled by 16 in both I and J. Obviously a more sophisticated mesh simplifier would produce better results, but the naive mesh simplification is still valid (and very easy to implement).

So, we will start with some subjective comparisons on images produced by the demo program and then look at some of the numbers.

glPolygonMode(GL_FILL)

glPolygonMode(GL_LINE)

These illustrations show the actual geometric complexity of the scene contrasted with the perceived illuminated complexity.

original light position

light moved

This example shows the effect of moving the light while keeping the geometry and view constant.

geometry position 1

geometry position 2

This example shows the effect of moving the geometry while keeping the light direction and view position constant.

full geometry lit with OIM

full geometry lit with OpenGL

simple geometry lit with OIM

simple geometry lit with OpenGL

simple geometry wireframe

full geometry wireframe

These comparisons show various aspects of OIM versus OpenGL lighting.

Now for the numbers. These results were generated on a couple of machines I have at home. Machine A is a celeron 300A that is overclocked to 450 MHz. It has a Diamond Viper 770 (TNT2) board with 32 MB of memory. It also has 128 MB of main memory. Machine B is a pentium 200MMX with a Creative Graphics Blaster (TNT) board with 16 MB of memory. It has 96 MB of main memory. Values in the table are frames per second.

Linux,
16bpp,
Machine A Linux,
16bpp,
Machine B Win95,
16bpp,
Machine A Win95,
32bpp,
Machine A

OIM

7-pass
24.7 13.7 42.5 23.6

6-pass
28.3 15.9 49.6 27.4

OpenGL Lighting

simple geometry
107.5 61.7 252.4 130.2

full geometry
0.58 0.19 2.2 2.2

The 6-pass OIM test removes the pass for NnegZ as a simplification for height field data. The final pass for blending in the color texture is omitted from these tests as well.

Downloads

If, after all this, you're actually interested enough to run the demo program for yourself and maybe tinker with the source a bit, you'll need to download it from one of the following links:

Source with Etopo5: (11 MB)
        thesis.zip from preferred high-bandwidth server: [http] [ftp]
        thesis.zip from lower-bandwidth server: [http] [ftp]

        NOTE: WinZip corrupts the etopo5.bin file in the .tar.gz archive. If you are using WinZip, get the .zip file!

        thesis.tar.gz from preferred high-bandwidth server: [http] [ftp]
        thesis.tar.gz from lower-bandwidth server: [http] [ftp]

For the source only: (really small)
        thesis-source.tar.gz
        thesis-source.zip

Conclusion

This document was thrown together rather quickly, but perhaps it will serve well enough as an introduction to Orthogonal Light Maps for now. It will undoubtedly undergo some iterations to improve the presentation, correct errors and refine the techniques. Until I have more information available, please feel free to contact me at cass@r3.nu if you have specific questions or interests regarding this technique.

As you might have surmised by the link names above, I am trying to synthesize this work into a thesis, so if you have any information about similar techniques please drop me an email.

Acknowledgements

A big thanks to Mark Kilgard for some corrections and refinements to the technique as presented in the original posting of this article. (You can find the specifics of his enhancements in the demo source.) Thanks also to Michael Gold and David Gould for helping me verify the Win32 versions worked. Finally, thanks to Robert Moorhead who got me started in computer graphics and is guiding me through the development and documentation of this work.