Processing math: 100%

Lecture 12: Geometric Transforms

Composite Transformations

Often we want to apply more than one transform. If we first wanted to scale an object, we could write:

v2=Sv1

Where v1 is the vector to be scaled, S is the scale matrix, and v2 is the scaled vector.

Then, if we wanted to rotate the object, we would write:

v3=Rv2

Where v3 is the final vector and R is the rotation matrix.

In full, we could write:

v3=R(Sv1)

However, we can also compute the composite matrix RS, then apply this to v1 and get the same result.

v3=(RS)v1

This concept of a composite matrix is essential in graphics applications as it means we can represent the work of two matrices in a single composite matrix.

Thus we can send a single matrix as a uniform for each draw call which can apply multiple transformations to an object.

Note that these transforms are applied right side first!

Thus RSv1 is different from SRv2

Translation

So far we have covered rotation, scale, and shear transformations. There is one type of transformation conspiciously missing from this list: translation (or movement).

This is because we have been looking at matrices M that transform a point in the following form:

x=m1,1x+m1,2y y=m2,1x+m2,2y

But this kind of transform cannot allow us to move locations. As an example, you can’t multiply the point 0,0 with any number to move it. To translate something, we need a transform of the form:

x=x+xt y=y+yt

There is no way to do this with a 2x2 matrix!

What’s the solution? We could keep track of a separate translation… but the goal here is to represent everything with a matrix. Then we can easily composite transformations.

The solution is to move to a higher dimension.

Recall that a 2D shear looked like:

A=[1shear01]

A 3D shear looks like:

A=[10xs01ys001]

Applied to some 3D vector:

[10xs01ys001][xyz]=[x+xszy+yszz]

So we set the third or z component of our vector to 1:

[10xt01yt001][xy1]=[x+xty+yt1]

Thus by using a 3D transformation matrix and a 3D vector with 1 in the z component, we can use a matrix to represent a 2D translation.

This video shows visually how a shear in 3D space works out to be a translate in 2D space:

Affine transformations

By LucasVB (Own work) [Public domain], via Wikimedia Commons

Composite with Translation

For example, we can translate a coordinate by tx,ty and then rotate by an angle of θ by using the following composite matrices:

[cosθsinθ0sinθcosθ0001][10xt01yt001][xy1]

Notice that we’ve added a column of 0,0,1 and a row of 0,0,1 to the rotation matrix from before. This holds true for other 2D transforms. For example, scale:

[sx000sy0001]

and shear-x:

[1shearx0010001]

Positions vs. Vectors

This also gives us a tool for distinguishing between positions and vectors.

We want positions to me affected by translation matrices, of course. But if we have something which is geometrically a vector (e.g. a surface normal), we want that to be scaled and rotated by not translated. We can accomplish this by putting a 0 in the z component instead of a 1.

Thus, for vectors:

[10xt01yt001][xy0]=[xy0]

This extra component is called the homogeneous coordinate. This is because it is typically either 0 or 1.

What about 3D?

Since we used a third component to represent translations in our 2D coordinate system, it should come as no surprise that we used a fourth component to represent translations in a 3D coordinate system:

3D scale:

[sx0000sy0000sz00001]

3D rotate (around Z axis):

[cosθsinθ00sinθcosθ0000100001]