Introduction
When you’re making a game engine, you are constantly multiplying and constructing transformation matrices; chances are, you ran by
the subject of row/column-major along the way and possibly got confused about some things. In this post, I’d like to make things clear and share with you what I learned when I had the same experience.
Before beginning, please note the following:
-
A matrix
A
is defined asMatmxn
wherem=rows n=cols
, anda(i, j)
always refers to the element in the i-th row and j-th column. So in mathematics, whenever you see row/column indices, the first number should always be the row, and the second number is always the column. -
Multiplying two matrices is defined if and only if the number of columns in the first matrix is equal to the number of rows in the second matrix, i.e. the inner dimensions must match.
-
In mathematics, there’s no such thing as row/column-major. Row/column-major refers only to storage layout, which is specified by the programming language. What we do have, are row and column vectors.
Row and column vectors
If we have a Mat4x4
transform M
and a Vec4
v
, and we wanted to transform
v
by M
; we multiply them in one of two ways:
-
v
is a row vectorMat1x4
, and transformingv
byM
is written in the ordervM
. DoingMv
, i.e.Mat4x4*Mat1x4
is not allowed, because the inner dimensions don’t match. -
v
is a column vectorMat4x1
, and transformingv
byM
is written in the orderMv
.
Take the following two examples (in code form for alignment), where M
here is a Mat2x2
and v
is a Vec2
:
I) Mv: | x |
| y |
*
| m11 m12 | | m11*x + m12*y |
| m21 m22 | | m21*x + m22*y |
II) vM: | m11 m12 |
| m21 m22 |
*
| x y | | x*m11 + y*m21 x*m12 + y*m22 |
The results are different and that is not surprising; matrix multiplication is not
commutative. Getting the same result requires us to transpose both v
and M
, not just v
(as the transpose of a column vector is a row vector and vice versa).
So, row and column vectors tell us the order of multiplication, and therefore, how we should construct our transform matrices.
Having said that and to reiterate: if you use a column vector v
and have constructed a transform matrix M
, you would
do Mv
to apply the transform.
If, for whatever reason, you decided to switch to row vectors, which means you have transposed v
; you will no longer get
the same output vector, unless you also transpose the transform matrix M
. Only then would vM
give the same result.
Row- and column-major
Almost everywhere except for early Computer Graphics, column vectors are used by
default, i.e. multiplication order in most textbooks is Mv
.
From what I’ve read, this annoyed a lot of people and someone decided to fix the OpenGL spec and switch to column vectors instead of row vectors, to be consistent with most math and physics textbooks.
Here is where the col/row-major comes in:
for clarity, I want to define a translation matrix T
where the translation vector t
in early Computer Graphics
was placed in the last row, like so:
T "vM":
| 1 0 0 0 |
| 0 1 0 0 |
| 0 0 1 0 |
| tx ty tz 1 |
Now if we were to define this matrix in C/C++, we would need to consider something: given that computer memory addressing is 1-dimensional, does C/C++ store the entire 2D array contiguously in memory?
If yes, then the language has to decide which elements of that 2D array are going to be contiguous. If rows are contiguous, then it is row-major. If columns are contiguous; it is column-major.
As you may already know, C/C++ does store 2D arrays contiguously in memory, and it is row-major. So if you were to define the matrix T
in C/C++; it would look like this in memory:
|| 1, 0, 0, 0, 0, 1, 0, 0, 0, 0, 1, 0, tx, ty, tz, 1 ||
When they switched to column vectors, T
had to change to this:
T "Mv":
| 1 0 0 tx |
| 0 1 0 ty |
| 0 0 1 yz |
| 0 0 0 1 |
If we change T
in C/C++; it would now look like this in memory:
|| 1, 0, 0, t.x, 0, 1, 0, t.y, 0, 0, 1, t.z, 0, 0, 0, 1 ||
They didn’t like that they weren’t compatible with the old memory layout, so their “fix” was to change from row-major to column-major.
That means if we have a matrix B
and make B[j][i] = T[i][j]
, the columns for B
are now stored contiguously in memory.
That way B
has the old memory layout, only now using column vectors instead of row vectors.
The fact that a matrix is stored in column-major does NOT affect the rules of multiplication, but only how we retrieve some element in that matrix using some programming language.
Remember in mathematics, if we see row/column indices, the first number is always referring to the row, and the second number is referring to the column. M[][]
however, is programming language syntax, and whether the first number in that context represents row or column depends on whether M
is row-major or column-major.
Take as an example a Mat3x3
C
, and a Vec3
v
; we want to transform v
by C
. Let’s say we are using column vectors, so the multiplication order is Cv
. The y-component for the resulting vector is computed mathematically as:
c21x + c22y + c23*z
When implementing this in code, only now do we care about row-major vs. column-major. Because for example, how do we retrieve c21
from memory?
// If C is row-major.
result.y = C[2][1]*v.x + C[2][2]*v.y + C[2][3]*v.z;
// If C is column-major.
result.y = C[1][2]*v.x + C[2][2]*v.y + C[3][2]*v.z;
From what I understand, the OpenGL spec went against the mathematical notation for matrix indices, claiming that they use “column-major index notation”, i.e. a(i, j)
in their notation is the element in i-th column and j-th row; when in reality, there is no such thing as “column-major” in math, it doesn’t care about storage order, and a(i, j)
is always the element in i-th row and j-th column.
Final Remarks
In your engine, you need to specify two things when it comes to matrices and matrix multiplication:
- row vectors vs. column vectors (to decide order of multiplication).
- row-major vs. column-major (to decide how to store matrix in memory).
I like using row-major with column vectors, to stay consistent with the common notation of C/C++ and math, but it is important to note that column-major can be a better choice when doing SIMD.
When declaring your matrix structure, a trick you can do is name each element by its index explicitly, where the first number is always the row, as per the mathematical notation:
// row-major.
// Matmxn
// a(i, j) = arr[i][j] = arr[i*m + j]
struct Mat4x4
{
float _11, _12, _13, _14;
float _21, _22, _23, _24;
float _31, _32, _33, _34;
float _41, _42, _43, _44;
};
// column-major.
// Matmxn
// a(i, j) = arr[j][i] = arr[i + n*j]
struct Mat4x4
{
float _11, _21, _31, _41;
float _12, _22, _32, _42;
float _13, _23, _33, _43;
float _14, _24, _34, _44;
};
And now in our matrix multiplication function, we use the variable names instead of indexing using M[][]
:
Mat4x4 M;
Vec4 v;
// Mv+row-major.
result.x = M._11*v.x + M._12*v.y + M._13*v.z + M._14*v.w;
result.y = M._21*v.x + M._22*v.y + M._23*v.z + M._24*v.w;
result.z = M._31*v.x + M._32*v.y + M._33*v.z + M._34*v.w;
result.w = M._41*v.x + M._42*v.y + M._43*v.z + M._44*v.w;
// Mv+column-major.
result.x = M._11*v.x + M._12*v.y + M._13*v.z + M._14*v.w;
result.y = M._21*v.x + M._22*v.y + M._23*v.z + M._24*v.w;
result.z = M._31*v.x + M._32*v.y + M._33*v.z + M._34*v.w;
result.w = M._41*v.x + M._42*v.y + M._43*v.z + M._44*v.w;
//
//
// No difference!
You also ideally want to use same multiplication order in both CPU and GPU. Nowadays, you can easily tell shader languages, such as GLSL and HLSL, what sort of matrices you want to use.
References
Fabian Giesen’s blog:
Row major vs. column major, row vectors vs. column vectors
Row-major vs. column-major and GL ES
Austin Morlan’s blog:
OpenGL matrices