Rotation


A rotation is a circular movement of an object around a central line, known as the axis of rotation. A rotation R(α)R(\alpha) in RN\mathbb{R}^{N} is a linear operator parameterized by a real value, typically interpreted as the angle of rotation and denoted θ\theta or α\alpha. It is also a transformation.

In the context of a rigid body, it is a movement that maintains at least one point fixed. A rotation where the axis passes through an object's center of mass is called a spin rotation, whereas a rotation around an axis completely outside of the body is called a revolution (or orbit). In this case, the axis is called the axis of revolution.

Rotations form a group with interesting properties. Since rotations are, geometrically speaking, planar operations, the most fundamental group is the one in R2\mathbb{R}^{2}. Here they form a group known as

SO(2)={RM2(R)  RRT=RTR=1^ and detR=1}SO(2)=\{ R\in M_{2}(\mathbb{R})\ | \ RR^{T}=R^{T}R=\hat{\mathbf{1}} \text{ and } \det R=1\}

where M2(R)M_{2}(\mathbb{R}) is the set of all real 2×22\times 2 matrices. It is called the (two-dimensional) special orthogonal group1. It can also be written as SO(2)={R(α)  α[0,2π[ }SO(2)=\{ R(\alpha)\ |\ \alpha \in[0,2\pi[\ \} and is the group of all antisymmetric matrices in R2\mathbb{R}^{2}. For rotations in RN\mathbb{R}^{N}, they form the group

SO(N)={RMN(R)  RRT=RTR=1^ and detR=1}SO(N)=\{ R\in M_{N}(\mathbb{R})\ | \ RR^{T}=R^{T}R=\hat{\mathbf{1}} \text{ and } \det R=1\}

Properties

  • They are linear operators.
  • They are antisymmetric: R(α)TR(α)=R(α)RT(α)=IR(\alpha)^{T}R(\alpha)=R(\alpha)R^{T}(\alpha)=\mathrm{I}, where I\mathrm{I} is the Identity matrix.
  • They have unit determinant: detR(α)=1\det R(\alpha)=1.
  • Given a vector vRN\mathbf{v}\in \mathbb{R}^{N}, the application of a rotation R(α)R(\alpha) rotates it by an angle α\alpha with respect to its previous orientation.

Matrix representation

Being linear operators, rotations can be represented through square matrices. This is a particularly convenient form in more theoretical math, especially when operator theory can be used. Given a vector (x,y)(x,y) in a plane, a rotation about the origin by an angle θ>0\theta>0 is the vector (x,y)(x',y') given by

x=cos(θ)xsin(θ)y,y=sin(θ)x+cos(θ)yx'=\cos(\theta)x-\sin(\theta)y, \quad y'=\sin(\theta)x+\cos(\theta)y

which can be derived using plain trigonometry. The rotation is counterclockwise (by convention). These equations can be written in matrix form as

(xy)=(cosθsinθsinθcosθ)(xy)\boxed{\begin{pmatrix}x' \\ y'\end{pmatrix}=\begin{pmatrix}\cos\theta & -\sin\theta \\ \sin\theta & \cos\theta\end{pmatrix}\begin{pmatrix}x \\ y\end{pmatrix}}

In three dimensions, a rotation about the zz axis can be trivially defined as

(xyz)=(cosθsinθ0sinθcosθ0001)(xyz)\boxed{\begin{pmatrix}x' \\ y' \\ z'\end{pmatrix}=\begin{pmatrix}\cos\theta & -\sin\theta & 0 \\ \sin\theta & \cos\theta & 0 \\ 0 & 0 & 1\end{pmatrix}\begin{pmatrix}x \\ y \\ z\end{pmatrix}}

The 3D rotation matrix columns can be represented in the standard R3\mathbb{R}^{3} Basis {i,j,k}\{\mathbf{i},\mathbf{j},\mathbf{k}\}:

R0i=(cs0)=ci+sj,R0j=(sc0)=si+cj,R0k=(001)=kR_{0}\mathbf{i}=\begin{pmatrix}c \\ s \\ 0\end{pmatrix}=c\mathbf{i}+s\mathbf{j}, \quad R_{0}\mathbf{j}=\begin{pmatrix}-s \\ c \\ 0\end{pmatrix}=-s\mathbf{i}+c\mathbf{j}, \quad R_{0}\mathbf{k}=\begin{pmatrix}0 \\ 0 \\ 1\end{pmatrix}=\mathbf{k}

where c=cosθc=\cos\theta and s=sinθs=\sin\theta. Consider a right-handed orthonormal set {a,b,d}\{\mathbf{a},\mathbf{b},\mathbf{d}\}. R0R_{0} is a rotation in this basis. The matrix R1R_{1} that represents a rotation in the standard basis will transform a\mathbf{a}, b\mathbf{b} and d\mathbf{d} with

R1a=ca+sb,R1b=sa+cb,R1d=dR_{1}\mathbf{a}=c\mathbf{a}+s\mathbf{b}, \quad R_{1}\mathbf{b}=-s\mathbf{a}+c\mathbf{b}, \quad R_{1}\mathbf{d}=\mathbf{d}

and in matrix form

R1(abd)=(ca+sbsa+cbd)=(abd)(cs0sc0001)=(abd)R0R_{1}(\begin{array}{c|c}\mathbf{a} & \mathbf{b} & \mathbf{d}\end{array})=(\begin{array}{c|c}c\mathbf{a}+s\mathbf{b} & -s\mathbf{a}+c\mathbf{b} & \mathbf{d}\end{array})=(\begin{array}{c|c}\mathbf{a} & \mathbf{b} & \mathbf{d}\end{array})\begin{pmatrix}c & -s & 0 \\ s & c & 0 \\ 0 & 0 & 1\end{pmatrix}=(\begin{array}{c|c}\mathbf{a} & \mathbf{b} & \mathbf{d}\end{array})R_{0}

Calling P=(abd)P=(\begin{array}{c|c}\mathbf{a} & \mathbf{b} & \mathbf{d}\end{array}), we can see that the rotation matrices are similar:

R1P=PR0R0=P1R1P=PTR1PR_{1}P=PR_{0} \quad \Rightarrow \quad R_{0}=P^{-1}R_{1}P=P^{T}R_{1}P

where P1=PTP^{-1}=P^{T} comes from antisymmetry. We can also solve for

R1=PR0PTR_{1}=PR_{0}P^{T}

which gives us

R1=c(aaT+bbT)+s(baTabT)+ddT(1)R_{1}=c(\mathbf{a}\mathbf{a}^{T}+\mathbf{b}\mathbf{b}^{T})+s(\mathbf{b}\mathbf{a}^{T}-\mathbf{a}\mathbf{b}^{T})+\mathbf{d}\mathbf{d}^{T}\tag{1}

where all the xxT\mathbf{x}\mathbf{x}^{T} are 3×33\times3 matrices. This allows us to calculate R1R_{1}, though at the cost of having to define a\mathbf{a} and b\mathbf{b}. Still, there is a more convenient form that can be derived to remove this dependency as, intuitively, a rotation is independent of which vectors on the plane you refer it to.

The vector v\mathbf{v} that's being rotated is represented in {a,b,d}\{\mathbf{a},\mathbf{b},\mathbf{d}\} as a Linear combination of the basis vectors:

v=(av)a+(bv)b+(dv)d=αa+βb+δd(2)\mathbf{v}=(\mathbf{a}\cdot \mathbf{v})\mathbf{a}+(\mathbf{b}\cdot\mathbf{v})\mathbf{b}+(\mathbf{d}\cdot\mathbf{v})\mathbf{d}=\alpha\mathbf{a}+\beta\mathbf{b}+\delta\mathbf{d}\tag{2}

A couple of useful vectors are

d×v=βa+αb,d×(d×v)=αaβb\mathbf{d}\times\mathbf{v}=-\beta\mathbf{a}+\alpha\mathbf{b}, \quad \mathbf{d}\times(\mathbf{d}\times\mathbf{v})=-\alpha\mathbf{a}-\beta\mathbf{b}

The first one can be written as a matrix multiplied by a vector

d×v=(d1d2d3)×(v1v2v3)=(d2v3d3v2d3v1d1v3d1v2d2v1)=(0d3d2d30d1d2d10)(v1v2v3)=Dv(3)\mathbf{d}\times\mathbf{v}=\begin{pmatrix}d_{1} \\ d_{2} \\ d_{3}\end{pmatrix}\times\begin{pmatrix}v_{1} \\ v_{2} \\ v_{3}\end{pmatrix}=\begin{pmatrix}d_{2}v_{3}-d_{3}v_{2} \\ d_{3}v_{1}-d_{1}v_{3} \\ d_{1}v_{2}-d_{2}v_{1}\end{pmatrix}=\begin{pmatrix}0 & -d_{3} & d_{2} \\ d_{3} & 0 & -d_{1} \\ -d_{2} & d_{1} & 0\end{pmatrix}\begin{pmatrix}v_{1} \\ v_{2} \\ v_{3}\end{pmatrix}=D\mathbf{v}\tag{3}

The matrix DD is antisymmetric. We can then also say d×(d×v)=D2v\mathbf{d}\times(\mathbf{d}\times\mathbf{v})=D^{2}\mathbf{v}. This is (one of) the matrix representation(s) of the cross product.

Using (2)(2) and the unitary matrix II we can find

Iv=v==(aaT+bbT+ddT)vI\mathbf{v}=\mathbf{v}=\ldots=(\mathbf{a}\mathbf{a}^{T}+\mathbf{b}\mathbf{b}^{T}+\mathbf{d}\mathbf{d}^{T})\mathbf{v}

which means

I=aaT+bbT+ddTI=\mathbf{a}\mathbf{a}^{T}+\mathbf{b}\mathbf{b}^{T}+\mathbf{d}\mathbf{d}^{T}

Similarly, we have

Dv=d×v==(baTabT)vD\mathbf{v}=\mathbf{d}\times\mathbf{v}=\ldots=(\mathbf{b}\mathbf{a}^{T}-\mathbf{a}\mathbf{b}^{T})\mathbf{v}

and so

D=baTabTD=\mathbf{b}\mathbf{a}^{T}-\mathbf{a}\mathbf{b}^{T}

Finally, we have

D2v=d×(d×v)==(ddTI)vD^{2}\mathbf{v}=\mathbf{d}\times(\mathbf{d}\times\mathbf{v})=\ldots=(\mathbf{d}\mathbf{d}^{T}-I)\mathbf{v}

and so

D2=ddTID^{2}=\mathbf{d}\mathbf{d}^{T}-I

We can combine all these relations to find R1R_{1} from (1)(1):

R1=I+sinθD+(1cosθ)D2(4)\boxed{R_{1}=I+\sin\theta D+(1-\cos\theta)D^{2}}\tag{4}

In this form, the rotation matrix is independent from the choice of a\mathbf{a} and b\mathbf{b} when applied to a vector. In fact

R1v=Iv+sDv+(1c)D2v=v+sd×v+(1c)d×(d×v)R_{1}\mathbf{v}=I\mathbf{v}+sD\mathbf{v}+(1-c)D^{2}\mathbf{v}=\mathbf{v}+s\mathbf{d}\times\mathbf{v}+(1-c)\mathbf{d}\times(\mathbf{d}\times\mathbf{v})

Rotation vector spaces

It is possible to take the exponential of a matrix by using the series definition

exp(x)=n=0xnn!\exp(x)=\sum_{n=0}^{\infty} \frac{x^{n}}{n!}

This has some interesting effects in the context of rotations. Consider the most general antisymmetric 2×22\times 2 matrix:

A=(0αα0)=α(0110)=αE\mathrm{A}=\begin{pmatrix} 0 & -\alpha \\ \alpha & 0 \end{pmatrix}=\alpha \begin{pmatrix} 0 & -1 \\ 1 & 0 \end{pmatrix}=\alpha \mathrm{E}

The exponential is

expA=I+n=1Ann!=n=0A2m(2m)!+m=0A2m+1(2m+1)!=Im=0(1)mα2m(2m)!cosα+Em=0(1)mα2m+1(2m+1)!sinα\begin{align} \exp \mathrm{A}&=\mathrm{I}+\sum_{n=1}^{\infty} \frac{\mathrm{A}^{n}}{n!} \\ &=\sum_{n=0}^{\infty} \frac{\mathrm{A}^{2m}}{(2m)!}+\sum_{m=0}^{\infty} \frac{\mathrm{A}^{2m+1}}{(2m+1)!} \\ &=\mathrm{I}\underbrace{ \sum_{m=0}^{\infty} \frac{(-1)^{m}\alpha^{2m}}{(2m)!} }_{ \cos \alpha }+\mathrm{E}\underbrace{ \sum_{m=0}^{\infty} \frac{(-1)^{m}\alpha^{2m+1}}{(2m+1)!} }_{ \sin \alpha } \end{align}

where we used that A2m=(1)mα2mI\mathrm{A}^{2m}=(-1)^{m}\alpha^{2m}\mathrm{I} and A2m+1=A2mA=(1)mα2m+1E\mathrm{A}^{2m+1}=\mathrm{A}^{2m}\mathrm{A}=(-1)^{m}\alpha^{2m+1}\mathrm{E} and the Sine and cosine series. As such, in the most general case, the exponential of an antisymmetric matrix is

expA=cosα I+sinα E=(cosαsinαsinαcosα)=R(α)\exp \mathrm{A}=\cos \alpha \ \mathrm{I}+\sin \alpha\ \mathrm{E}=\begin{pmatrix} \cos \alpha & -\sin \alpha \\ \sin \alpha & \cos \alpha \end{pmatrix}=R(\alpha)

Evidently, all (finite) rotations are exponentials of the antisymmetric matrix E\mathrm{E}, weighed by some factor α\alpha. We can therefore represent them as

R(α)=eαE\boxed{R(\alpha)=e^{\alpha \mathrm{E}}}

Since all rotations are exponentials of E\mathrm{E}, we say that E\mathrm{E} is the generator of SO(2)SO(2). In fact, antisymmetric matrices form a Vector space (sum and scalar multiplication are both defined and closed) and E\mathrm{E} forms a Basis of this space. Thus, any rotation is also a Linear combination of basis elements. In 2D the basis is just {E}\{ \mathrm{E} \}, but in NN-dimensional spaces this provides an easy extension of the definition of rotation: just add more elements to the basis so that it generates SO(N)SO(N).

The number of generators, that is elements in the basis, is found to be N(N1)/2N(N-1)/2. In N=3N=3 we have 33 generators and these are

E1=(000001010),E2=(001000100),E3=(010100000)\mathrm{E}_{1}=\begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & -1 \\ 0 & 1 & 0 \end{pmatrix},\quad \mathrm{E}_{2}=\begin{pmatrix} 0 & 0 & 1 \\ 0 & 0 & 0 \\ -1 & 0 & 0 \end{pmatrix},\quad \mathrm{E}_{3}=\begin{pmatrix} 0 & -1 & 0 \\ 1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}

A generic infinitesimal rotation Ω\Omega is given by

Ω=i=13ωiEi=(0ω3ω2ω30ω1ω2ω10)\boxed{\Omega=\sum_{i=1}^{3} \omega_{i}\mathrm{E}_{i}=\begin{pmatrix} 0 & -\omega_{3} & \omega_{2} \\ \omega_{3} & 0 & -\omega_{1} \\ -\omega_{2} & \omega_{1} & 0 \end{pmatrix}}

For example, the rotation R3R_{3} is given by the exponential of E3\mathrm{E}_{3}

R3=eω3E3=exp(0ω30ω300000)=(cosω3sinω30sinω3cosω30001)R_{3}=e^{ \omega_{3}\mathrm{E}_{3} }=\exp\begin{pmatrix} 0 & -\omega_{3} & 0 \\ \omega_{3} & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}=\begin{pmatrix} \cos \omega_{3} & -\sin \omega_{3} & 0 \\ \sin \omega_{3} & \cos \omega_{3} & 0 \\ 0 & 0 & 1 \end{pmatrix}

But this is just a 2D rotation around an axis, specifically around the zz axis. The same can be found for R1R_{1} and R2R_{2} too, which end up being rotations around the xx and yy axes. Evidently, then, RiR_{i} is the rotation around the ii-th axis. A generic rotation can then be described as a Linear combination of basis rotations, just like how a vector is a linear combination of basis vectors (again, rotations constitute a vector space).

R=ei=1NωiEi=i=1NeωiEi\boxed{R=e^{ \sum_{i=1}^{N}\omega_{i}\mathrm{E}_{i} }=\prod_{i=1}^{N} e^{\omega_{i}\mathrm{E}_{i}}}

Intuitively, this makes sense: rotating something on a "diagonal" is like rotating it "horizontally" and then "vertically".

Look for a moment at the E\mathrm{E} basis matrices. They all have the same blueprint: antisymmetric, all zeros on the diagonal and exactly a single pair of 11 and 1-1, with one matrix for every entry outside of the diagonal. For one, this gives proof on why the number of basis elements is what it is: the number of nondiagonal entries in an N×NN\times N matrix is precisely N(N1)/2N(N-1)/2. But more than that, it gives us a way to formalize how this basis is built, which is through the Levi-Civita tensor ϵijk\epsilon_{ijk}. For instance, in 3D each basis matrix ii is defined by its elements (Ei)jk=ϵijk(\mathrm{E}_{i})_{jk}=-\epsilon_{ijk}. A generic rotation basis then is just the application of the Levi-Civita tensor in whatever dimension you need.

Infinitesimal rotations are hence defined in a similar manner. In 3D:

Ωij=k=13ϵijkωk\boxed{\Omega_{ij}=-\sum_{k=1}^{3} \epsilon_{ijk}\omega_{k}}

This permits a more refined analysis of what infinitesimal rotations even do. The variation of a vector vv subject to such a rotation must then be

δvi=j=13Ωijvj=j,k=13ϵijkωkvj=j,k=13ϵikjωkvj=(ω×v)i\delta v_{i}=\sum_{j=1}^{3} \Omega_{ij}v_{j}=-\sum_{j,k=1}^{3} \epsilon_{ijk}\omega_{k}v_{j}=\sum_{j,k=1}^{3} \epsilon_{ikj}\omega_{k}v_{j}=(\boldsymbol{\omega}\times \mathbf{v})_{i}

using ϵijk=ϵikj\epsilon_{ijk}=-\epsilon_{ikj} and the tensor definition of the vector product. Putting components together we can say that, in general, ω×v\boldsymbol{\omega}\times \mathbf{v} is the variation of v\mathbf{v} under an infinitesimal rotation of angle ω\lvert \boldsymbol{\omega} \rvert around an axis parallel to ω\boldsymbol{\omega}. The norm of this variation is

δv=ω×v=ωvsinθ=ωv\lvert \delta \mathbf{v} \rvert =\lvert \boldsymbol{\omega}\times \mathbf{v} \rvert =\lvert \boldsymbol{\omega} \rvert \lvert \mathbf{v} \rvert \sin \theta=\lvert \boldsymbol{\omega} \rvert \lvert \mathbf{v}_{\perp} \rvert

To further analyze Ei\mathrm{E}_{i}, we can calculate the Commutator between the Ei\mathrm{E}_{i} and Ej\mathrm{E}_{j}:

[Ei,Ej]=EiEjEjEi[\mathrm{E}_{i},\mathrm{E}_{j}]=\mathrm{E}_{i}\mathrm{E}_{j}-\mathrm{E}_{j}\mathrm{E}_{i}

We need to figure out the product between these matrices:

(EiEj)mn=k=13(Ei)mk(Ej)kn=k=13ϵimkϵikn=k=13ϵimkϵjnk=(δijδmnδinδmj)(\mathrm{E}_{i}\mathrm{E}_{j})_{mn}=\sum_{k=1}^{3} (\mathrm{E}_{i})_{mk}(\mathrm{E}_{j})_{kn}=\sum_{k=1}^{3} \epsilon_{imk}\epsilon_{ikn}=-\sum_{k=1}^{3} \epsilon_{imk}\epsilon_{jnk}=-(\delta_{ij}\delta_{mn}-\delta_{in}\delta_{mj})

using the Dirac delta. The commutator components then are

([Ei,Ej])mn=(EiEj)mn(EjEi)mn=(δijδmnδinδmj)+(δjiδmnδjnδmi)([\mathrm{E}_{i},\mathrm{E}_{j}])_{mn} = (\mathrm{E}_{i}\mathrm{E}_{j})_{mn} - (\mathrm{E}_{j}\mathrm{E}_{i})_{mn} = -\left(\delta_{ij}\delta_{mn} - \delta_{in}\delta_{mj}\right) + \left(\delta_{ji}\delta_{mn} - \delta_{jn}\delta_{mi}\right)

Simplify:

[Ei,Ej]mn=δinδmjδjnδmi=k=13ϵijk(Ek)mn [\mathrm{E}_{i},\mathrm{E}_{j}]_{mn} = \delta_{in}\delta_{mj} - \delta_{jn}\delta_{mi} = \sum_{k=1}^{3} \epsilon_{ijk} (\mathrm{E}_k)_{mn}

You can verify that each component [Ei,Ej]mn[\mathrm{E}_i, \mathrm{E}_j]_{mn} is actually an entry of Ek\mathrm{E}_k weighted by ϵijk\epsilon_{ijk}. In particular, the commutator simplifies to:

[Ei,Ej]=k=13ϵijkEk\boxed{[\mathrm{E}_{i},\mathrm{E}_{j}] = \sum_{k=1}^{3} \epsilon_{ijk} \mathrm{E}_{k}}

This gives a compact formula for the commutator of the generators of rotation. This might seem insignificant, but the fact is that we now have a well-behaved commutator in the vector space of rotations. This means that it's not any old vector space, but specifically it's a Lie algebra. This provides access to all of the machinery specific to Lie algebras and helps to study the special orthogonal groups such as SO(3)SO(3).

Conservation of angular momentum

Rotations have a connection of fundamental importance to the conservation of angular momentum. We'll use the Lagrangian formalism for this proof, but this fact is true in general regardless of formalism.

Consider a 3D rotation RR around an axis parallel to ω\boldsymbol{\omega} and write it as the exponential of a linear combination of generators as above:

R=exp(αi=13ω^iEi)R=\exp\left( \alpha \sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_{i}\mathrm{E}_{i} \right)

where ω^=ω/ω\hat{\boldsymbol{\omega}}=\boldsymbol{\omega}/\lvert \boldsymbol{\omega} \rvert. We define the Coordinate transformation

φ(q,α)=eαi=13ω^iEiq\varphi(q,\alpha)=e^{\alpha \sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_{i}\mathrm{E}_{i}}\mathbf{q}

We can take the derivative of this in α\alpha to find

φα(q,α)=(i=13ω^iEi)eαi=13ω^iEiq\frac{ \partial \varphi }{ \partial \alpha }(q,\alpha)=\left( \sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_{i}\mathrm{E}_{i} \right)e^{\alpha \sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_{i}\mathrm{E}_{i}}\mathbf{q}

The sum in parenthesis is an infinitesimal rotation. When evaluated in α=0\alpha=0 we get

φα(q,0)=(i=13ω^iEi)q\frac{ \partial \varphi }{ \partial \alpha } (q,0)=\left( \sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_{i}\mathrm{E}_{i} \right)\mathbf{q}

Now, assuming the Lagrangian L(q,q˙)\mathcal{L}(\mathbf{q}, \dot{\mathbf{q}}) is invariant under this rotation, we can invoke Nöther's theorem to guarantee the existence of a constant of motion:

jLq˙jφjα(q,0)\sum_j \frac{\partial \mathcal{L}}{\partial \dot{q}_j}\frac{\partial \varphi_j}{\partial \alpha} (q, 0)

Substituting φα(q,0)\frac{ \partial \varphi }{ \partial \alpha } (q,0) gives

j,kLq˙j(i=13ω^i(Ei)jkqk)\sum_{j, k} \frac{\partial \mathcal{L}}{\partial \dot{q}_j} \left( \sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_i (\mathrm{E}_i)_{jk} q_k \right)

This can be rearranged as

i=13ω^ij,kpj(Ei)jkqk\sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_i \sum_{j, k} p_j (\mathrm{E}_i)_{jk} q_k

Here, pj=Lq˙jp_j = \frac{\partial \mathcal{L}}{\partial \dot{q}_j} is the conjugate momentum and (Ei)jk(\mathrm{E}_i)_{jk} is the j,kj, k-th component of the ii-th rotation generator.

In vector form, recognize that

j,kpj(Ei)jkqk=pTEiq=(p×q)i=(L)i\sum_{j, k} p_j (\mathrm{E}_i)_{jk} q_k = \mathbf{p}^T \mathrm{E}_i \mathbf{q} = (\mathbf{p} \times \mathbf{q})_i = (\mathbf{L})_i

where L\mathbf{L} is the angular momentum. The conserved quantity becomes:

i=13ω^iLi=ω^L=Lω\sum_{i=1}^{3} \hat{\boldsymbol{\omega}}_i L_i = \hat{\boldsymbol{\omega}} \cdot \mathbf{L} = L_\omega

which is the projection of the total angular momentum onto the rotation axis ω^\hat{\boldsymbol{\omega}}.

This is an universally important fact: if the Lagrangian (or Hamiltonian for that matter) is invariant under rotation on an axis, then angular moment is conserved on that axis. This is true always as long as rotation invariance persists and in fact, for isolated systems, there is no restriction on what the axis is, which means that all angular momentum is conserved in such a system. This is one of the several important manifestations of Nöther's theorem.

Specific cases

Circular motion

If the axis D\mathbf{D} is fixed and the distance is a constant r0r_{0}, the coordinate system can be chosen to be Polar coordinates or Cylindrical coordinates depending on dimensions. If cylindrical, the plane of rotation should be the plane of reference, with the axis of rotation being the zz-axis.

A set of orthonormal axes can be chosen with the Cartesian tangent-normal Moving frame. Call ξ\mathbf{\xi} and η\mathbf{\eta} these two axes, potentially with D\mathbf{D} as the third if in 3D. Therefore, the position unit vector of the rotating object is R=ξsinθ+ηsinθ\mathbf{R}=\mathbf{\xi}\sin\theta+\mathbf{\eta}\sin\theta. In this system, angular speed is σ(t)=θ˙(t)\sigma(t)=\dot{\theta}(t), angular velocity is w(t)=σ(t)D=θ˙(t)D\mathbf{w}(t)=\sigma(t)\mathbf{D}=\dot{\theta}(t)\mathbf{D} and angular acceleration is α(t)=σ˙(t)D=θ¨(t)D\mathbf{\alpha}(t)=\dot{\sigma}(t)\mathbf{D}=\ddot{\theta}(t)\mathbf{D}.

The position therefore changes like

r(t)=r0R(t)+h0D\boxed{\mathbf{r}(t)=r_{0}\mathbf{R}(t)+h_{0}\mathbf{D}}

where h0h_{0} is the height of the movement above the plane Dr=0\mathbf{D}\cdot\mathbf{r}=0 if in 3D. The velocity therefore is v(t)=r0σP=r0σD×R=w×r\mathbf{v}(t)=r_{0}\sigma\mathbf{P}=r_{0}\sigma\mathbf{D}\times\mathbf{R}=\mathbf{w}\times\mathbf{r}, so

v(t)=w×r\boxed{\mathbf{v}(t)=\mathbf{w}\times\mathbf{r}}

so the tangential velocity can be derived from the angular velocity if the position is known. The acceleration then is a(t)=r0σ2R+r0θ¨P=r0σ2R+r0σ˙D×R=r0σ2R+α×r\mathbf{a}(t)=-r_{0}\sigma^{2}\mathbf{R}+r_{0}\ddot{\theta}\mathbf{P}=-r_{0}\sigma^{2}\mathbf{R}+r_{0}\dot{\sigma}\mathbf{D}\times\mathbf{R}=-r_{0}\sigma^{2}\mathbf{R}+\alpha\times\mathbf{r}, so

a(t)=r0σ2R+α×r\boxed{\mathbf{a}(t)=-r_{0}\sigma^{2}\mathbf{R}+\alpha\times\mathbf{r}}

where r0σ2R-r_{0}\sigma^{2}\mathbf{R} is the centripetal acceleration and α×r\mathbf{\alpha}\times\mathbf{r} is the tangential acceleration.

Moving axis

If the axis D(t)\mathbf{D}(t) is moving, let's consider the static axis case in a bit more detail before. We can define an antisymmetric matrix for any vector u\mathbf{u} as

C(u)=(0u3u2u30u1u2u10)C(\mathbf{u})=\begin{pmatrix}0 & -u_{3} & u_{2} \\ u_{3} & 0 & -u_{1} \\ -u_{2} & u_{1} & 0\end{pmatrix}

This is the matrix representation of the cross product, C(u)r=u×rC(\mathbf{u})\mathbf{r}=\mathbf{u}\times\mathbf{r} thanks to (3)(3), and thanks to (4)(4) we can say2

R(t)=I+C(D)sin(θ(t))+C(D)2(1cos(θ(t)))R(t)=I+C(\mathbf{D})\sin(\theta(t))+C(\mathbf{D})^{2}(1-\cos(\theta(t)))

Linear velocity is worked out as

r˙=v=w(t)×r(t)=C(w(t))r(t)\dot{\mathbf{r}}=\mathbf{v}=\mathbf{w}(t)\times \mathbf{r}(t)=C(\mathbf{w}(t))\mathbf{r}(t)

If we differentiate r\mathbf{r} directly, we get

v=R˙(t)r0=R˙(t)RTRr0=(R˙(t)RT)r(t)(5)\mathbf{v}=\dot{R}(t)r_{0}=\dot{R}(t)R^{T}R\mathbf{r}_{0}=(\dot{R}(t)R^{T})\mathbf{r}(t)\tag{5}

Therefore

C(w(t))=(R˙(t)RT)C(\mathbf{w}(t))=(\dot{R}(t)R^{T})

or

R˙(t)=C(w(t))R(t)(6)\dot{R}(t)=C(\mathbf{w}(t))R(t)\tag{6}

(5)(5) and (6)(6) equations tell us the rate of change for the rotation in terms of the current rotation and angular velocity.

Equipped with these, let's consider an actually moving axis D(t)\mathbf{D}(t). Using (4)(4), the rotation matrix corresponding to this vector for angle of rotation θ(t)\theta(t) is

R(t)=I+C(D(t))sin(θ(t))+C(D(t))2(1cos(θ(t)))(7)R(t)=I+C(\mathbf{D}(t))\sin(\theta(t))+C(\mathbf{D}(t))^{2}(1-\cos(\theta(t)))\tag{7}

which acts like

r(t)=R(t)r0\boxed{\mathbf{r}(t)=R(t)\mathbf{r}_{0}}

The linear velocity is given by (5)(5).

For a rotation matrix, I=RRTI=RR^{T}, so taking the time derivative

0=R˙RT+RR˙T=R˙RT+(R˙RT)T(R˙RT)T=R˙RT0=\dot{R}R^{T}+R\dot{R}^{T}=\dot{R}R^{T}+(\dot{R}R^{T})^{T} \quad \Rightarrow \quad(\dot{R}R^{T})^{T}=-\dot{R}R^{T}

which means S(t)=R˙(t)RT(t)S(t)=\dot{R}(t)R^{T}(t) is antisymmetric and because of that we can write S(t)=A(w(t))S(t)=A(\mathbf{w}(t)).

The angular velocity can be calculated using R(t)R(t) from (7)(7) and then calculating R˙RT\dot{R}R^{T}. Using C(D)3=C(D)C(\mathbf{D})^{3}=-C(\mathbf{D}) we can prove

w=θ˙D+sinθD˙+(cosθ1)D˙×D\boxed{\mathbf{w}=\dot{\theta}\mathbf{D}+\sin\theta\dot{\mathbf{D}}+(\cos\theta-1)\dot{\mathbf{D}}\times\mathbf{D}}

As D\mathbf{D} is a unit vector, the triad {D,D˙,D˙×D}\{\mathbf{D},\dot{\mathbf{D}},\dot{\mathbf{D}}\times\mathbf{D}\} forms an orthonormal basis. The angular velocity is expressed in this basis.

Footnotes

  1. Technically, SO(2)SO(2) is a specification for matrices in general, but the only matrices that satisfy these conditions are rotations.

  2. Here D\mathbf{D}, which is a unit vector, should not be confused with DD from (4)(4), which is a matrix. The usage of the same letter is an unfortunate coincidence.