Hydrogen atom


The hydrogen atom is the atom of the element hydrogen, a quantum physical system made up of a central positive charge carrier (a proton) and a negative charge carrier (an electron) surrounding it. It is the simplest case of a broader class of objects called hydrogenic atoms (or hydrogen-like atoms) which may possess a stronger central charge, that is, a full atomic nucleus.

center

The Hamiltonian for the Schrödinger equation of the hydrogen atom is

H^=22MR222mre2Ze24πε0re\hat{H}=- \frac{\hbar^{2}}{2M}\nabla ^{2}_{R}- \frac{\hbar^{2}}{2m}\nabla^{2}_{r_{e}}- \frac{Ze^{2}}{4\pi \varepsilon_{0}r_{e}}

where MM is the mass of the nucleus, mm is the mass of the electron, \hbar is the reduced Planck constant, ZZ is the atomic number (the number of protons), ε0\varepsilon_{0} is the vacuum permittivity and rer_{e} is the distance of the electron from the origin. The last term is the potential energy due to electromagnetic attraction.

Derivation

First things first, the end goal is finding the wavefunction of the system. We'll solve it in position representation, ψ(r)=rψ\psi(r)=\braket{ r | \psi }. This wavefunction is of course dependent on the spatial coordinates, but since the electron carries Spin (s=1/2s=1/2 and ms=±1/2m_{s}=\pm1/2 specifically), this will also be parameter. If we call qq the generalized coordinates of the wavefunction, we can write it as ψψ(q)=ψ(re,R,s,ms)\psi\equiv \psi(q)=\psi(r_{e},R,s,m_{s}).

However, the Hamiltonian has no spin term. ss and msm_{s} never appear. This suggests that the wavefunction that we get by solving H^ψ=Eψ\hat{H}\psi=E\psi is actually just the spatial part, with the spin part being a separate piece. Using separation of variables, we can split the two:

ψ(q)χsmsψ(re,R)\psi(q)\equiv \chi_{sm_{s}}\psi(r_{e},R)

We can now solve these separately.

The spin part

Since the Hamiltonian does not affect or depend on spin in any way, the spin part is just the eigenstates of the generic spin operator S^\hat{S}:

S^2χsms=2s(s+1)χsms,Szχsms=msχsms\hat{S}^{2}\chi_{sm_{s}}=\hbar ^{2} s(s+1)\chi_{sm_{s}},\quad S_{z}\chi_{sm_{s}}=\hbar m_{s}\chi_{sm_{s}}

Thankfully, we know the spin: it's s=1/2s=1/2, and this is the special case that is already solved in Spin > Spin 1/2. We know the eigenvectors of both S^2\hat{S}^{2} and S^z\hat{S}_{z} in spin 1/2: they're somewhat up to preference, so we'll choose

χ1/2,+1/2=(10),χ1/2,1/2=(01)\chi_{1/2,+1/2}=\begin{pmatrix} 1 \\ 0 \end{pmatrix},\quad \chi_{1/2,-1/2}=\begin{pmatrix} 0 \\ -1 \end{pmatrix}

And that's the spin part done already. The real challenge is solving the spatial part.

The spatial part

Since we have a central potential, the system lends itself well to be rewritten in spherical coordinates, as the Schrödinger equation has a general solution in this form. To do so, we make a first Coordinate transformation to the frame of reference of the center of mass. We define the reduced mass as usual:

μ=MmM+m\mu=\frac{Mm}{M+m}

Then, using μ\mu and expressing the Laplacian in the relative coordinate r=reR\mathbf{r}=\mathbf{r}_{e}-\mathbf{R}, the equation becomes

H^ψ(r)=[22μr2Ze24πε0r]ψ(r)=Eψ(r)\hat{H}\psi(r)=\left[ - \frac{\hbar^{2}}{2\mu} \nabla ^{2}_{r}- \frac{Ze^{2}}{4\pi \varepsilon_{0}r}\right]\psi(r)=E\psi(r)

The benefit is that now we only depend on one degree of freedom (rr) instead of two (re,Rr_{e},R). (Effectively, it's a single particle located in rr of mass μ\mu subject to the potential). Then, the spherical coordinate transformation from Cartesian coordinates is

{x=rsinθcosϕy=rsinθsinϕz=rcosθ,πθπ,0ϕ2π\begin{cases} x=r\sin \theta \cos \phi \\ y=r\sin \theta \sin \phi \\ z=r\cos \theta \end{cases},\quad-\pi\leq \theta\leq \pi,\quad 0\leq \phi\leq 2\pi

With this we can express the Laplacian in spherical coordinates. Writing the potential as V(r)V(r) for brevity, the Hamiltonian operator in spherical coordinates is

H^=22μ[1r2r(r2r)+1r2sinθθ(sinθθ)+1r2sin2θ2ϕ2]+V(r)\begin{align} \hat{H}&=- \frac{\hbar^{2}}{2\mu}\left[ \frac{1}{r^{2}}\frac{ \partial }{ \partial r } \left( r^{2}\frac{ \partial }{ \partial r } \right)+ \frac{1}{r^{2}\sin \theta}\frac{ \partial }{ \partial \theta } \left( \sin \theta \frac{ \partial }{ \partial \theta } \right)+ \frac{1}{r^{2}\sin ^{2}\theta}\frac{ \partial ^{2} }{ \partial \phi ^{2} } \right]+V(r) \\ \end{align}

We now introduce what is arguably the key component of the system: the quantum angular momentum L^=r^×p^=i(r^×)\hat{L}=\hat{r}\times \hat{p}=-i\hbar (\hat{r}\times \nabla)1. It is so important because if we express its Gradient in spherical coordinates, then take the square, we get the equation

L^2=2[1sinθθ(sinθθ)+1sin2θ2ϕ2]\hat{L}^{2}=-\hbar^{2}\left[ \frac{1}{\sin \theta}\frac{ \partial }{ \partial \theta } \left( \sin \theta \frac{ \partial }{ \partial \theta }\right) + \frac{1}{\sin ^{2} \theta}\frac{ \partial ^{2} }{ \partial \phi ^{2} } \right]

We can see that L^2\hat{L}^{2} comes up exactly in the Hamiltonian, so with a simple substitution

H^=22μ[1r2r(r2r)L^22r2]+V(r)\hat{H}=- \frac{\hbar^{2}}{2\mu}\left[ \frac{1}{r^{2}}\frac{ \partial }{ \partial r } \left( r^{2}\frac{ \partial }{ \partial r } \right)- \frac{\hat{L}^{2}}{\hbar^{2}r^{2}} \right]+V(r)

With this, the spherical Schrödinger equation for the hydrogen atom is

[22μr2r(r2r)+L^22μr2Ze24πε0r]ψ(r,θ,ϕ)=Eψ(r,θ,ϕ)\boxed{\left[ - \frac{\hbar^{2}}{2\mu r^{2}}\frac{ \partial }{ \partial r } \left( r^{2}\frac{ \partial }{ \partial r } \right)+ \frac{\hat{L}^{2}}{2\mu r^{2}}- \frac{Ze^{2}}{4\pi \varepsilon_{0}r} \right]\psi(r,\theta,\phi)=E\psi(r,\theta,\phi)}

where the angular dependence of the system is entirely contained inside the angular momentum. As usual, the square angular momentum, the momentum on one axis (zz) and the Hamiltonian all commute, [L^2,L^z]=[L^2,H^]=[L^z,H^]=0[\hat{L}^{2},\hat{L}_{z}]=[\hat{L}^{2},\hat{H}]=[\hat{L}_{z},\hat{H}]=0, so they all share simultaneous eigenstates. This sets the foundation of our final solution.

For a general solution of this equation in spherical coordinates, using a generic potential VV, see Equazione di Schrödinger > Soluzione tridimensionale. In there, we set ψ(r,θ,ϕ)=R(r)Y(θ,ϕ)\psi(r,\theta,\phi)=R(r)Y(\theta,\phi), find that the angular component Y(θ,ϕ)Y(\theta,\phi) of the solution is universal, completely independent on the potential and described by the Spherical harmonics. The radial component instead depends on VV and we must solve it uniquely for each the potential. As such, our next goal is to solve the radial component for our VV. But first, a refresher on the angular part.

Angular part

The angular components depend on the eigenstates of L^2\hat{L}^{2} and L^z\hat{L}_{z} and are found using the spherical harmonics Yl,m(θ,ϕ)Y_{l,m}(\theta,\phi):

L^2Yl,m(θ,ϕ)=l(l+1)2Yl,m(θ,ϕ)L^zYl,m(θ,ϕ)=mYl,m(θ,ϕ)\boxed{\hat{L}^{2}Y_{l,m}(\theta,\phi)=l(l+1)\hbar^{2}Y_{l,m}(\theta,\phi)}\qquad\hat{L}_{z}Y_{l,m}(\theta,\phi)=m\hbar Y_{l,m}(\theta,\phi)

Using spherical harmonics on L^z\hat{L}_{z} is a bit overkill, since it's really only dependent on ϕ\phi. In the general derivation, one finds that the harmonics themselves can go through separation of variables as Yl,m(θ,ϕ)=Θl,m(θ)Φm(ϕ)Y_{l,m}(\theta,\phi)=\Theta_{l,m}(\theta)\Phi_{m}(\phi). The Φm(ϕ)\Phi_{m}(\phi) function is pretty easy to solve and is found to be Φm(ϕ)=eimϕ\Phi_{m}(\phi)=e^{im\phi}. The Θl,m(θ)\Theta_{l,m}(\theta) function is more complicated is requires invoking the associated Legendre functions Pl,m(cosθ)P_{l,m}(\cos \theta) to get Θ(θ)=APl,m(cosθ)\Theta(\theta)=AP_{l,m}(\cos \theta) (AA is some constant). As such, we can rewrite the L^z\hat{L}_{z} eigenstates as

L^zΦm(ϕ)=mΦm(ϕ)\boxed{\hat{L}_{z}\Phi_{m}(\phi)=m\hbar \Phi_{m}(\phi)}

In the context of the hydrogen atom, the numbers ll and mm are given special names: ll is the azimuthal quantum number and mm is the magnetic azimuthal quantum number.

Still, spherical harmonics are generally complex functions. We know very well that only real numbers can be measured from a system, so we need to bridge the gap. We do the usual: take the square modulus

Yl,m(θ,ϕ)2=Θl,m(θ)2Φm(ϕ)2\lvert Y_{l,m}(\theta,\phi) \rvert^{2} =\lvert \Theta_{l,m}(\theta) \rvert^{2}\lvert \Phi_{m}(\phi) \rvert^{2}

Luckily, the modulus of Φm(ϕ)\Phi_{m}(\phi) is very easy: Φm(ϕ)2=eimϕ2=1/2π\lvert \Phi_{m}(\phi) \rvert^{2}=\lvert e^{im\phi} \rvert^{2}=1/2\pi. (Notably, it is completely independent of mm.) Unluckily, the square modulus of Θ(θ)\Theta(\theta) isn't. They can, for instance, be found computationally and plotted in graphs or animations. Analytically, we'll remain with

Yl,m(θ,ϕ)2=12πΘl,m(θ)2\lvert Y_{l,m}(\theta,\phi) \rvert^{2} =\frac{1}{2\pi}\lvert \Theta_{l,m}(\theta) \rvert^{2}

An important detail: since the spherical harmonics for a given ll is spherically symmetrical over all mm, so is the charge distribution (see Spherical harmonics > Symmetry).

Radial part

The radial part of the equation is is

[22μr2ddr(r2ddr)+2l(l+1)2μr2Ze24πε0r]R(r)=ER(r)\left[ - \frac{\hbar^{2}}{2\mu r^{2}} \frac{d}{dr} \left( r^{2} \frac{d}{dr} \right)+ \frac{\hbar^{2}l(l+1)}{2\mu r^{2}}- \frac{Ze^{2}}{4\pi \varepsilon_{0}r} \right]R(r)=ER(r)

where we used L^2ψ=2l(l+1)ψ\hat{L}^{2}\psi=\hbar ^{2}l(l+1)\psi and rewrote the partial derivatives as regular derivatives since rr is now the only variable. We set u(r)rR(r)u(r)\equiv rR(r) to simplify things:

[22μd2dr2+2l(l+1)2μr2Ze24πε0r]u(r)=Eu(r)\left[ - \frac{\hbar^{2}}{2\mu } \frac{d^{2}}{dr^{2}}+ \frac{\hbar^{2}l(l+1)}{2\mu r^{2}}- \frac{Ze^{2}}{4\pi \varepsilon_{0}r} \right]u(r)=Eu(r)

Notice how there is a term dependent on the angular momentum number ll even in the radial term. What we do is to combine it with the potential term in a single, effective potential VeffV_\text{eff} defined as2

Veff=Ze24πε0r+2l(l+1)2μr2\boxed{V_\text{eff}=-\frac{Ze^{2}}{4\pi \varepsilon_{0}r}+ \frac{\hbar^{2}l(l+1)}{2\mu r^{2}}}

which reduces the equation even further to

[22μd2dr2+Veff]u(r)=Eu(r)\left[ - \frac{\hbar^{2}}{2\mu}\frac{d^{2}}{dr^{2}}+ V_\text{eff}\right]u(r)=Eu(r)

or equivalently

d2u(r)dr2+2μ2[EVeff]u(r)=0\frac{d^{2}u(r)}{dr^{2}}+ \frac{2\mu}{\hbar ^{2}}[E-V_\text{eff}]u(r)=0

This potential looks something like this:

Plot Effective spherical Schrodinger potential.svg

Note that for l=0l=0, the potential is purely attractive: nothing is stopping the electron from falling and crashing into the nucleus. But if we add a rotation (l>0l>0), suddenly there is a spike of potential beyond a certain point which explains why electrons don't just collide with the nucleus. The value of rr at the bottom of each potential well for l>0l>0 should, in principle at least, correspond to the "allowed orbits" that the Bohr model identified. As always, the system is bound if E<0E<0 and scattering if E>0E>0.

Now, solving the radial part is not exactly trivial. First, to solve a (1D) differential equation like the Schrödinger equation we need a boundary condition, which we have not set up until now. We'll set u(0)=0u(0)=0, as that prevents the wavefunction from blowing up to infinity when r=0r=03.

For convenience, we introduce a couple of dimensionless variables

ρ8μE2r,λZe24πε0μ2E\rho\equiv\sqrt{ - \frac{8\mu E}{\hbar ^{2}} } r,\qquad \lambda\equiv\frac{Ze^{2}}{4\pi \varepsilon_{0}\hbar}\sqrt{ - \frac{\mu}{2E} }

This leads to

ddr=ddρdρdr=ddρ8μE2drdr=8μE2ddρ\frac{d}{dr}=\frac{d}{d\rho} \frac{d\rho}{dr}=\frac{d}{d\rho} \sqrt{ - \frac{8\mu E}{\hbar ^{2}} } \frac{dr}{dr}=\sqrt{ - \frac{8\mu E}{\hbar ^{2}} } \frac{d}{d\rho}

and so

d2dr2=8μE2d2dρ2\frac{d^{2}}{dr^{2}}=- \frac{8\mu E}{\hbar ^{2}} \frac{d^{2}}{d\rho ^{2}}

If we substitute these in the equation (with VeffV_\text{eff} expanded), we get

8μE2d2u(ρ)dρ2+2μ2[EZe24πε0ρ8μE2+2l(l+1)2μρ2(8μE2)]u(ρ)=0[8μE2d2dρ2+2μ2E2μ2Ze24πε0ρ8μE2+2μ24El(l+1)ρ2]u(ρ)=08μE2[d2dρ214+2μ2Ze24πε0ρ28μE144l(l+1)ρ2]u(ρ)=0[d2dρ214+Ze24πε0ρμ2El(l+1)ρ2]u(ρ)=0\begin{align} &- \frac{8\mu E}{\hbar ^{2}} \frac{d^{2}u(\rho)}{d\rho ^{2}}+ \frac{2\mu}{\hbar ^{2}}\left[ E- \frac{Ze^{2}}{4\pi \varepsilon_{0}\rho}\sqrt{ - \frac{8\mu E}{\hbar ^{2}} } + \frac{\hbar^{2}l(l+1)}{2\mu \rho ^{2}}\left( - \frac{8\mu E}{\hbar ^{2}} \right) \right]u(\rho)=0 \\ &\left[ - \frac{8\mu E}{\hbar ^{2}} \frac{d^{2}}{d\rho ^{2}}+ \frac{2\mu}{\hbar ^{2}}E- \frac{2\mu}{\hbar ^{2}} \frac{Ze^{2}}{4\pi \varepsilon_{0} \rho}\sqrt{ - \frac{8\mu E}{\hbar ^{2}} }+ \frac{2\mu}{\hbar ^{2}} \frac{4El(l+1)}{\rho ^{2}} \right] u(\rho)=0 \\ &- \frac{8\mu E}{\hbar ^{2}}\left[ \frac{d^{2}}{d\rho ^{2}}- \frac{1}{4} + \frac{2\mu}{\hbar ^{2}} \frac{Ze^{2}}{4\pi \varepsilon_{0}\rho}\sqrt{ - \frac{\hbar^{2}}{8\mu E} }- \frac{1}{4} \frac{4l(l+1)}{\rho ^{2}} \right]u(\rho)=0 \\ &\left[ \frac{d^{2}}{d\rho ^{2}}- \frac{1}{4}+ \frac{Ze^{2}}{4\pi \varepsilon_{0}\hbar\rho} \sqrt{ - \frac{\mu}{2E} }- \frac{l(l+1)}{\rho ^{2}} \right]u(\rho)=0 \end{align}

Noticing λ\lambda in the middle leads us to

[d2dρ2l(l+1)ρ2+λρ14]u(ρ)=0\left[ \frac{d^{2}}{d\rho ^{2}} - \frac{l(l+1)}{\rho ^{2}}+ \frac{\lambda}{\rho}- \frac{1}{4}\right]u(\rho)=0

A much simpler form, but still quite abstract. Instead of diving head-first into solving it, we can start by analyzing asymptotic behavior. When ρ\rho\to \infty, the equations greatly simplifies to

[d2dρ214]u(ρ)=0\left[ \frac{d^{2}}{d\rho ^{2}}- \frac{1}{4} \right]u(\rho)=0

This a linear second order ODE of the form u¨14u=0\ddot{u}- \frac{1}{4}u=0 and the solution is an exponential. Thus, for high ρ\rho, we have

u(ρ)e±ρ/2u(\rho)\sim e^{\pm \rho/2}

Using the fact that u(ρ)u(\rho) must be bounded everywhere, we keep only the negative sign. Since we have an asymptotic solution, we can claim that the general solution behaves in the same manner as the asymptote, but scaled by some other function f(ρ)f(\rho). Thus, in general, in and out of the asymptote:

u(ρ)=f(ρ)eρ/2u(\rho)=f(\rho)e^{-\rho/2}

By definition, f(ρ)f(\rho) must contain all of the information about EE and ll, since those are not present in the asymptotic term: f(ρ)fE,l(ρ)f(\rho)\equiv f_{E,l}(\rho). Thus, since

d2dρ2(f(ρ)eρ/2)=eρ/2[d2dρ2ddρ+14]f(ρ)\frac{d^{2}}{d\rho ^{2}}(f(\rho)e^{-\rho/2})=e^{-\rho/2}\left[ \frac{d^{2}}{d\rho ^{2}}- \frac{d}{d\rho}+ \frac{1}{4} \right]f(\rho)

our new equation to solve is

[d2dρ2ddρl(l+1)ρ2+λρ]f(ρ)=0(1)\left[ \frac{d^{2}}{d\rho ^{2}}- \frac{d}{d\rho} - \frac{l(l+1)}{\rho ^{2}}+ \frac{\lambda}{\rho}\right]f(\rho)=0\tag{1}

Our next problem is finding f(ρ)f(\rho).

Finding f(ρ)f(\rho)

To start, we again look at asymptotic behavior:

limρ0[d2dρ2ddρl(l+1)ρ2+λρ]f(ρ)=0\lim_{ \rho \to 0 } \left[ \frac{d^{2}}{d\rho ^{2}}- \frac{d}{d\rho}- \frac{l(l+1)}{\rho ^{2}} + \frac{\lambda}{\rho}\right]f(\rho)=0

This can only be true if f(ρ)ρl+1f(\rho)\sim \rho^{l+1}. In fact, using the definition of f(ρ)ρs+1f(\rho)\sim \rho^{s+1}:

(s+1)sρs1(s+1)ρsl(l+1)ρs1+λρs0(s+1)s \rho^{s-1}- (s+1)\rho^{s}-l(l+1)\rho^{s-1}+\lambda \rho^{s}\to0

But only the dominant term ρs1\rho^{s-1} matters in the limit so this is like saying

(s+1)sρs1l(l+1)ρs1=[s(s+1)l(l+1)]ρs10(s+1)s\rho^{s-1}-l(l+1)\rho^{s-1}=[s(s+1)-l(l+1)]\rho^{s-1}\to 0

Since ρs1>0\rho^{s-1}>0 for all ρ\rho, this is true only if s=ls=l, which proves that f(ρ)ρl+1f(\rho)\sim \rho^{l+1}.

Outside of the asymptote, f(ρ)f(\rho) must behave something like f(ρ)=ρl+1g(ρ)f(\rho)=\rho^{l+1}g(\rho) where g(ρ)g(\rho) is some function. As an added note, this means that our current understanding of u(ρ)u(\rho) is u(ρ)=eρ/2ρl+1g(ρ)u(\rho)=e^{-\rho/2}\rho^{l+1}g(\rho). We now substitute this in (1)(1) to get

[ρd2dρ2+(2l+2ρ)ddρ+(λl1)]g(ρ)=0(2)\left[ \rho \frac{d^{2}}{d\rho ^{2}}+ (2l+2-\rho) \frac{d}{d\rho}+ (\lambda-l-1) \right]g(\rho)=0\tag{2}

This equation can be solved analytically by expanding g(ρ)g(\rho) in its power series

g(ρ)=k=0ckρkfor c00g(\rho)=\sum_{k=0}^{\infty} c_{k}\rho^{k}\quad\text{for }c_{0}\neq 0

Taking the derivatives of this power series yields

k=0[k(k1)ckρk1+(2l+2ρ)kckρk1+(λl1)ckρk]=0\sum_{k=0}^{\infty}\left[ k(k-1)c_{k}\rho^{k-1}+ (2l+2-\rho)kc_{k}\rho^{k-1}+(\lambda-l-1)c_{k}\rho^{k} \right]=0

Split the sum like so

k=0[k(k1)ckρk1+(2l+2ρ)kckρk1]+k=0(λl1)ckρk=0\sum_{k=0}^{\infty} [k(k-1)c_{k}\rho^{k-1}+(2l+2-\rho)kc_{k}\rho^{k-1}]+\sum_{k=0}^{\infty}(\lambda-l-1)c_{k}\rho^{k} =0

and the first sum is further split as

k=0[k(k1)ckρk1+(2l+2)kckρk1]+k=0[kckρk]+k=0(λl1)ckρk=0\sum_{k=0}^{\infty} [k(k-1)c_{k}\rho^{k-1}+(2l+2)kc_{k}\rho^{k-1}]+\sum_{k=0}^{\infty} [-kc_{k}\rho^{k}]+\sum_{k=0}^{\infty}(\lambda-l-1)c_{k}\rho^{k} =0

Changing the index from kk to k+1k+1 in the first yields the equivalent form

k=0[k(k+1)+(2l+2)(k+1)]ck+1ρk\sum_{k=0}^{\infty} [k(k+1)+(2l+2)(k+1)]c_{k+1}\rho^{k}

Plug this in, combine the sums again and collect ρk\rho^{k} to get

k=0[(k(k+1)+(2l+2)(k+1))ck+1+(λl1k)ck]ρk=0\sum_{k=0}^{\infty} [(k(k+1)+(2l+2)(k+1))c_{k+1}+(\lambda-l-1-k)c_{k}]\rho^{k}=0

This is true for all ρ\rho, which means that each component must be zero termwise:

(k(k+1)+(2l+2)(k+1))ck+1+(λl1k)ck=0(k(k+1)+(2l+2)(k+1))c_{k+1}+(\lambda-l-1-k)c_{k}=0

which yields the recurrence relation

ck+1=[k+l+1λ(k+1)(k+2l+2)]ckc_{k+1}=\left[ \frac{k+l+1-\lambda}{(k+1)(k+2l+2)} \right]c_{k}

For large kk, the ratio is approximately

ck+1ck1k\frac{c_{k+1}}{c_{k}}\sim \frac{1}{k}

As it happens, this ratio is the same as that of the series for ρpeρ\rho^{p}e^{\rho} with pRp \in \mathbb{R}. This implies that g(ρ)g(\rho) has this very behavior

g(ρ)ρpeρfor some pRg(\rho)\sim \rho^{p}e^{\rho}\quad\text{for some }p \in \mathbb{R}

Combined with our previous result u(ρ)=eρ/2ρl+1g(ρ)u(\rho)=e^{-\rho/2}\rho^{l+1}g(\rho), this implies that the asymptotic behavior of u(ρ)u(\rho) is something of the form u(ρ)ρl+1+peρ/2u(\rho)\sim\rho^{l+1+p}e^{\rho/2}. But this can't be, for it would break our previous assumption of u(ρ)eρ/2u(\rho)\sim e^{-\rho/2} (and by extension the normalizability of the wavefunction). Thus the series must terminate early, which is to say that it must be a polynomial in ρ\rho. We shall call nrn_{r} the rank of the polynomial, the radial quantum number, and it is a nonnegative integer (possibly zero). This means that any terms of index k>nrk>n_{r} must be zero and that the highest term is cnrρnrc_{n_{r}}\rho^{n_{r}}.

Using the recurrence relation we can state

cnr+1=0=nr+l+1λ(nr+1)(nr+2l+2)cnrc_{n_{r}+1}=0=\frac{n_{r}+l+1-\lambda}{(n_{r}+1)(n_{r}+2l+2)}c_{n_{r}}

Multiply both sides by the denominator to get

(nr+l+1λ)cnr=0(n_{r}+l+1-\lambda)c_{n_{r}}=0

but we know that cnr0c_{n_{r}}\neq 0, so it must be that

λ=nr+l+1\lambda=n_{r}+l+1

We'll rename this to nn

nnr+l+1n\equiv n_{r}+l+1

and we shall call it the principal quantum number. Since nr=0,1,2,n_{r}=0,1,2,\ldots and l=0,1,2,l=0,1,2,\ldots, we have n=1,2,3,n=1,2,3,\ldots.

Eigenvalues

Now that we finally know what λ\lambda is, we can invert its definition to extract EE, substitute λ\lambda for nn and get the energy eigenvalues:

En=12n2(Ze24πε0)2μ2=e24πε0a0μmZ22n2=e24πε0aμZ22n2E_{n}=- \frac{1}{2n^{2}}\left( \frac{Ze^{2}}{4\pi \varepsilon_{0}} \right)^{2} \frac{\mu}{\hbar ^{2}}=- \frac{e^{2}}{4\pi \varepsilon_{0}a_{0}} \frac{\mu}{m} \frac{Z^{2}}{2n^{2}}=- \frac{e^{2}}{4\pi \varepsilon_{0}a_{\mu}} \frac{Z^{2}}{2n^{2}}

where we used the Bohr radius a0a_{0} and modified Bohr radius aμ=a0(m/μ)a_{\mu}=a_{0} (m/\mu). Thus, after all this derivation, we have the expression for the allowed energy levels of the hydrogen atom:

En=e24πε0aμZ22n2\boxed{E_{n}=- \frac{e^{2}}{4\pi \varepsilon_{0}a_{\mu}} \frac{Z^{2}}{2n^{2}}}

Notably, the energy eigenvalues are not dependent on ll (not directly, at least). This means that the energy eigenstates are degenerate in ll too, not just mm. For each nn there are n1n-1 values of ll, and for each ll there are 2l+12l+1 values of mm, which leads to a total eigenvalue degeneracy of

d=l=0n1(2l+1)=2n(n1)2+n=n2d=\sum_{l=0}^{n-1} (2l+1)=2 \frac{n(n-1)}{2}+n=n^{2}

The degeneracy with respect to mm is universal for spherically symmetric systems, but the degeneracy in ll is actually specific to the Coulomb potential, or rather potentials that go like V1/rV\sim 1/r. In fact, were the potential to behave differently, the ll degeneracy would be lifted and we'd have nn distinct states for each principal level nn. This happens, for instance, in the outer electrons of an atom with more than one electron shell, which are also subject to the repulsion of inner electrons, changing the effective potential to a different shape than 1/r\sim1/r. You can see this in the Many-electron atom article. In a sense, the ll degeneracy has little to do with the shape of an atom specifically: anything around a point charge would work the same.

The levels themselves are infinite and tend towards zero as you get further from the nucleus: this shouldn't be surprising considering that the electromagnetic potential of a point charge (the thing that is responsible for keeping the electron anchored to the nucleus) falls off with distance. They also get progressively closer to one another for larger nn. The difference between n=1n=1 and n=2n=2 is bigger than that between n=2n=2 and n=3n=3 and so on.

Furthermore, these energy levels are identical to those obtained from the Bohr model of the atom. Despite being a (semi)classical model, it already got the quantum levels correct down to the equation. But what it couldn't do, however, are the eigenstates themselves.

Eigenstates

To compute the eigenstate, we look back at (2)(2). Earlier, we looked only at the asymptotic behavior, which led us to the definition of the radial and principal quantum numbers and λ=n\lambda=n. We now want to find the actual solutions themselves. This is a second order Ordinary differential equation in ρ\rho and luckily, a similar equation has already been solved in the past.

The (generalized) Laguerre's differential equation is the following second order ODE:

[ρd2dρ2+(p+1ρ)ddρ+q]Lqp(ρ)=0\left[ \rho \frac{d^{2}}{d\rho^{2}}+(p+1-\rho) \frac{d}{d\rho}+q \right]L_{q}^{p}(\rho)=0

whose solutions Lqp(ρ)L_{q}^{p}(\rho) are the associated Laguerre polynomials

Lqp(ρ)xpeρq!dqdρq(ρq+peρ)L_{q}^{p}(\rho)\equiv \frac{x^{-p}e^{\rho}}{q!} \frac{d^{q}}{d\rho^{q}}(\rho^{q+p}e^{-\rho})

for some nonnegative integers qq and pp. This should ring a bell: it is basically just (2)(2) with slightly different variables. In fact, if we equate the differences we get

{p+1ρ=2l+2ρq=nl1g(ρ)=Lqp(ρ){p=2l+1q=nl1g(ρ)=Lqp(ρ)\begin{cases} p+1-\rho=2l+2-\rho \\ q=n-l-1 \\ g(\rho)=L_{q}^{p}(\rho) \end{cases}\quad\Rightarrow \quad \begin{cases} p=2l+1 \\ q=n-l-1 \\ g(\rho)=L_{q}^{p}(\rho) \end{cases}

Thus, the physically valid solutions of (2)(2) are

g(ρ)=ALnl12l+1(ρ)g(\rho)=AL_{n-l-1}^{2l+1}(\rho)

where AA is some Normalization constant. Substituting rr for ρ\rho and substituting g(ρ)g(\rho) into R(r)R(r) (after unwinding all the functions we've added)

Rnl(r)=u(r)r=naμ2Zf(ρ)eρ/2ρ=naμ2Zg(ρ)ρl+1eρ/2ρ=Aρleρ/2Lnl12l+1(ρ)R_{nl}(r)=\frac{u(r)}{r}=\frac{na_{\mu}}{2Z} \frac{f(\rho)e^{-\rho/2}}{\rho}=\frac{na_{\mu}}{2Z}\frac{g(\rho)\rho^{l+1}e^{-\rho/2}}{\rho}=A\rho^{l}e^{-\rho/2}L_{n-l-1}^{2l+1}(\rho)

where in the last step we have absorbed naμ/2Zna_{\mu}/2Z inside AA since they are both constants. Calculating the constant (see below) yields the radial wavefunction:

Rnl(ρ)=(2Znaμ)3(nl1)!2n[(n+l)!]3ρleρ/2Lnl12l+1(ρ)\boxed{R_{nl}(\rho)=- \sqrt{ \left( \frac{2Z}{na_{\mu}} \right)^{3} \frac{(n-l-1)!}{2n[(n+l)!]^{3}} }\rho^{l}e^{-\rho/2}L_{n-l-1}^{2l+1}(\rho)}

Spatial wavefunction

This completes our proof: we've assembled all the pieces we need to determine the spatial wavefunction of the hydrogenic atom, which is the product of the the radial part and the angular part:

ψnlm(r,θ,ϕ)=(2Znaμ)3(nl1)!2n[(n+l)!]3(2Znaμ)leZ/naμLnl12l+1(2Znaμ)Ylm(θ,ϕ)\boxed{\psi_{nlm}(r,\theta,\phi)=- \sqrt{ \left( \frac{2Z}{na_{\mu}} \right)^{3} \frac{(n-l-1)!}{2n[(n+l)!]^{3}} }\left( \frac{2Z}{na_{\mu}} \right)^{l}e^{-Z/na_{\mu}}L_{n-l-1}^{2l+1}\left( \frac{2Z}{na_{\mu}} \right)Y_{lm}(\theta,\phi)}

where

ρ=2Znaμr=2Zna0Mmr\rho=\frac{2Z}{na_{\mu}}r=\frac{2Z}{na_{0}} \frac{M}{m}r > since spherical harmonics are already normalized. Then > $$1=\lvert A \rvert^{2} \int_{0}^{\infty} \rho^{2l}e^{-\rho}[L_{n-l-1}^{2l+1}(\rho)]^{2} r^{2} dr

Use the definition of ρ\rho to substitute

> and get > $$1=\lvert A \rvert^{2} \left( \frac{na_{\mu}}{2Z} \right)^{3}\int_{0}^{\infty}\rho^{2l}e^{-\rho}[L_{n-l-1}^{2l+1}(\rho)]^{2}\rho ^{2}d\rho

Now, this integral is very hard4. Fortunately, someone else did the math already: in the Mathematical Handbook of Formulas and Tables, a special result for the associated Laguerre polynomials (Eq. 30.50) precisely solves this integral. After due substitutions5, the result is

\int_{0}^{\infty}\rho^{2l+2}e^{-\rho}[L_{n-l-1}^{2l+1}(\rho)]^{2}d\rho&=\frac{[2(n-l-1)+2l+2](n-l-1+2l+1)!}{(n-l-1)!} \\ &=\frac{2n(n+l)!}{(n-l-1)!} \end{align}
> If we plug this in the former equation, extract $\lvert A \rvert^{2}$, take the square root and choose the negative solution, we get > $$A=-\sqrt{ \left( \frac{2Z}{na_{\mu}} \right)^{3} \frac{(n-l-1)!}{2n(n+l)!} }
Results

Each of these wavefunctions represents a possible state of the electron around the nucleus, bringing along the baggage of all the probabilities of where it might be at any given time while "orbiting" the nucleus. In fact, this classical concept of "orbit" remained so prevalent throughout the years that it gave the special name we use to refer to these wavefunctions: orbitals. Common in chemistry too, we use a specific naming scheme to specify which orbital we are talking about, called spectroscopic notation.

As is, the orbitals have limited intepretability. But as with all quantum mechanical system, the square modulus of ψnlm\psi_{nlm} gives the Probability of finding the system in a given state. Specifically, ψnlm(r,θ,ϕ)2dr\lvert \psi_{nlm}(r,\theta,\phi) \rvert^{2}d\mathbf{r} is the probability of finding the electron in an infinitesimal volume element drd\mathbf{r} and is thus effectively the (shape of the) volume charge distribution of the hydrogen atom. Meanwhile, the probability of finding it at a distance rr anywhere around the nucleus is found by integrating ψnlm(r,θ,ϕ)2\lvert \psi_{nlm}(r,\theta,\phi) \rvert^{2} over all angles. This leads to what's known as a density function

Dnl(r)=r2Rnl(r)2\boxed{D_{nl}(r)=r^{2}\lvert R_{nl}(r) \rvert ^{2}}

since Ylm(θ,ϕ)Y_{lm}(\theta,\phi) are normalized. The first few radial wavefunction and their corresponding densities look like this.

Other visualizations of the wavefunctions are the xyxy-plane section plot below, where you can see the black lines representing the nodes (i.e. the zeros) of the wavefunctions.

Also see this website, which provides an interactive simulation of the orbitals: https://www.falstad.com/qmatom/.

Total wavefunction

Now that we painstakingly solved both the spin and spatial parts, we can put them together to officially have the full wavefunction of the hydrogen atom:

ψnlmlsms(r,θ,ϕ)=χsmsRnl(r)Ylml(θ,ϕ)\boxed{\psi_{nlm_{l}sm_{s}}(r,\theta,\phi)=\chi_{sm_{s}}R_{nl}(r)Y_{lm_{l}}(\theta,\phi)}

where mm was renamed to mlm_{l} to better distinguish it from the spin magnetic number msm_{s}. It is dependent on five quantum numbers, of which four variable:

  • nn is the principal quantum number (n=1,2,3,n=1,2,3,\ldots)
  • ll is the azimuthal quantum number (l<nl< n)
  • mlm_{l} is the magnetic azimuthal quantum number (lmll-l\leq m_{l}\leq l)
  • ss is the spin quantum number (s=1/2s=1/2 for electrons)
  • msm_{s} is the magnetic spin quantum number (s=±1s=\pm 1 for electrons)

Notably, the spatial part of the wavefunction is a usual complex valued function. However, the spin part is a vector, with two components. This means that the operation we're doing here is scalar multiplication of the spin vector by the spatial wavefunction: in other words, the total wavefunction is a two-dimensional vector, not a number.

For the curious, this is what the wavefunction looks like with all nested functions written out explicitly

ψnlmlsms(r,θ,ϕ)=χsms(2Znaμ)3(nl1)!2n[(n+l)!]3(2Znaμ)leZ/naμ(2Znaμ)2l1e2Z/naμ(nl1)!dnl1d(2Znaμ)nl1[(2Znaμ)n+le2Z/naμ]2l+14π(lml)!(l+ml)!eimlϕ(1cos2θ)ml/2dmdcosθm[12ll!(ddcosθ)l(cos2θ1)l]\begin{align} \psi_{nlm_{l}sm_{s}}(r,\theta,\phi)=&-\chi_{sm_{s}} \sqrt{ \left( \frac{2Z}{na_{\mu}} \right)^{3} \frac{(n-l-1)!}{2n[(n+l)!]^{3}} }\left( \frac{2Z}{na_{\mu}} \right)^{l}e^{-Z/na_{\mu}} \\ &\left( \frac{2Z}{na_{\mu}} \right)^{-2l-1}\frac{e^{ 2Z/na\mu }}{(n-l-1)!} \frac{d^{n-l-1}}{d\left( \frac{2Z}{na_{\mu}} \right)^{n-l-1}} \\ &\left[ \left( \frac{2Z}{na_{\mu}} \right)^{n+l}e^{-2Z/na_{\mu}} \right]\sqrt{\frac{2l+1}{4\pi} \frac{(l-m_{l})!}{(l+m_{l})!}}e^{im_{l}\phi}(1-\cos^{2}\theta)^{|m_{l}|/2} \\ &\frac{d^{\lvert m \rvert }}{d\cos \theta^{\lvert m \rvert }}\left[ \frac{1}{2^{l}l!} \left(\frac{d}{d\cos \theta}\right)^{l}(\cos^{2}\theta-1)^{l} \right] \end{align}

The fine structure

Although the wavefunction above provides good results, it still uses the Schrödinger equation as its starting point. The issue with this is that the Schrödinger equation does not consider the effects of relativity, and since electrons move with average speeds in the order of 1% of the speed of light, this leads to minor, but still significant discrepancies when comparing theory with experiment.

The formally correct way of handling this would be to start from relativistic quantum mechanics and the Dirac equation. However, that is beyond the scope of this section, and it is also historically misplaced. Before the Dirac equation was even developed, relativistic corrections to the hydrogen atom were already discovered and calculated by using perturbation theory, specifically time-independent perturbation theory. Namely, three corrections were developed: relativistic kinetic energy, spin-orbit coupling and the Darwin term. Each of these made the eigenvalues just a little closer to reality, matching the differences that we see in the laboratory. Together, these corrections are known as the fine structure of the hydrogen atom.

This is a very qualitative description of the fine structure. Please refer to e.g. Appendix 7 of Bransden & Joachaim for a rigorous derivation.

Our perturbed Hamiltonian will be

H=H0+HH=H_{0}+H'

where

H0=p22mZe24πε0rH_{0}=\frac{p^{2}}{2m}- \frac{Ze^{2}}{4\pi \varepsilon_{0}r}

is our usual Hamiltonian (the quantity, not the operator, note the lack of hat on HH) for a charged particle (our electron) in a central Coulomb potential and

HH1+H2+H3H'\equiv H_{1}'+H_{2}'+H_{3}'

will be the perturbation due to the three aforementioned corrections.

Relativistic kinetic energy

In relativity, the energy of the body is not just kinetic. We must use the full relativistic energy E=p2c2+m2c4E=\sqrt{ p^{2}c^{2}+m^{2}c^{4} }, from which we get the kinetic energy as

K=EE0=p2c2+m2c4mc2=mc21+p2m2c2mc2K=E-E_{0}=\sqrt{ p^{2}c^{2}+m^{2}c^{4} }-mc^{2}=mc^{2}\sqrt{ 1+ \frac{p^{2}}{m^{2}c^{2}} }-mc^{2}

If p2/mc21p^{2}/mc^{2}\ll 1, which is our case, we can expand the square root in a Taylor series of p2/mc2p^{2}/mc^{2}:

1+p2m2c21+12p2m2c218p4m4c4\sqrt{ 1+ \frac{p^{2}}{m^{2}c^{2}} }\simeq1+ \frac{1}{2} \frac{p^{2}}{m^{2}c^{2}}- \frac{1}{8} \frac{p^{4}}{m^{4}c^{4}}

Plugging this into the previous equation gives an approximate kinetic energy of

Kp22m18p4m3c2K\simeq \frac{p^{2}}{2m}- \frac{1}{8} \frac{p^{4}}{m^{3}c^{2}}

The first term is very clearly the classical kinetic energy, whereas the second is a term that is unique to relativity. This is the relativistic correction to kinetic energy:

H1=18p4m3c2\boxed{H_{1}'=- \frac{1}{8} \frac{p^{4}}{m^{3}c^{2}}}

The correction to the eigenvalues is given by the average of the perturbation in the generic orbital (i.e. the eigenstates of H0H_{0})

ΔE1=ψnlmlmsp48m3c2ψnlmlms=12mc2ψnlml(p22m)2ψnlml=12mc2ψnlml(H0+Ze24πεr)2ψnlml\begin{align} \Delta E_{1}&=\braket{ \psi_{nlm_{l}m_{s}} | - \frac{p^{4}}{8m^{3}c^{2}} | \psi_{nlm_{l}m_{s}} } \\ &=- \frac{1}{2mc^{2}}\braket{ \psi_{nlm_{l}} | \left( \frac{p^{2}}{2m} \right)^{2} | \psi_{nlm_{l}} } \\ &=- \frac{1}{2mc^{2}}\braket{ \psi_{nlm_{l}} | \left( H_{0}+ \frac{Ze^{2}}{4\pi \varepsilon r} \right)^{2} | \psi_{nlm_{l}} } \\ \end{align}

This equation can be further developed to get

ΔE1=En(Zα)2n2[34nl+1/2]\boxed{\Delta E_{1}=- E_{n} \frac{(Z\alpha)^{2}}{n^{2}}\left[ \frac{3}{4}- \frac{n}{l+1/2} \right]}

where α\alpha is a constant aptly known as the fine-structure constant.

Spin-orbit correction

The spin-orbit term takes the effect of the interaction between spin and angular momentum of the electron into account. The easiest way to reach it is by considering the frame of reference in which the electron is not moving and the nucleus is rotating around it. In such a frame, the electric and magnetic fields are

E=dVdrr^,B=1c2v×E\mathbf{E}=- \frac{dV}{dr}\hat{\mathbf{r}},\quad\mathbf{B}=- \frac{1}{c^{2}}\mathbf{v}\times \mathbf{E}

The magnetic field can be developed as

B=1c2v×(1rdVdr)r=1c2rdVdr(r×v)=1mc2rdVdrL\mathbf{B}=- \frac{1}{c^{2}}\mathbf{v}\times\left( - \frac{1}{r} \frac{dV}{dr} \right)\mathbf{r}=- \frac{1}{c^{2}r} \frac{dV}{dr}(\mathbf{r}\times \mathbf{v})=- \frac{1}{mc^{2}r} \frac{dV}{dr}\mathbf{L}

The intrinsic magnetic dipole moment of the electron is the Bohr magneton:

μB=emS\boldsymbol{\mu}_{B}=- \frac{e}{m}\mathbf{S}

Thus, the energy is given by the usual formula for a magnetic dipole:

H2=μBB=1mc2erdVdrLSH_{2}'=-\boldsymbol{\mu}_{B}\cdot \mathbf{B}=- \frac{1}{mc^{2}} \frac{e}{r} \frac{dV}{dr} \mathbf{L}\cdot \mathbf{S}

Evaluating the derivative gives the spin-orbit correction:

H2=Ze28m2c2πε0r3LS\boxed{H_{2}'=\frac{Ze^{2}}{8m^{2}c^{2}\pi \varepsilon_{0}r^{3}}\mathbf{L}\cdot \mathbf{S}}

This correction does not commute with neither LzL_{z} nor SzS_{z}, so to find the eigenvalue correction we must first identify some states that diagonalize H2H_{2}'. To do so, we introduce the total angular momentum

JL+S\mathbf{J}\equiv \mathbf{L}+\mathbf{S}

which associated quantum numbers jj and mjm_{j}. This allows us to write the Scalar product as

LS=12(J2L2S2)\mathbf{L}\cdot \mathbf{S}=\frac{1}{2}(\mathbf{J}^{2}-\mathbf{L}^{2}-\mathbf{S}^{2})

A good Basis is the one given by the states

ψnlmlmsml,ms12lmlmsjmjχmsψnlml\ket{\psi_{nlm_{l}m_{s}}} \equiv \sum_{m_{l},m_{s}}\braket{ \frac{1}{2}lm_{l}m_{s} | jm_{j} } \ket{\chi_{m_{s}}\psi_{nlm_{l}}}

The scalar product in the sum are a well-known set of numbers called the Clebsch-Gordan coefficients for s=1/2s=1/2. In these states we have the eigenvalue equations

{L2ψnljmj=l(l+1)ψnljmjS2ψnljmj=s(s+1)ψnljmj=34ψnljmjJ2ψnljmj=j(j+1)ψnljmjJzψnljmj=mjψnljmjH0ψnljmj=Enψnljmj\begin{cases} \mathbf{L}^{2}\ket{\psi_{nljm_{j}}} =\hbar l(l+1)\ket{\psi_{nljm_{j}}} \\ \mathbf{S}^{2}\ket{\psi_{nljm_{j}}} =\hbar s(s+1)\ket{\psi_{nljm_{j}}} =\frac{3}{4}\hbar \ket{\psi_{nljm_{j}}} \\ \mathbf{J}^{2}\ket{\psi_{nljm_{j}}} =\hbar j(j+1)\ket{\psi_{nljm_{j}}} \\ J_{z}\ket{\psi_{nljm_{j}}} =\hbar m_{j}\ket{\psi_{nljm_{j}}} \\ H_{0}\ket{\psi_{nljm_{j}}} =E_{n}\ket{\psi_{nljm_{j}}} \end{cases}

Since s=1/2s=1/2, the values for jj are j=s±1/2j=s\pm1/2, while mj=j,j+1,,j1,jm_{j}=-j,-j+1,\ldots,j-1,j as with all magnetic spin numbers. The energy correction can now be calculated:

ΔE2=ψnljmjH2ψnljmj=22[j(j+1)l(l+1)34]ψnljmjZe24m2c2πε0r3ψnljmj\Delta E_{2}=\braket{ \psi_{nljm_{j}} | H_{2}'|\psi_{nljm_{j}} } =\frac{\hbar^{2}}{2}\left[ j(j+1)-l(l+1)- \frac{3}{4} \right]\braket{ \psi_{nljm_{j}} | \frac{Ze^{2}}{4m^{2}c^{2}\pi \varepsilon_{0}r^{3}}| \psi_{nljm_{j}} }

It can be proven that

ψnljmj1r3ψnljmj=Z3a03n3l(l+12)(l+1)\braket{ \psi_{nljm_{j}} | \frac{1}{r^{3}} |\psi_{nljm_{j}} }=\frac{Z^{3}}{a_{0}^{3}n^{3}l\left( l+ \frac{1}{2} \right)(l+1)}

Using

En=mc22(Zα)2n2E_{n}=- \frac{mc^{2}}{2} \frac{(Z\alpha)^{2}}{n^{2}}

this leads to three possible values:

ΔE2={En2(Zα)2nl(l+12)(l+1)if j=l+12En2(Zα)2nl(l+12)(l+1)if j=l120if j=12\boxed{\Delta E_{2}=\left\{\begin{align} &-\frac{E_{n}}{2}\frac{(Z\alpha)^{2}}{ nl\left( l+ \frac{1}{2} \right)(l+1) }&\text{if }j=l+ \frac{1}{2} \\ &\frac{E_{n}}{2}\frac{(Z\alpha)^{2}}{ nl\left( l+ \frac{1}{2} \right)(l+1) }&\text{if }j=l- \frac{1}{2} \\ &0&\text{if }j=\frac{1}{2} \end{align}\right.}

Darwin term

The last correction is due to the Heisenberg inequality. Since the concept of "size" is not well-defined in quantum mechanics, the electron is non-localized and is not very well described by a point charge. The Darwin term consists of treating the electron as a cloud of charge described by a volume charge distribution. To do so, we starting by recalling that the usual electric potential for a point charge ee goes like

ϕ(r)er\phi(r)\sim \frac{e}{r}

We now introduce a new vector u\mathbf{u} which identifies the position of a point in the cloud, starting from the center of the cloud. If r\mathbf{r} is the center of the cloud, then r+u\mathbf{r}+\mathbf{u} is a point u\mathbf{u} away from the center. The corrected potential energy V~\tilde{V} will then be a function of r+u\mathbf{r}+\mathbf{u}:

V~(r+u)=qeϕ(r+u)dqe\tilde{V}(\mathbf{r}+\mathbf{u})=\int_{q_{e}}\phi(\mathbf{r}+\mathbf{u})dq_{e}

where we are integrating over the entire charge. We introduce the volume charge distribution we mentioned before as

ρ(u)eρ0(u)whereρ0(u) such that Veρ0(u)dq=1\rho(\mathbf{u})\equiv-e\rho_{0}(\mathbf{u})\quad\text{where}\quad \rho_{0}(\mathbf{u})\text{ such that }\int_{V_{e}} \rho_{0}(\mathbf{u})dq=1

This makes our corrected potential energy into

V~(r+u)=Veρ(u)ϕ(r+u)d3u=Veρ0(u)V(r+u)d3u\tilde{V}(\mathbf{r}+\mathbf{u})=\int_{V_{e}} \rho(\mathbf{u})\phi(\mathbf{r}+\mathbf{u})d^{3}u=\int_{V_{e}} \rho_{0}(\mathbf{u})V(\mathbf{r}+\mathbf{u})d^{3}u

We can expand V~\tilde{V} in a Taylor series in r+u\mathbf{r}+\mathbf{u} around r\mathbf{r} to get

V(r+u)V(r)+iVxi(r)ui+12ij2Vxixj(r)uiujV(\mathbf{r}+\mathbf{u})\simeq V(\mathbf{r})+\sum_{i} \frac{ \partial V }{ \partial x_{i} } (\mathbf{r})u_{i}+ \frac{1}{2}\sum_{ij} \frac{ \partial ^{2}V }{ \partial x_{i}\partial x_{j} }(\mathbf{r}) u_{i}u_{j}

We can plug this into the equation for V~\tilde{V} to get

V~(r)Veρ0(u)V(r)d3u+Veρ0(u)iVxi(r)uid3u+Veρ0(u)12ij2Vxixj(r)uiujd3u=V(r)+162V(r)Veρ0(u)u2du\begin{align} \tilde{V}(\mathbf{r})&\simeq\int_{V_{e}} \rho_{0}(\mathbf{u})V(\mathbf{r})d^{3}u+ \int_{V_{e}} \rho_{0}(\mathbf{u})\sum_{i} \frac{ \partial V }{ \partial x_{i} } (\mathbf{r})u_{i}d^{3}u+\int_{V_{e}} \rho_{0}(\mathbf{u})\frac{1}{2}\sum_{ij} \frac{ \partial ^{2}V }{ \partial x_{i}\partial x_{j} }(\mathbf{r}) u_{i}u_{j}d^{3}u \\ &=V(\mathbf{r})+ \frac{1}{6}\nabla ^{2}V(\mathbf{r})\int_{V_{e}}\rho_{0}(\mathbf{u})u^{2}du \end{align}

Let's assume that the potentials takes the shape given by

ρ0(u)={(43πλ2π)1if uλ2π0if u>λ2π\rho_{0}(u)=\begin{cases} \left( \frac{4}{3}\pi \frac{\lambda}{2\pi} \right)^{-1}&\text{if }u\leq \frac{\lambda}{2\pi} \\ 0&\text{if }u> \frac{\lambda}{2\pi} \end{cases}

In other words, it fades off spherically, with a radius given by the reduced wavelength λ/2π\lambda/2\pi. We also know that the Laplacian of 1/r1/r is

2(1r)=δ(r)\nabla ^{2}\left( \frac{1}{r} \right)=\delta(\mathbf{r})

so, a three-dimensional Dirac delta. Putting this together and solving results in the Darwin term:

H3=π22m2c2(Ze24πε0)δ(r)\boxed{H_{3}'=\frac{\pi \hbar^{2}}{2m^{2}c^{2}}\left( \frac{Ze^{2}}{4\pi \varepsilon_{0}} \right)\delta(\mathbf{r})}

The presence of a Dirac delta here is critical: since the delta kills every value outside of r=0\mathbf{r}=0, the only time this correction matters is, well, when we are evaluating r=0\mathbf{r}=0. This is significant because if you go back to look at the wavefunction we've derived above, the only ones which are nonzero in the origin are the ss orbitals, so when l=0l=0. Everything else is zero in the origin, which means that this correction does not matter to them. The Darwin term is just an ss orbital correction.

Using this knowledge allows us to calculate the eigenvalue difference in just the ss orbitals, which is to say when l=0,m=0l=0,m=0:

ΔE3=π22m2c2Ze24πε0ψn00δ(r)ψn00=π22m2c2Ze24πε0ψn00(r)2=mc22(Zα)2n2(Zα)2n\begin{align} \Delta E_{3}&=\frac{\pi \hbar^{2}}{2m^{2}c^{2}} \frac{Ze^{2}}{4\pi \varepsilon_{0}}\braket{ \psi_{n00} | \delta(\mathbf{r}) | \psi_{n00} } \\ &=\frac{\pi \hbar^{2}}{2m^{2}c^{2}} \frac{Ze^{2}}{4\pi \varepsilon_{0}}\lvert \psi_{n00}(\mathbf{r}) \rvert ^{2} \\ &=\frac{mc^{2}}{2} \frac{(Z\alpha)^{2}}{n^{2}} \frac{(Z\alpha)^{2}}{n} \end{align}

And hence

ΔE3=En(Zα)2nif l=0\boxed{\Delta E_{3}=-E_{n} \frac{(Z\alpha)^{2}}{n}\quad\text{if }l=0}

Putting it all together

Now that we have all three corrections, we can put the together to find the total correction to the energy eigenvalues:

ΔE=ΔE1+ΔE2+ΔE3=En(Zα)2n2(nj+1234)\boxed{\Delta E=\Delta E_{1}+\Delta E_{2}+\Delta E_{3}=E_{n} \frac{(Z\alpha)^{2}}{n^{2}}\left( \frac{n}{j+ \frac{1}{2}}- \frac{3}{4} \right)}

and adding this to the energy eigenvalues themselves we get the fine structure energy states:

Enj=En[1+(Zα)2n2(nj+1234)]\boxed{E_{nj}=E_{n}\left[ 1+ \frac{(Z\alpha)^{2}}{n^{2}}\left( \frac{n}{j+ \frac{1}{2}}- \frac{3}{4} \right) \right]}

As we can see, there is no direct dependence on the azimuthal angular momentum: the only thing that matters is the total angular momentum. As such, despite the fine structure leading to a great deal of splitting in the energy spectrum, it does not fully eliminate the degeneracy in ll.

We can see that, since α1/137\alpha \simeq 1/137, its square is α2104\alpha ^{2}\sim10^{-4}. As such, when Z=1Z=1 (so, when dealing with hydrogen), the correction is quite small, in the order of 104 eV10^{-4}\text{ eV}. Recalling that the ground state energy is the Rydberg energy, at about 13.6 eV-13.6\text{ eV}, it's quite a small contribution indeed. However, do note the dependence on ZZ. This derivation is not just for the hydrogen atom, but rather all hydrogenic atoms. This includes any atom with just a single valence electron, which for instance includes all alkali metals. Thus, while the fine structure may not be all that significant for hydrogen outside of high-precision experiments or extreme conditions, it may be quite relevant for heavier hydrogenic atoms, such as cesium (symbol: 55Cs^{55}\text{Cs}) for which Z2=552=3025103Z^{2}=55^{2}=3025\sim 10^{3}, thus causing the whole term to be 0.1 eV\sim0.1\text{ eV}.

Also, the relativistic correction is strictly positive, which means that the corrected states are more strongly bound (since EnE_{n} is negative). In a sense, relativity makes it harder to break an electron off its atom. This becomes progressively less and less relevant as you climb the states, due to the n2n^{-2} dependence, which means that the ground state (n=1n=1) is the most affected.

State transitions

Once the stationary states themselves are determined, it's fair to ask how the electron goes from one to another. After all, there's little point in knowing all possible states if the electron is perpetually confined to the ground state. As such, we now take on the investigation of state transitions of the electron.

Electromagnetic field

Of course, since stationary states are stable, we need some external influence to trigger a state transition. We will choose the application of an external electric field E\boldsymbol{\mathcal{E}} and magnetic field B\boldsymbol{\mathcal{B}} for this section, respectively determined by an electric potential ϕ\phi and a magnetic vector potential A\mathbf{A}. The Hamiltonian of an point charge qq of mass mm in these fields is

H=12m(pqA)2+qϕH=\frac{1}{2m}(\mathbf{p}-q\mathbf{A})^{2}+q\phi

(ignoring small spin-dependent terms.) Changing qeq\to-e and plugging this into the time-dependent Schrödinger equation yields

itψ(r,t)=[12m(i+eA)2Ze24πε0r]ψ(r,t)i\hbar \frac{ \partial }{ \partial t } \psi(\mathbf{r},t)=\left[ \frac{1}{2m}(-i\hbar \nabla+e\mathbf{A})^{2}- \frac{Ze^{2}}{4\pi \varepsilon_{0} r} \right]\psi(\mathbf{r},t)

Before we move on: the entirety of electrodynamics hinges on an arbitrary choice, which we called gauge freedom. Though proof won't be given here, the Schrödinger equation is gauge invariant, which means that regardless of the gauge we choose, the equation does not change. The only thing that changes is that the resulting wavefunction acquires a phase, and different gauges lead to different phases. However, since phases vanish when taking the square modulus ψ(r,t)2\lvert \psi(\mathbf{r},t) \rvert^{2}, it makes no physical difference. For this section, we choose to fix the gauge to A=0\nabla\cdot \mathbf{A}=0, known as the Coulomb gauge.

Theoretical calculation of the transition probabilities is arduous and only a surface level analysis will be reported here. See Bransden & Joachaim Chapters 4.1 and 4.2 for more discussion.

We'll assume that the incident perturbation is a nearly monochromatic pulse of radiation peaked about some angular frequency ω0\omega_{0} and nonzero only in a small region of width Δω\Delta \omega. As usual, a generic pulse can be represented as a superposition of plane waves of frequency ω\omega, which we assume to all share the same wavevector k^\hat{\mathbf{k}} and (linear) polarization vector ε^\hat{\varepsilon} (the two are orthogonal, k^ε^=0\hat{\mathbf{k}}\cdot \hat{\varepsilon}=0). A plane wave is solved by the real vector potential

A(ω;r,t)=A0(ω)[ei(krωt+δω)ei(krωt+δω)]\mathbf{A}(\omega;\mathbf{r},t)=\mathbf{A}_{0}(\omega)[e^{i(\mathbf{k}\cdot \mathbf{r}-\omega t+\delta_{\omega})}-e^{-i(\mathbf{k}\cdot \mathbf{r}-\omega t+\delta_{\omega})}]

with A0(ω)=A0(ω)ε^\mathbf{A}_{0}(\omega)=A_{0}(\omega)\hat{\varepsilon}. δω\delta_{\omega} is a real phase term. The superposition in therefore

A(ω;r,t)=ΔωA0(ω)ε^[ei(krωt+δω)ei(krωt+δω)]dω\mathbf{A}(\omega;\mathbf{r},t)=\int_{\Delta \omega}A_{0}(\omega)\hat{\varepsilon}[e^{i(\mathbf{k}\cdot \mathbf{r}-\omega t+\delta_{\omega})}-e^{-i(\mathbf{k}\cdot \mathbf{r}-\omega t+\delta_{\omega})}]d\omega

Since we're looking for the state transition caused by this perturbation, we set the time-dependent wavefunction to be generated by the usual superposition

Ψ(r,t)=kck(t)ψk(t)eiEkt/\Psi(\mathbf{r},t)=\sum_{k}c_{k}(t)\psi_{k}(\mathbf{t})e^{-iE_{k}t/\hbar}

where ck(t)c_{k}(t) are the coefficients of the expansion, ψk\psi_{k} are the stationary states and EkE_{k} are the eigenstates. We already know the ψk\psi_{k} and EkE_{k}, so to find the time evolution, we need to solve for the ck(t)c_{k}(t). We do this with time-dependent perturbation theory and by ignoring the A2\mathbf{A}^{2} terms, which leaves a perturbation of

H(t)=iemA(t)H'(t)=- \frac{i\hbar e}{m}\mathbf{A}(t)\cdot \nabla

The coefficients satisfy

c˙b(t)=ikψbH(t)ψkck(t)eiωbkt\dot{c}_{b}(t)=- \frac{i}{\hbar}\sum_{k}\braket{ \psi_{b} | H'(t) | \psi_{k} } c_{k}(t)e^{i\omega_{bk}t}

where ωbk=(EbEk)/\omega_{bk}=(E_{b}-E_{k})/\hbar. If we take the system to initially be in the stationary state ψa\ket{\psi_{a}} of energy EaE_{a}, to first order, the coefficients are

cb(1)(t)=i0tψbHψaeiωbatdtc_{b}^{(1)}(t)=- \frac{i}{\hbar}\int_{0}^{t}\braket{ \psi_{b} | H'|\psi_{a} } e^{i\omega_{ba}t'}dt'

Solving this for the A(t)\mathbf{A}(t) above yields

cb(1)(t)=emΔωA0(ω)[eiδωψbeikrε^ψa0tei(ωbaω)tdt++eiδωψbeikrε^ψa0tei(ωba+ω)tdt]dω\begin{align} c_{b}^{(1)}(t)=- \frac{e}{m}\int_{\Delta \omega}A_{0}(\omega)&\Big[ e^{i\delta_{\omega}}\braket{ \psi_{b} | e^{i\mathbf{k}\cdot \mathbf{r}}\hat{\varepsilon}\cdot \nabla| \psi_{a} }\int_{0}^{t}e^{i(\omega_{ba}-\omega)t'}dt'+ \\ &+e^{-i\delta_{\omega}}\braket{ \psi_{b} | e^{-i\mathbf{k}\cdot \mathbf{r}}\hat{\varepsilon}\cdot \nabla|\psi_{a} }\int_{0}^{t}e^{i(\omega_{ba}+\omega)t'}dt' \Big]d\omega \end{align}

The square brackets contain two terms: the first is the absorption term, which is nonzero only if Eb=Ea+ωE_{b}=E_{a}+\hbar \omega, and the second is the emission term, which is nonzero only if Eb=EaωE_{b}=E_{a}-\hbar \omega. This is because the first integral over dtdt' vanishes in realistic scenarios where the duration of the pulse is far larger than the periodic time 2π/ωba2\pi/\omega_{ba}, thus leading that integral to be negligible unless ωωba\omega \simeq \omega_{ba}, which happens precisely when Eb=Ea+ωE_{b}=E_{a}+\hbar \omega. Similarly, the second integral is negligible unless ωωba\omega \simeq-\omega_{ba} and so Eb=EaωE_{b}=E_{a}-\hbar \omega.

This shouldn't come as a surprise: after all, energy is conserved, so the state transition must absorb or emit precisely the difference in energy between the states.

In an absorption process, when only the first half of the square brackets is nonzero, the probability that an atom will be found in a certain state upon measurement goes like

cb(1)(t)2Δω[eA0(ω)m]2Mba(ω)2\lvert c_{b}^{(1)}(t) \rvert ^{2}\sim \int_{\Delta \omega}\left[ \frac{eA_{0}(\omega)}{m} \right]^{2}\lvert M_{ba}(\omega) \rvert ^{2}

where MbaM_{ba} is a matrix element and is given by

Mba=ψbeikrε^ψa=ψb(r)eikrε^ψa(r)drM_{ba}=\braket{ \psi_{b} | e^{i\mathbf{k}\cdot \mathbf{r}}\hat{\varepsilon}\cdot \nabla|\psi_{a} }=\int \psi_{b}^{*}(\mathbf{r})e^{i\mathbf{k}\cdot \mathbf{r}}\hat{\varepsilon}\cdot \nabla \psi_{a}(\mathbf{r})d\mathbf{r}

This allows us to define the absorption transition probability

Wba=4π2m2c(e24πε0)I(ωba)ωba2Mba(ωba)2W_{ba}=\frac{4\pi ^{2}}{m^{2}c}\left( \frac{e^{2}}{4\pi \varepsilon_{0}} \right) \frac{I(\omega_{ba})}{\omega ^{2}_{ba}}\lvert M_{ba}(\omega_{ba}) \rvert^{2}

where

I(ω)=ΔωI0(ω)dω=Δω2ε0ω2cA02(ω)dωI(\omega)=\int_{\Delta \omega}I_{0}(\omega)d\omega=\int_{\Delta \omega}2\varepsilon_{0}\omega ^{2}cA_{0}^{2}(\omega)d\omega

The transition probability is also bound to its absorption Cross section by dividing the rate at which energy is absorbed by the atom, ωbaWba\hbar \omega_{ba}W_{ba} by the intensity I(ωba)I(\omega_{ba}) to get

σba=4π2α2m2ωbaMba(ωba)2\sigma_{ba}=\frac{4\pi ^{2}\alpha \hbar^{2}}{m^{2}\omega_{ba}}\lvert M_{ba}(\omega_{ba}) \rvert ^{2}

Emission is complicated by the fact that there are two kinds of emission: stimulated and spontaneous. For stimulated emission, the transition rate is

Wab=4π2m2c(e24πε0)I(ωab)ωba2Mab(ωba)2W_{ab}=\frac{4\pi ^{2}}{m^{2}c}\left( \frac{e^{2}}{4\pi \varepsilon_{0}} \right) \frac{I(\omega_{ab})}{\omega ^{2}_{ba}}\lvert M_{ab}(\omega_{ba}) \rvert ^{2}

and similarly

σab=4π2α2m2ωabMab(ωba)2\sigma_{ab}=\frac{4\pi ^{2}\alpha \hbar^{2}}{m^{2}\omega_{ab}}\lvert M_{ab}(\omega_{ba}) \rvert ^{2}

Spontaneous emission is more difficult to explain due to the inherent quantum nature of it, requiring quantum electrodynamics to properly solve. In fact, spontaneous emission is actually not a "real" process, in that it is actually stimulated emission, where the stimulating field is the vacuum quantum field, which is always present.

The dipole approximation

The calculation of the matrix element Mba=ψbeikrε^ψaM_{ba}=\braket{ \psi_{b} | e^{i\mathbf{k}\cdot \mathbf{r}}\hat{\varepsilon}\cdot \nabla|\psi_{a} } can be greatly simplified by making an assumption on how the field looks like. Namely, we ask that the field is essentially uniform at atomic scales, so that the field does not change in space over the volume of the atom (but still changes in time of course). The characteristic size of the atom is on the order of 1010 m\sim10^{-10}\text{ m}, so we ask that the wavelength of the incoming radiation is considerably larger than this6. In so doing, we allow ourselves to expand the eikre^{i\mathbf{k}\cdot\mathbf{r}} in the matrix element in its exponential series:

eikr=1+ikr+12(ikr)2+e^{i\mathbf{k}\cdot \mathbf{r}}=1+i\mathbf{k}\cdot \mathbf{r}+ \frac{1}{2}(i\mathbf{k}\cdot \mathbf{r})^{2}+\ldots

For large enough wavelengths, and thus small enough k\mathbf{k} compared to r\mathbf{r}, we might as well say eikr1e^{i\mathbf{k}\cdot \mathbf{r}}\simeq 1. The expression for a transition probability can thus be expressed in terms of rba=ψbrψa\mathbf{r}_{ba}=\braket{ \psi_{b} | \mathbf{r} | \psi_{a} }, where we wrote the nabla operator through its first order equation of motion. This yields

Wba=4π2c2(e24πε0)I(ωba)ε^rba2W_{ba}=\frac{4\pi ^{2}}{c\hbar ^{2}}\left( \frac{e^{2}}{4\pi \varepsilon_{0}} \right)I(\omega_{ba})\lvert \hat{\varepsilon}\cdot \mathbf{r}_{ba} \rvert ^{2}

Now, this new term rba\mathbf{r}_{ba} can be shown to be not null only in very specific scenarios, which are when the new and old state satisfy the following two conditions:

Δl=lbla=±1,Δm=mbma=0,±1\boxed{\Delta l=l_{b}-l_{a}=\pm 1,\quad \Delta m=m_{b}-m_{a}=0,\pm 1}

But then the transition probability must follow these conditions: we call these selection rules, as they allow us to determine which transitions are allowed and which aren't, for if these rules are not met, then Wba=0W_{ba}=0 and the transition is impossible. These rules actually happen to be quite general, despite having derived them from a specific approximation in a specific kind of field and specifically for electrons. These even function, for instance, in transitions between vibrational states of molecules.

Footnotes

  1. The angular momentum operator is found by quantizing the classical quantities of rr and pp, which become r^\hat{r} and i-i\hbar \nabla respectively (\nabla is the Gradient). So, we get L^=i(r^×)\hat{L}=-i\hbar(\hat{r}\times \nabla).

  2. If you recall the general solution, this extra term is known as the "centrifugal term". It represent the fact that more intense rotations (high ll) tend to "push" outwards more. It's to quantum mechanics what the centrifugal force is to classical mechanics.

  3. Strictly speaking, the finiteness of the wavefunction is not mandatory. What is mandatory is that all possible physical states are described by a complete orthogonal set of wavefunctions. However, for many potentials, including the central electromagnetic one, this condition can be proven to simplify down to u(0)=0u(0)=0.

  4. It can be solved using the generating function of the Laguerre polynomials. See for instance Bransden & Joachaim Appendix 3. Be careful with definitions, as Bransden uses the alternative definition of the polynomials which omits 1/q!1/q!, whereas this article includes it.

  5. We have xρx\to\rho, m2l+1m\to2l+1 and nnl1n\to n-l-1. Be careful of the definition: the handbook's definition of the polynomials omits 1/n!1/n! whereas we include it, so we need to do two things before anything else: divide the polynomial by n!n! (which becomes (n!)2(n!)^{2} due to the square) and convert nn+mn\to n+m. Then do the substitutions above.

  6. What "considerably" means is clearly quite arbitrary. It depends, as always, on how precise one wants to be. Most everyday radiation, such as visible light is well over the threshold, as its wavelength is around 106 m\sim 10^{-6}\text{ m}. As such, the dipole approximation is suitable for quantum optics.