The hydrogen atom is the atom of the element hydrogen, a quantum physical system made up of a central positive charge carrier (a proton) and a negative charge carrier (an electron) surrounding it. It is the simplest case of a broader class of objects called hydrogenic atoms (or hydrogen-like atoms) which may possess a stronger central charge, that is, a full atomic nucleus.
The Hamiltonian for the Schrödinger equation of the hydrogen atom is
where is the mass of the nucleus, is the mass of the electron, is the reduced Planck constant, is the atomic number (the number of protons), is the vacuum permittivity and is the distance of the electron from the origin. The last term is the potential energy due to electromagnetic attraction.
Derivation#
First things first, the end goal is finding the wavefunction of the system. We'll solve it in position representation, . This wavefunction is of course dependent on the spatial coordinates, but since the electron carries Spin ( and specifically), this will also be parameter. If we call the generalized coordinates of the wavefunction, we can write it as .
However, the Hamiltonian has no spin term. and never appear. This suggests that the wavefunction that we get by solving is actually just the spatial part, with the spin part being a separate piece. Using separation of variables, we can split the two:
We can now solve these separately.
The spin part#
Since the Hamiltonian does not affect or depend on spin in any way, the spin part is just the eigenstates of the generic spin operator :
Thankfully, we know the spin: it's , and this is the special case that is already solved in Spin > Spin 1/2. We know the eigenvectors of both and in spin 1/2: they're somewhat up to preference, so we'll choose
And that's the spin part done already. The real challenge is solving the spatial part.
The spatial part#
Since we have a central potential, the system lends itself well to be rewritten in spherical coordinates, as the Schrödinger equation has a general solution in this form. To do so, we make a first Coordinate transformation to the frame of reference of the center of mass. We define the reduced mass as usual:
Then, using and expressing the Laplacian in the relative coordinate , the equation becomes
The benefit is that now we only depend on one degree of freedom () instead of two (). (Effectively, it's a single particle located in of mass subject to the potential). Then, the spherical coordinate transformation from Cartesian coordinates is
With this we can express the Laplacian in spherical coordinates. Writing the potential as for brevity, the Hamiltonian operator in spherical coordinates is
We now introduce what is arguably the key component of the system: the quantum angular momentum 1. It is so important because if we express its Gradient in spherical coordinates, then take the square, we get the equation
We can see that comes up exactly in the Hamiltonian, so with a simple substitution
With this, the spherical Schrödinger equation for the hydrogen atom is
where the angular dependence of the system is entirely contained inside the angular momentum. As usual, the square angular momentum, the momentum on one axis () and the Hamiltonian all commute, , so they all share simultaneous eigenstates. This sets the foundation of our final solution.
For a general solution of this equation in spherical coordinates, using a generic potential , see Equazione di Schrödinger > Soluzione tridimensionale. In there, we set , find that the angular component of the solution is universal, completely independent on the potential and described by the Spherical harmonics. The radial component instead depends on and we must solve it uniquely for each the potential. As such, our next goal is to solve the radial component for our . But first, a refresher on the angular part.
Angular part#
The angular components depend on the eigenstates of and and are found using the spherical harmonics :
Using spherical harmonics on is a bit overkill, since it's really only dependent on . In the general derivation, one finds that the harmonics themselves can go through separation of variables as . The function is pretty easy to solve and is found to be . The function is more complicated is requires invoking the associated Legendre functions to get ( is some constant). As such, we can rewrite the eigenstates as
In the context of the hydrogen atom, the numbers and are given special names: is the azimuthal quantum number and is the magnetic azimuthal quantum number.
Still, spherical harmonics are generally complex functions. We know very well that only real numbers can be measured from a system, so we need to bridge the gap. We do the usual: take the square modulus
Luckily, the modulus of is very easy: . (Notably, it is completely independent of .) Unluckily, the square modulus of isn't. They can, for instance, be found computationally and plotted in graphs or animations. Analytically, we'll remain with
An important detail: since the spherical harmonics for a given is spherically symmetrical over all , so is the charge distribution (see Spherical harmonics > Symmetry).
Radial part#
The radial part of the equation is is
where we used and rewrote the partial derivatives as regular derivatives since is now the only variable. We set to simplify things:
Notice how there is a term dependent on the angular momentum number even in the radial term. What we do is to combine it with the potential term in a single, effective potential defined as2
which reduces the equation even further to
or equivalently
This potential looks something like this:
Note that for , the potential is purely attractive: nothing is stopping the electron from falling and crashing into the nucleus. But if we add a rotation (), suddenly there is a spike of potential beyond a certain point which explains why electrons don't just collide with the nucleus. The value of at the bottom of each potential well for should, in principle at least, correspond to the "allowed orbits" that the Bohr model identified. As always, the system is bound if and scattering if .
Now, solving the radial part is not exactly trivial. First, to solve a (1D) differential equation like the Schrödinger equation we need a boundary condition, which we have not set up until now. We'll set , as that prevents the wavefunction from blowing up to infinity when 3.
For convenience, we introduce a couple of dimensionless variables
This leads to
and so
If we substitute these in the equation (with expanded), we get
Noticing in the middle leads us to
A much simpler form, but still quite abstract. Instead of diving head-first into solving it, we can start by analyzing asymptotic behavior. When , the equations greatly simplifies to
This a linear second order ODE of the form and the solution is an exponential. Thus, for high , we have
Using the fact that must be bounded everywhere, we keep only the negative sign. Since we have an asymptotic solution, we can claim that the general solution behaves in the same manner as the asymptote, but scaled by some other function . Thus, in general, in and out of the asymptote:
By definition, must contain all of the information about and , since those are not present in the asymptotic term: . Thus, since
our new equation to solve is
Our next problem is finding .
Finding #
To start, we again look at asymptotic behavior:
This can only be true if . In fact, using the definition of :
But only the dominant term matters in the limit so this is like saying
Since for all , this is true only if , which proves that .
Outside of the asymptote, must behave something like where is some function. As an added note, this means that our current understanding of is . We now substitute this in to get
This equation can be solved analytically by expanding in its power series
Taking the derivatives of this power series yields
Split the sum like so
and the first sum is further split as
Changing the index from to in the first yields the equivalent form
Plug this in, combine the sums again and collect to get
This is true for all , which means that each component must be zero termwise:
which yields the recurrence relation
For large , the ratio is approximately
As it happens, this ratio is the same as that of the series for with . This implies that has this very behavior
Combined with our previous result , this implies that the asymptotic behavior of is something of the form . But this can't be, for it would break our previous assumption of (and by extension the normalizability of the wavefunction). Thus the series must terminate early, which is to say that it must be a polynomial in . We shall call the rank of the polynomial, the radial quantum number, and it is a nonnegative integer (possibly zero). This means that any terms of index must be zero and that the highest term is .
Using the recurrence relation we can state
Multiply both sides by the denominator to get
but we know that , so it must be that
We'll rename this to
and we shall call it the principal quantum number. Since and , we have .
Eigenvalues#
Now that we finally know what is, we can invert its definition to extract , substitute for and get the energy eigenvalues:
where we used the Bohr radius and modified Bohr radius . Thus, after all this derivation, we have the expression for the allowed energy levels of the hydrogen atom:
Notably, the energy eigenvalues are not dependent on (not directly, at least). This means that the energy eigenstates are degenerate in too, not just . For each there are values of , and for each there are values of , which leads to a total eigenvalue degeneracy of
The degeneracy with respect to is universal for spherically symmetric systems, but the degeneracy in is actually specific to the Coulomb potential, or rather potentials that go like . In fact, were the potential to behave differently, the degeneracy would be lifted and we'd have distinct states for each principal level . This happens, for instance, in the outer electrons of an atom with more than one electron shell, which are also subject to the repulsion of inner electrons, changing the effective potential to a different shape than . You can see this in the Many-electron atom article. In a sense, the degeneracy has little to do with the shape of an atom specifically: anything around a point charge would work the same.
The levels themselves are infinite and tend towards zero as you get further from the nucleus: this shouldn't be surprising considering that the electromagnetic potential of a point charge (the thing that is responsible for keeping the electron anchored to the nucleus) falls off with distance. They also get progressively closer to one another for larger . The difference between and is bigger than that between and and so on.
Furthermore, these energy levels are identical to those obtained from the Bohr model of the atom. Despite being a (semi)classical model, it already got the quantum levels correct down to the equation. But what it couldn't do, however, are the eigenstates themselves.
Eigenstates#
To compute the eigenstate, we look back at . Earlier, we looked only at the asymptotic behavior, which led us to the definition of the radial and principal quantum numbers and . We now want to find the actual solutions themselves. This is a second order Ordinary differential equation in and luckily, a similar equation has already been solved in the past.
The (generalized) Laguerre's differential equation is the following second order ODE:
whose solutions are the associated Laguerre polynomials
for some nonnegative integers and . This should ring a bell: it is basically just with slightly different variables. In fact, if we equate the differences we get
Thus, the physically valid solutions of are
where is some Normalization constant. Substituting for and substituting into (after unwinding all the functions we've added)
where in the last step we have absorbed inside since they are both constants. Calculating the constant (see below) yields the radial wavefunction:
Spatial wavefunction#
This completes our proof: we've assembled all the pieces we need to determine the spatial wavefunction of the hydrogenic atom, which is the product of the the radial part and the angular part:
where
> since spherical harmonics are already normalized. Then > $$1=\lvert A \rvert^{2} \int_{0}^{\infty} \rho^{2l}e^{-\rho}[L_{n-l-1}^{2l+1}(\rho)]^{2} r^{2} dr> and get > $$1=\lvert A \rvert^{2} \left( \frac{na_{\mu}}{2Z} \right)^{3}\int_{0}^{\infty}\rho^{2l}e^{-\rho}[L_{n-l-1}^{2l+1}(\rho)]^{2}\rho ^{2}d\rhoUse the definition of to substitute
> If we plug this in the former equation, extract $\lvert A \rvert^{2}$, take the square root and choose the negative solution, we get > $$A=-\sqrt{ \left( \frac{2Z}{na_{\mu}} \right)^{3} \frac{(n-l-1)!}{2n(n+l)!} }Now, this integral is very hard4. Fortunately, someone else did the math already: in the Mathematical Handbook of Formulas and Tables, a special result for the associated Laguerre polynomials (Eq. 30.50) precisely solves this integral. After due substitutions5, the result is
\int_{0}^{\infty}\rho^{2l+2}e^{-\rho}[L_{n-l-1}^{2l+1}(\rho)]^{2}d\rho&=\frac{[2(n-l-1)+2l+2](n-l-1+2l+1)!}{(n-l-1)!} \\ &=\frac{2n(n+l)!}{(n-l-1)!} \end{align}
Results#
Each of these wavefunctions represents a possible state of the electron around the nucleus, bringing along the baggage of all the probabilities of where it might be at any given time while "orbiting" the nucleus. In fact, this classical concept of "orbit" remained so prevalent throughout the years that it gave the special name we use to refer to these wavefunctions: orbitals. Common in chemistry too, we use a specific naming scheme to specify which orbital we are talking about, called spectroscopic notation.
As is, the orbitals have limited intepretability. But as with all quantum mechanical system, the square modulus of gives the Probability of finding the system in a given state. Specifically, is the probability of finding the electron in an infinitesimal volume element and is thus effectively the (shape of the) volume charge distribution of the hydrogen atom. Meanwhile, the probability of finding it at a distance anywhere around the nucleus is found by integrating over all angles. This leads to what's known as a density function
since are normalized. The first few radial wavefunction and their corresponding densities look like this.
Other visualizations of the wavefunctions are the -plane section plot below, where you can see the black lines representing the nodes (i.e. the zeros) of the wavefunctions.
Also see this website, which provides an interactive simulation of the orbitals: https://www.falstad.com/qmatom/.
Total wavefunction#
Now that we painstakingly solved both the spin and spatial parts, we can put them together to officially have the full wavefunction of the hydrogen atom:
where was renamed to to better distinguish it from the spin magnetic number . It is dependent on five quantum numbers, of which four variable:
- is the principal quantum number ()
- is the azimuthal quantum number ()
- is the magnetic azimuthal quantum number ()
- is the spin quantum number ( for electrons)
- is the magnetic spin quantum number ( for electrons)
Notably, the spatial part of the wavefunction is a usual complex valued function. However, the spin part is a vector, with two components. This means that the operation we're doing here is scalar multiplication of the spin vector by the spatial wavefunction: in other words, the total wavefunction is a two-dimensional vector, not a number.
For the curious, this is what the wavefunction looks like with all nested functions written out explicitly
The fine structure#
Although the wavefunction above provides good results, it still uses the Schrödinger equation as its starting point. The issue with this is that the Schrödinger equation does not consider the effects of relativity, and since electrons move with average speeds in the order of 1% of the speed of light, this leads to minor, but still significant discrepancies when comparing theory with experiment.
The formally correct way of handling this would be to start from relativistic quantum mechanics and the Dirac equation. However, that is beyond the scope of this section, and it is also historically misplaced. Before the Dirac equation was even developed, relativistic corrections to the hydrogen atom were already discovered and calculated by using perturbation theory, specifically time-independent perturbation theory. Namely, three corrections were developed: relativistic kinetic energy, spin-orbit coupling and the Darwin term. Each of these made the eigenvalues just a little closer to reality, matching the differences that we see in the laboratory. Together, these corrections are known as the fine structure of the hydrogen atom.
This is a very qualitative description of the fine structure. Please refer to e.g. Appendix 7 of Bransden & Joachaim for a rigorous derivation.
Our perturbed Hamiltonian will be
where
is our usual Hamiltonian (the quantity, not the operator, note the lack of hat on ) for a charged particle (our electron) in a central Coulomb potential and
will be the perturbation due to the three aforementioned corrections.
Relativistic kinetic energy#
In relativity, the energy of the body is not just kinetic. We must use the full relativistic energy , from which we get the kinetic energy as
If , which is our case, we can expand the square root in a Taylor series of :
Plugging this into the previous equation gives an approximate kinetic energy of
The first term is very clearly the classical kinetic energy, whereas the second is a term that is unique to relativity. This is the relativistic correction to kinetic energy:
The correction to the eigenvalues is given by the average of the perturbation in the generic orbital (i.e. the eigenstates of )
This equation can be further developed to get
where is a constant aptly known as the fine-structure constant.
Spin-orbit correction#
The spin-orbit term takes the effect of the interaction between spin and angular momentum of the electron into account. The easiest way to reach it is by considering the frame of reference in which the electron is not moving and the nucleus is rotating around it. In such a frame, the electric and magnetic fields are
The magnetic field can be developed as
The intrinsic magnetic dipole moment of the electron is the Bohr magneton:
Thus, the energy is given by the usual formula for a magnetic dipole:
Evaluating the derivative gives the spin-orbit correction:
This correction does not commute with neither nor , so to find the eigenvalue correction we must first identify some states that diagonalize . To do so, we introduce the total angular momentum
which associated quantum numbers and . This allows us to write the Scalar product as
A good Basis is the one given by the states
The scalar product in the sum are a well-known set of numbers called the Clebsch-Gordan coefficients for . In these states we have the eigenvalue equations
Since , the values for are , while as with all magnetic spin numbers. The energy correction can now be calculated:
It can be proven that
Using
this leads to three possible values:
Darwin term#
The last correction is due to the Heisenberg inequality. Since the concept of "size" is not well-defined in quantum mechanics, the electron is non-localized and is not very well described by a point charge. The Darwin term consists of treating the electron as a cloud of charge described by a volume charge distribution. To do so, we starting by recalling that the usual electric potential for a point charge goes like
We now introduce a new vector which identifies the position of a point in the cloud, starting from the center of the cloud. If is the center of the cloud, then is a point away from the center. The corrected potential energy will then be a function of :
where we are integrating over the entire charge. We introduce the volume charge distribution we mentioned before as
This makes our corrected potential energy into
We can expand in a Taylor series in around to get
We can plug this into the equation for to get
Let's assume that the potentials takes the shape given by
In other words, it fades off spherically, with a radius given by the reduced wavelength . We also know that the Laplacian of is
so, a three-dimensional Dirac delta. Putting this together and solving results in the Darwin term:
The presence of a Dirac delta here is critical: since the delta kills every value outside of , the only time this correction matters is, well, when we are evaluating . This is significant because if you go back to look at the wavefunction we've derived above, the only ones which are nonzero in the origin are the orbitals, so when . Everything else is zero in the origin, which means that this correction does not matter to them. The Darwin term is just an orbital correction.
Using this knowledge allows us to calculate the eigenvalue difference in just the orbitals, which is to say when :
And hence
Putting it all together#
Now that we have all three corrections, we can put the together to find the total correction to the energy eigenvalues:
and adding this to the energy eigenvalues themselves we get the fine structure energy states:
As we can see, there is no direct dependence on the azimuthal angular momentum: the only thing that matters is the total angular momentum. As such, despite the fine structure leading to a great deal of splitting in the energy spectrum, it does not fully eliminate the degeneracy in .
We can see that, since , its square is . As such, when (so, when dealing with hydrogen), the correction is quite small, in the order of . Recalling that the ground state energy is the Rydberg energy, at about , it's quite a small contribution indeed. However, do note the dependence on . This derivation is not just for the hydrogen atom, but rather all hydrogenic atoms. This includes any atom with just a single valence electron, which for instance includes all alkali metals. Thus, while the fine structure may not be all that significant for hydrogen outside of high-precision experiments or extreme conditions, it may be quite relevant for heavier hydrogenic atoms, such as cesium (symbol: ) for which , thus causing the whole term to be .
Also, the relativistic correction is strictly positive, which means that the corrected states are more strongly bound (since is negative). In a sense, relativity makes it harder to break an electron off its atom. This becomes progressively less and less relevant as you climb the states, due to the dependence, which means that the ground state () is the most affected.
State transitions#
Once the stationary states themselves are determined, it's fair to ask how the electron goes from one to another. After all, there's little point in knowing all possible states if the electron is perpetually confined to the ground state. As such, we now take on the investigation of state transitions of the electron.
Electromagnetic field#
Of course, since stationary states are stable, we need some external influence to trigger a state transition. We will choose the application of an external electric field and magnetic field for this section, respectively determined by an electric potential and a magnetic vector potential . The Hamiltonian of an point charge of mass in these fields is
(ignoring small spin-dependent terms.) Changing and plugging this into the time-dependent Schrödinger equation yields
Before we move on: the entirety of electrodynamics hinges on an arbitrary choice, which we called gauge freedom. Though proof won't be given here, the Schrödinger equation is gauge invariant, which means that regardless of the gauge we choose, the equation does not change. The only thing that changes is that the resulting wavefunction acquires a phase, and different gauges lead to different phases. However, since phases vanish when taking the square modulus , it makes no physical difference. For this section, we choose to fix the gauge to , known as the Coulomb gauge.
Theoretical calculation of the transition probabilities is arduous and only a surface level analysis will be reported here. See Bransden & Joachaim Chapters 4.1 and 4.2 for more discussion.
We'll assume that the incident perturbation is a nearly monochromatic pulse of radiation peaked about some angular frequency and nonzero only in a small region of width . As usual, a generic pulse can be represented as a superposition of plane waves of frequency , which we assume to all share the same wavevector and (linear) polarization vector (the two are orthogonal, ). A plane wave is solved by the real vector potential
with . is a real phase term. The superposition in therefore
Since we're looking for the state transition caused by this perturbation, we set the time-dependent wavefunction to be generated by the usual superposition
where are the coefficients of the expansion, are the stationary states and are the eigenstates. We already know the and , so to find the time evolution, we need to solve for the . We do this with time-dependent perturbation theory and by ignoring the terms, which leaves a perturbation of
The coefficients satisfy
where . If we take the system to initially be in the stationary state of energy , to first order, the coefficients are
Solving this for the above yields
The square brackets contain two terms: the first is the absorption term, which is nonzero only if , and the second is the emission term, which is nonzero only if . This is because the first integral over vanishes in realistic scenarios where the duration of the pulse is far larger than the periodic time , thus leading that integral to be negligible unless , which happens precisely when . Similarly, the second integral is negligible unless and so .
This shouldn't come as a surprise: after all, energy is conserved, so the state transition must absorb or emit precisely the difference in energy between the states.
In an absorption process, when only the first half of the square brackets is nonzero, the probability that an atom will be found in a certain state upon measurement goes like
where is a matrix element and is given by
This allows us to define the absorption transition probability
where
The transition probability is also bound to its absorption Cross section by dividing the rate at which energy is absorbed by the atom, by the intensity to get
Emission is complicated by the fact that there are two kinds of emission: stimulated and spontaneous. For stimulated emission, the transition rate is
and similarly
Spontaneous emission is more difficult to explain due to the inherent quantum nature of it, requiring quantum electrodynamics to properly solve. In fact, spontaneous emission is actually not a "real" process, in that it is actually stimulated emission, where the stimulating field is the vacuum quantum field, which is always present.
The dipole approximation#
The calculation of the matrix element can be greatly simplified by making an assumption on how the field looks like. Namely, we ask that the field is essentially uniform at atomic scales, so that the field does not change in space over the volume of the atom (but still changes in time of course). The characteristic size of the atom is on the order of , so we ask that the wavelength of the incoming radiation is considerably larger than this6. In so doing, we allow ourselves to expand the in the matrix element in its exponential series:
For large enough wavelengths, and thus small enough compared to , we might as well say . The expression for a transition probability can thus be expressed in terms of , where we wrote the nabla operator through its first order equation of motion. This yields
Now, this new term can be shown to be not null only in very specific scenarios, which are when the new and old state satisfy the following two conditions:
But then the transition probability must follow these conditions: we call these selection rules, as they allow us to determine which transitions are allowed and which aren't, for if these rules are not met, then and the transition is impossible. These rules actually happen to be quite general, despite having derived them from a specific approximation in a specific kind of field and specifically for electrons. These even function, for instance, in transitions between vibrational states of molecules.
Footnotes#
-
The angular momentum operator is found by quantizing the classical quantities of and , which become and respectively ( is the Gradient). So, we get . ↩
-
If you recall the general solution, this extra term is known as the "centrifugal term". It represent the fact that more intense rotations (high ) tend to "push" outwards more. It's to quantum mechanics what the centrifugal force is to classical mechanics. ↩
-
Strictly speaking, the finiteness of the wavefunction is not mandatory. What is mandatory is that all possible physical states are described by a complete orthogonal set of wavefunctions. However, for many potentials, including the central electromagnetic one, this condition can be proven to simplify down to . ↩
-
It can be solved using the generating function of the Laguerre polynomials. See for instance Bransden & Joachaim Appendix 3. Be careful with definitions, as Bransden uses the alternative definition of the polynomials which omits , whereas this article includes it. ↩
-
We have , and . Be careful of the definition: the handbook's definition of the polynomials omits whereas we include it, so we need to do two things before anything else: divide the polynomial by (which becomes due to the square) and convert . Then do the substitutions above. ↩
-
What "considerably" means is clearly quite arbitrary. It depends, as always, on how precise one wants to be. Most everyday radiation, such as visible light is well over the threshold, as its wavelength is around . As such, the dipole approximation is suitable for quantum optics. ↩