Orthogonal polynomials
From Free net encyclopedia
In mathematics, an orthogonal polynomial sequence is an infinite sequence of polynomials p0(x), p1(x), p2(x) ... , in which each pn(x) has degree n, and such that any two different polynomials in the sequence are orthogonal to each other in the following sense:
- One can define an inner product on functions, (analogous to the ordinary "dot product" for vectors), by integrating the product of the functions:
- <math>\langle f,g \rangle=\int_{x_1}^{x_2} f(x)g(x)\,dx</math>
- More generally, one can put a fixed "weight function" W(x) into the integral:
- <math>\langle f,g \rangle=\int_{x_1}^{x_2} f(x)g(x)W(x)\,dx</math>
- Two functions are orthogonal to each other if their inner product is zero, in the same way that ordinary vectors are orthogonal (perpendicular) if their dot product is zero.
- Such an inner product makes the set of all functions of finite norm a Hilbert space.
So a polynomial sequence is an orthogonal sequence with respect to the weight function <math>W</math> when any two different polynomials in the sequence are orthogonal, using that weight function, i.e.,
- <math>\langle p_m, p_n \rangle=\int_{x_1}^{x_2} p_m(x) p_n(x)\,W(x)\,dx=0\qquad \mathrm{whenever}\qquad m\neq n.</math>
The interval of integration is called the interval of orthogonality. It might be infinite at one or both ends.
The field of orthogonal polynomials developed in the late 19th century from a study of continued fractions by Stieltjes. It evolved into a field rich in applications to many areas of mathematics and physics.
The simplest orthogonal polynomials are the Legendre polynomials, for which the interval of orthogonality is [−1, 1] and the weight function is simply 1:
- <math>P_0(x) = 1\,</math>
- <math>P_1(x) = x\,</math>
- <math>P_2(x) = \frac{3x^2-1}{2}\,</math>
- <math>P_3(x) = \frac{5x^3-3x}{2}\,</math>
- <math>P_4(x) = \frac{35x^4-30x^2+3}{8}\,</math>
- <math>\dots</math>
These are all orthogonal over [−1, 1]:
- <math>\int_{-1}^{1} P_m(x)P_n(x)\,dx = 0\qquad \mathrm{whenever}\qquad m \ne n</math>
We require that the weight function be strictly positive in the interior of the interval of orthogonality. In some cases, it may be zero, or go off to infinity, at the end points. The integral of the weight function times any polynomial must be finite.
Now any sequence of polynomials <math>p_0, p_1 \dots</math>, with each <math>\ p_k</math> having degree k, is a basis for the (infinite-dimensional) vector space of all polynomials. An orthogonal sequence is just a sequence that comprises an orthogonal basis for that space, relative to the given inner product.
The Gram-Schmidt process can turn any basis for a vector space into an orthogonal basis, by starting with one vector and then repeatedly incorporating new vectors while making each new vector orthogonal to all the previous ones. This is done by subtracting suitable linear combinations of the previous vectors. Doing this for polynomials is often used as an exercise in elementary linear algebra courses. It results in the Legendre polynomials.
When making an orthogonal basis, one may be tempted to make an orthonormal basis, that is, one in which <math>\langle p_n, p_n \rangle\ =\ 1</math>. For polynomials, this would often result in ugly square roots in the coefficients. Instead, polynomials are often scaled in a way that mathematicians agree on, that makes the coefficients and other formulas simpler. This is called standardization. The "classical" polynomials listed below have been standardized, typically by setting their leading coefficients to some specific quantity, or by setting a specific value for the polynomial. This standardization has no mathematical significance; it is just a convention. Standardization also involves scaling the weight function in an agreed-upon way.
Once a polynomial sequence has been standardized, we can define the norm. Let
- <math>h_n=\langle p_n,\ p_n \rangle</math>
The norm is the square root of this. The values of <math>\ h_n</math> for the standardized classical polynomials will be listed in the table below. Using <math>\ h_n</math>, we have
- <math>\langle p_m,\ p_n \rangle\ =\ \delta_{mn}h_n</math>
where δmn is the Kronecker delta.
Contents |
General properties of orthogonal polynomial sequences
All orthogonal polynomial sequences have a number of elegant and fascinating properties. Before proceeding with them:
Lemma 1: Given an orthogonal polynomial sequence <math>\ p_i(x)</math>, any nth-degree polynomial <math>\ S(x)</math> can be expanded in terms of <math>p_0 \dots p_n</math>. That is, there are coefficients <math>{\alpha}_0 \dots {\alpha}_n</math> such that
- <math>S(x)=\sum_{i=0}^n {\alpha}_i\ p_i(x)</math>
Proof by mathematical induction. Choose <math>\ {\alpha}_n</math> so that the <math>\ x^n</math> term of <math>\ S(x)</math> matches that of <math>\ {\alpha}_nP_n(x)</math>. Then <math>\ S(x)-{\alpha}_n P_n(x)</math> is an <math>{n-1}^{st}</math> degree polynomial. Continue downward.
Lemma 2: Given an orthogonal polynomial sequence, each of its polynomials is orthogonal to any polynomial of strictly lower degree.
Proof: Given n, any polynomial of degree n − 1 or lower can be expanded in terms of <math>p_0 \dots p_{n-1}</math>. <math>P_n\,</math> is orthogonal to each of them.
Recurrence relations
Any orthogonal sequence has a recurrence formula relating any three consecutive polynomials in the sequence.
- <math>p_{n+1}\ =\ (a_nx+b_n)\ p_n\ -\ c_n\ p_{n-1}</math>
The coefficients a, b, and c depend on n. They also depend on the standardization, obviously. (proof)
The values of <math>a_n</math>, <math>b_n</math> and <math>c_n</math> can be worked out directly. Let <math>k_j</math> and <math>k_j'</math> be the first and second coefficients of <math>p_j</math>:
- <math>p_j(x)=k_jx^j+k_j'x^{j-1}+\cdots</math>
and <math>h_j</math> be the inner product of <math>p_j</math> with itself:
- <math>h_j\ =\ \langle p_j,\ p_j \rangle</math>
we have
- <math>a_n=\frac{k_{n+1}}{k_n}\ \ \ \ \ \ \ b_n=a_n \left(\frac{k_{n+1}'}{k_{n+1}} -
\frac{k_n'}{k_n} \right)\ \ \ \ \ \ \ c_n=a_n \left(\frac{k_{n-1}h_n}{k_n h_{n-1}} \right).</math>
Existence of real roots
Each polynomial in an orthogonal sequence has all n of its roots real, distinct, and strictly inside the interval of orthogonality. (proof)
(Anyone who has graphed polynomials in high school knows that it is very rare for a randomly-chosen high-degree polynomial to have all of its roots real.)
Interlacing of roots
The roots of each polynomial lie strictly between the roots of the next higher polynomial in the sequence. (proof)
Differential equations leading to orthogonal polynomials
A very important class of orthogonal polynomials arises from a differential equation of the form
- <math>{Q(x)}\,f + {L(x)}\,f' + {\lambda}f = 0\,</math>
where Q is a given quadratic (at most) polynomial, and L is a given linear polynomial. The function f, and the constant λ, are to be found.
- (Note that it makes sense for such an equation to have a polynomial solution.
- Each term in the equation is a polynomial, and the degrees are consistent.)
This is a Sturm-Liouville type of equation. Such equations generally have singularities in their solution functions f except for particular values of λ. They can be thought of a eigenvector/eigenvalue problems: Letting D be the differential operator <math>D(f) = Q f + L f'\,</math> and changing the sign of λ, the problem is to find the eigenvectors (eigenfunctions) f, and the corresponding eigenvalues λ, such that f does not have singularities and D(f) = λf.
The solutions of this differential equation have singularities unless λ takes on specific values. There is a series of numbers <math>{\lambda}_0, {\lambda}_1, {\lambda}_2 \dots\,</math> that lead to a series of polynomial solutions <math>P_0, P_1, P_2 \dots\,</math> if one of the following sets of conditions are met:
- Q is actually quadratic, L is linear, Q has two distinct real roots, the root of L lies strictly between the roots of Q, and the leading terms of Q and L have the same sign.
- Q is not actually quadratic, but is linear, L is linear, the roots of Q and L are different, and the leading terms of Q and L have the same sign if the root of L is less than the root of Q, or vice-versa.
- Q is just a nonzero constant, L is linear, and the leading term of L has the opposite sign of Q.
These three cases lead to the Jacobi-like, Laguerre-like, and Hermite-like polynomials, respectively.
In each of these three cases, we have the following:
- The solutions are a series of polynomials <math>P_0, P_1, P_2 \dots\,</math>, each <math>P_n\,</math> having degree n, and corresponding to a number <math>{\lambda}_n\,</math>.
- The interval of orthogonality is bounded by whatever roots Q has.
- The root of L is inside the interval of orthogonality.
- Letting <math>R(x) = e^{\int \frac{L(x)}{Q(x)}\,dx}\,</math>, the polynomials are orthogonal under the weight function <math>W(x) =\frac{R(x)}{Q(x)}\,</math>
- W(x) has no zeros or infinities inside the interval, though it may have zeros or infinities at the end points.
- W(x) gives a finite inner product to any polynomials.
- W(x) can be made to be greater than 0 in the interval. (Negate the entire differential equation if necessary so that Q(x) > 0 inside the interval.)
Because of the constant of integration, the quantity R(x) is determined only up to an arbitrary positive multiplicative constant. It will be used only in homogeneous differential equations (where this doesn't matter) and in the definition of the weight function (which can also be indeterminate.) The tables below will give the "official" values of R(x) and W(x).
Rodrigues formula
Under the assumptions of the preceding section, Pn(x) is proportional to <math>\frac{1}{W(x)} \ \frac{d^n}{dx^n}\left(W(x)[Q(x)]^n\right)</math>
This is known as Rodrigues formula. It is often written
- <math>P_n(x) = \frac{1}{{e_n}W(x)} \ \frac{d^n}{dx^n}\left(W(x)[Q(x)]^n\right)</math>
where the numbers en depend on the standardization. The standard values of en will be given in the tables below.
The numbers λn
Under the assumptions of the preceding section, we have
- <math>{\lambda}_n = - n \left( \frac{n-1}{2}\ Q + L' \right)</math>
(Since Q is quadratic and L is linear, <math>Q</math> and <math>L'</math> are constants, so these are just numbers.)
Second form for the differential equation
Let <math>R(x) = e^{\int \frac{L(x)}{Q(x)}\,dx}\,</math>.
Then
- <math>(Ry')' = R\,y + R'\,y' = R\,y + \frac{R\,L}{Q}\,y'</math>
Now multiply the differential equation
- <math>{Q}\,y + {L}\,y' + {\lambda}\,y = 0\,</math>
by R/Q, getting
- <math>R\,y + \frac{R\,L}{Q}\,y' + \frac{R\,\lambda}{Q}\,y = 0\,</math>
or
- <math>(Ry')' + \frac{R\,\lambda}{Q}\,y = 0\,</math>
This is the standard Sturm-Liouville form for the equation.
Third form for the differential equation
Let <math>S(x) = \sqrt{R(x)} = e^{\int \frac{L(x)}{2\,Q(x)}\,dx}\,</math>.
Then
- <math>S' = \frac{S\,L}{2\,Q}.</math>
Now multiply the differential equation
- <math>{Q}\,y + {L}\,y' + {\lambda}\,y = 0\,</math>
by S/Q, getting
- <math>S\,y + \frac{S\,L}{Q}\,y' + \frac{S\,\lambda}{Q}\,y = 0\,</math>
or
- <math>S\,y + 2\,S'\,y' + \frac{S\,\lambda}{Q}\,y = 0\,</math>
But <math>(S\,y) = S\,y + 2\,S'\,y' + S\,y</math>, so
- <math>(S\,y) + \left(\frac{S\,\lambda}{Q} - S\right)\,y = 0,\,</math>
or, letting u = Sy,
- <math>u + \left(\frac{\lambda}{Q} - \frac{S}{S}\right)\,u = 0.\,</math>
Formulas involving derivatives
Under the assumptions of the preceding section, let <math>P_n^{[r]}</math> denote the rth derivative of <math>P_n</math>. (We put the "r" in brackets to avoid confusion with an exponent.) <math>P_n^{[r]}</math> is a polynomial of degree n − r. Then we have the following:
- (orthogonality) For fixed r, the polynomial sequence <math>P_r^{[r]}, P_{r+1}^{[r]}, P_{r+2}^{[r]}, \dots</math> are orthogonal, weighted by <math>WQ^r\,</math>.
- (generalized Rodriguez formula) <math>P_n^{[r]}</math> is proportional to <math>\frac{1}{W(x)[Q(x)]^r} \ \frac{d^{n-r}}{dx^{n-r}}\left(W(x)[Q(x)]^n\right)</math>.
- (differential equation) <math>P_n^{[r]}</math> is a solution of <math>{Q}\,y + (rQ'+L)\,y' + [{\lambda}_n-{\lambda}_r]\,y = 0\,</math>, where <math>{\lambda}_r\,</math> is the same function as <math>{\lambda}_n\,</math>, that is, <math>{\lambda}_r = - r \left( \frac{r-1}{2}\ Q + L' \right)</math>
- (differential equation, second form) <math>P_n^{[r]}</math> is a solution of <math>(RQ^{r}y')' + [{\lambda}_n-{\lambda}_r]RQ^{r-1}\,y = 0\,</math>
There are also some mixed recurrences. In each of these, the numbers a, b, and c depend on n and r, and are unrelated in the various formulas.
- <math>P_n^{[r]} = aP_{n+1}^{[r+1]} + bP_n^{[r+1]} + cP_{n-1}^{[r+1]}</math>
- <math>P_n^{[r]} = (ax+b)P_n^{[r+1]} + cP_{n-1}^{[r+1]}</math>
- <math>QP_n^{[r+1]} = (ax+b)P_n^{[r]} + cP_{n-1}^{[r]}</math>
There are an enormous number of other formulas involving orthogonal polynomials in various ways. Here is a tiny sample of them, relating to the Chebyshev, Associated Laguerre, and Hermite polynomials:
- <math>2\,T_{m}(x)\,T_{n}(x) = T_{m+n}(x) + T_{m-n}(x)\,</math>
- <math>H_{2n}(x) = (-4)^{n}\,n!\,L_{n}^{(-1/2)}(x^2)</math>
- <math>H_{2n+1}(x) = 2(-4)^{n}\,n!\,x\,L_{n}^{(1/2)}(x^2)</math>
The classical orthogonal polynomials
The class of polynomials arising from the differential equation described above have many important applications in such areas as mathematical physics, interpolation theory, the theory of random matrices, computer approximations, and many others. All of these polynomial sequences are equivalent, under scaling and/or shifting of the domain, and standardizing of the polynomials, to more restricted classes. Those restricted classes are the "classical orthogonal polynomials".
- Every Jacobi-like polynomial sequence can have its domain shifted and/or scaled so that its interval of orthogonality is [−1, 1], and has Q = 1 − x2. They can then be standardized into the Jacobi polynomials <math>P_n^{(\alpha, \beta)}</math>. There are several important subclasses of these: Gegenbauer, Legendre, and two types of Chebyshev.
- Every Laguerre-like polynomial sequence can have its domain shifted, scaled, and/or reflected so that its interval of orthogonality is <math>[0, \infty)</math>, and has Q = x. They can then be standardized into the Associated Laguerre polynomials <math>L_n^{(\alpha)}</math>. The plain Laguerre polynomials <math>\ L_n</math> are a subclass of these.
- Every Hermite-like polynomial sequence can have its domain shifted and/or scaled so that its interval of orthogonality is <math>(-\infty, \infty)</math>, and has Q = 1. They can then be standardized into the Hermite polynomials <math>H_n\,</math>.
Because all polynomial sequences arising from a differential equation in the manner described above are trivially equivalent to the classical polynomials, the actual classical polynomials are always used.
Jacobi polynomials
The Jacobi-like polynomials, once they have had their domain shifted and scaled so that the interval of orthogonality is [−1, 1], still have two parameters to be determined. They are <math>\alpha</math> and <math>\beta</math> in the Jacobi polynomials, written <math>P_n^{(\alpha, \beta)}</math>. We have <math>Q(x) = 1-x^2\,</math> and <math>L(x) = \beta-\alpha-(\alpha+\beta+2)\, x</math>. Both <math>\alpha</math> and <math>\beta</math> are required to be greater than −1. (This puts the root of L inside the interval of orthogonality.)
When <math>\alpha</math> and <math>\beta</math> are not equal, these polynomials are not symmetrical about x=0.
The differential equation
- <math>(1-x^2)\,y + (\beta-\alpha-[\alpha+\beta+2]\,x)\,y' + {\lambda}\,y = 0\qquad with\qquad\lambda = n(n+1+\alpha+\beta)\,</math>
is Jacobi's equation.
For further details, see Jacobi polynomials.
Gegenbauer polynomials
When one sets the parameters <math>\alpha</math> and <math>\beta</math> in the Jacobi polynomials equal to each other, one obtains the Gegenbauer or ultraspherical polynomials. They are written <math>C_n^{(\alpha)}</math>, and defined as
- <math>C_n^{(\alpha)}(x) = \frac{\Gamma(2\alpha\!+\!n)\,\Gamma(\alpha\!+\!1/2)}
{\Gamma(2\alpha)\,\Gamma(\alpha\!+\!n\!+\!1/2)}\! \ P_n^{(\alpha-1/2, \alpha-1/2)}</math> We have <math>Q(x) = 1-x^2\,</math> and <math>L(x) = -(2\alpha+1)\, x</math>. <math>\alpha\,</math> is required to be greater than −1/2.
(Incidentally, the standardization given in the table below would make no sense for α=0 and nǂ0, because it would set the polynomials to zero. In that case, the accepted standardization sets <math>C_n^{(0)}(1) = \frac{2}{n}</math> instead of the value given in the table.)
Ignoring the above considerations, the parameter <math>\alpha</math> is closely related to the derivatives of <math>C_n^{(\alpha)}</math>:
- <math>C_n^{(\alpha+1)}(x) = \frac{1}{2\alpha}\! \ \frac{d}{dx}C_{n+1}^{(\alpha)}(x)</math>
or, more generally:
- <math>C_n^{(\alpha+m)}(x) = \frac{\Gamma(\alpha)}{2^m\Gamma(\alpha+m)}\! \ C_{n+m}^{(\alpha)[m]}(x)</math>
All the other classical Jacobi-like polynomials (Legendre, etc.) are special cases of the Gegenbauer polynomials, obtained by choosing a value of <math>\alpha</math> and choosing a standardization.
For further details, see Gegenbauer polynomials.
Legendre polynomials
The differential equation is
- <math>(1-x^2)\,y - 2x\,y' + {\lambda}\,y = 0\qquad with\qquad\lambda = n(n+1)\,</math>
This is Legendre's equation.
The second form of the differential equation is
- <math>([1-x^2]\,y')' + \lambda\,y = 0\,</math>
The recurrence relation is
- <math>(n+1)\,P_{n+1}(x) = (2n+1)x\,P_n(x) - n\,P_{n-1}(x)\,</math>
Rodrigues formula is
- <math>P_n(x) = (-1)^n\,\frac{1}{2^n\,n!} \ \frac{d^n}{dx^n}\left([1-x^2]^n\right)</math>
Associated Legendre functions
The Associated Legendre functions, denoted <math>P_\ell^{(m)}(x)</math> where l and m are integers with <math>0{\le}m{\le}\ell</math>, are defined as
- <math>P_\ell^{(m)}(x) = (1-x^2)^{m/2}\ P_\ell^{[m]}(x)\,</math>
The m in parentheses (to avoid confusion with an exponent) is a parameter. The m in brackets denotes the mth derivative of the Legendre polynomial.
(The associated Legendre functions are not polynomials when m is odd.)
They have a recurrence relation:
- <math>(\ell+1-m)\,P_{\ell+1}^{(m)}(x) = (2\ell+1)x\,P_\ell^{(m)}(x) - (\ell+m)\,P_{\ell-1}^{(m)}(x)\,</math>
For fixed m, the sequence <math>P_m^{(m)}, P_{m+1}^{(m)}, P_{m+2}^{(m)}, \dots</math> are orthogonal over [−1, 1], with weight 1.
For given m, <math>P_\ell^{(m)}(x)</math> are the solutions of
- <math>[1-x^2]\,y -2xy' + [\lambda - \frac{m^2}{1-x^2}]\,y = 0\qquad \mathrm{with}\qquad\lambda = \ell(\ell+1)\,</math>
These functions are most useful when the argument is reparameterized in terms of angles:
- <math>P_\ell^{(m)}(\cos \theta) = (\sin \theta)^m\ P_\ell^{[m]}(\cos \theta)\,</math>
For fixed m, <math>P_m^{(m)}(\cos \theta), P_{m+1}^{(m)}(\cos \theta), P_{m+2}^{(m)}(\cos \theta), \dots</math> are orthogonal, parameterized by θ over <math>[0, \pi]</math>, with weight <math>\sin \theta</math>:
- <math>\int_{0}^{\pi} P_\ell^{(m)}(\cos \theta) P_r^{(m)}(\cos \theta)\,\sin \theta\,d\theta=0\ \mbox{if}\ \ell\neq r.</math>
In terms of θ, <math>P_\ell^{(m)}(\cos \theta)</math> are solutions of
- <math>\frac{d^{2}y}{d\theta^2} + \cot \theta \frac{dy}{d\theta} + \left[\lambda - \frac{m^2}{\sin^2\theta}\right]\,y = 0\,</math>
More precisely, given an integer m<math>\ge</math>0, the above equation has nonsingular solutions only when <math>\lambda = \ell(\ell+1)\,</math> for <math>\ell</math> an integer<math>{\ge}m</math>, and the solutions are proportional to <math>P_\ell^{(m)}(\cos \theta)</math>.
Spherical harmonics
What makes these functions useful is that they are central to the solution of the equation <math>\nabla^2\psi + \lambda\psi = 0</math> on the surface of a sphere. In spherical coordinates θ (colatitude) and φ (longitude), the Laplacian is
- <math>\nabla^2\psi = \frac{\partial^2\psi}{\partial\theta^2} + \cot \theta \frac{\partial \psi}{\partial \theta} + \csc^2 \theta\frac{\partial^2\psi}{\partial\phi^2}</math>
When the partial differential equation
- <math>\frac{\partial^2\psi}{\partial\theta^2} + \cot \theta \frac{\partial \psi}{\partial \theta} + \csc^2 \theta\frac{\partial^2\psi}{\partial\phi^2} + \lambda \psi = 0</math>
is solved by the method of separation of variables, one gets a φ-dependent part <math>\sin(m\phi)</math> or <math>\cos(m\phi)</math> for integer m≥0, and an equation for the θ-dependent part
- <math>\frac{d^{2}y}{d\theta^2} + \cot \theta \frac{dy}{d\theta} + \left[\lambda - \frac{m^2}{\sin^2\theta}\right]\,y = 0\,</math>
for which the solutions are <math>P_\ell^{(m)}(\cos \theta)</math> with <math>\ell{\ge}m</math> and <math>\lambda = \ell(\ell+1)</math>.
Therefore, the equation
- <math>\nabla^2\psi + \lambda\psi = 0</math>
has nonsingular separated solutions only when <math>\lambda = \ell(\ell+1)</math>, and the solutions are proportional to
- <math>P_\ell^{(m)}(\cos \theta)\ \cos (m\phi)\ \ \ \ 0 \le m \le \ell</math>
and
- <math>P_\ell^{(m)}(\cos \theta)\ \sin (m\phi)\ \ \ \ 0 < m \le \ell</math>
For each choice of <math>\ell</math>, there are <math>2\ell+1</math> functions for the various values of m and choices of sine and cosine. They are all orthogonal in both l and m when integrated over the surface of the sphere.
They are usually written in terms of complex exponentials:
- <math>Y_{\ell, m}(\theta, \phi) = \sqrt{\frac{(2\ell+1)(\ell-{\mid}m{\mid})!}{4\pi(\ell+{\mid}m{\mid})!}}\ P_\ell^{({\mid}m{\mid})}(\cos \theta)\ e^{im\phi}\qquad -\ell \le m \le \ell</math>
The functions <math>Y_{\ell, m}(\theta, \phi)</math> are called spherical harmonics. The quantity in the square root is a normalizing factor. These functions form a complete orthonormal set of functions in the sense of Fourier series.
When a 3-dimensional spherically symmetric partial differential equation is solved by the method of separation of variables in spherical coordinates, the part that remains after removal of the radial part is typically of the form <math>\nabla^2\psi(\theta, \phi) + \lambda\psi(\theta, \phi) = 0</math>, and hence the solutions are spherical harmonics.
For further details, see Legendre polynomials.
Chebyshev polynomials
The differential equation is
- <math>(1-x^2)\,y - x\,y' + {\lambda}\,y = 0\qquad with\qquad\lambda = n^2\,</math>
This is Chebyshev's equation.
The recurrence relation is
- <math>T_{n+1}(x) = 2x\,T_n(x) - T_{n-1}(x)\,</math>
Rodrigues formula is
- <math>T_n(x) = \frac{\Gamma(1/2)\sqrt{1-x^2}}{(-2)^n\,\Gamma(n+1/2)} \ \frac{d^n}{dx^n}\left([1-x^2]^{n-1/2}\right)</math>
These polynomials have the property that, in the interval of orthogonality,
- <math>T_n(x) = \cos(n\,cos^{-1}(x))</math>
(To prove it, use the recurrence formula.)
This means that all their local minima and maxima have values of −1 and +1, that is, the polynomials are "level". Because of this, expansion of functions in terms of Chebyshev polynomials is sometimes used for polynomial approximations in computer math libraries.
Some authors use versions of these polynomials that have been shifted so that the interval of orthogonality is [0, 1] or [−2, 2].
There are also Chebyshev polynomials of the second kind, denoted <math>U_n\,</math>
We have:
- <math>U_n = \frac{1}{n+1}\,T_{n+1}'\,</math>
For further details, including the expressions for the first few polynomials, see Chebyshev polynomials.
Laguerre polynomials
The most general Laguerre-like polynomials, after the domain has been shifted and scaled, are the Associated Laguerre polynomials (also called Generalized Laguerre polynomials), denoted <math>L_n^{(\alpha)}</math>. There is a parameter <math>\alpha</math>, which can be any real number strictly greater than −1. The parameter is put in parentheses to avoid confusion with an exponent. The plain Laguerre polynomials are simply the <math>\alpha = 0</math> version of these:
- <math>L_n(x) = L_n^{(0)}(x)</math>
The differential equation is
- <math>x\,y + (\alpha + 1-x)\,y' + {\lambda}\,y = 0\qquad \mathrm{with}\qquad\lambda = n\,</math>
This is Laguerre's equation.
The second form of the differential equation is
- <math>(x^{\alpha+1}\,e^{-x}\, y')' + {\lambda}\,x^{\alpha}\,e^{-x}\,y = 0\,</math>
The recurrence relation is
- <math>(n+1)\,L_{n+1}^{(\alpha)}(x) = (2n+1+\alpha-x)\,L_n^{(\alpha)}(x) - (n+\alpha)\,L_{n-1}^{(\alpha)}(x)\,</math>
Rodrigues formula is
- <math>L_n^{(\alpha)}(x) = \frac{x^{-\alpha}e^x}{n!} \ \frac{d^n}{dx^n}\left(x^{n+\alpha}\,e^{-x}\right)</math>
The parameter <math>\alpha</math> is closely related to the derivatives of <math>L_n^{(\alpha)}</math>:
- <math>L_n^{(\alpha+1)}(x) = - \frac{d}{dx}L_{n+1}^{(\alpha)}(x)</math>
or, more generally:
- <math>L_n^{(\alpha+m)}(x) = (-1)^m L_{n+m}^{(\alpha)[m]}(x)</math>
Laguerre's equation can be manipulated into a form that is more useful in applications:
- <math>u = x^{\frac{\alpha-1}{2}}e^{-x/2}L_n^{(\alpha)}(x)</math>
is a solution of
- <math>u + \frac{2}{x}\,u' + \left[\frac{\lambda}{x} - \frac{1}{4} - \frac{\alpha^2-1}{4x^2}\right]\,u = 0\qquad \mathrm{with}\qquad\lambda = n+\frac{\alpha+1}{2}\,</math>
This can be further manipulated. When <math>\ell = \frac{\alpha-1}{2}</math> is an integer, and <math>n{\ge}\ell+1</math>:
- <math>u = x^{\ell}e^{-x/2}L_{n-\ell-1}^{(2\ell+1)}(x)</math>
is a solution of
- <math>u + \frac{2}{x}\,u' + \left[\frac{\lambda}{x} - \frac{1}{4} - \frac{\ell(\ell+1)}{x^2}\right]\,u = 0\qquad with\qquad\lambda = n\,</math>
The solution is often expressed in terms of derivatives instead of associated Laguerre polynomials:
- <math>u = x^{\ell}e^{-x/2}L_{n+\ell}^{[2\ell+1]}(x).</math>
This equation arises in quantum mechanics, in the radial part of the solution of the Schrödinger equation for a one-electron atom.
Physicists often use a definition for the Laguerre polynomials that is larger, by a factor of (n!), than the definition used here.
For further details, including the expressions for the first few polynomials, see Laguerre polynomials.
Hermite polynomials
The differential equation is
- <math>y - 2xy' + {\lambda}\,y = 0,\qquad \mathrm{with}\qquad\lambda = 2n.\,</math>
This is Hermite's equation.
The second form of the differential equation is
- <math>(e^{-x^2}\,y')' + e^{-x^2}\,\lambda\,y = 0\,</math>
The third form is
- <math>(e^{-x^2/2}\,y) + ({\lambda}+1-x^2)(e^{-x^2/2}\,y) = 0\,</math>
The recurrence relation is
- <math>H_{n+1}(x) = 2x\,H_n(x) - 2n\,H_{n-1}(x).\,</math>
Rodrigues formula is
- <math>H_n(x) = (-1)^n\,e^{x^2} \ \frac{d^n}{dx^n}\left(e^{-x^2}\right).</math>
The first few Hermite polynomials are
- <math>H_0(x) = 1\,</math>
- <math>H_1(x) = 2x\,</math>
- <math>H_2(x) = 4x^2-2\,</math>
- <math>H_3(x) = 8x^3-12x\,</math>
- <math>H_4(x) = 16x^4-48x^2+12\,</math>
One can define the associated Hermite functions
- <math>{\psi}_n(x) = (h_n)^{-1/2}\,e^{-x^2/2}H_n(x)</math>
Because the multiplier is proportional to the square root of the weight function, these functions are orthogonal over <math>(-\infty, \infty)</math> with no weight function.
The third form of the differential equation above, for the Associated Hermite functions, is
- <math>\psi + ({\lambda}+1-x^2)\psi = 0\,</math>
The associated Hermite functions arise in many areas of mathematics and physics. In quantum mechanics, they are the solutions of Schrödinger's equation for the harmonic oscillator. They are also eigenfunctions (with eigenvalue (−i)n) of the Fourier transform.
Some authors, particularly probabilists, use an alternate definition of the Hermite polynomials, with a weight function of <math>e^{-x^2/2}</math> instead of <math>e^{-x^2}</math>. This is generally named with the two-letter symbol <math>He\,</math>. It could be defined as
- <math>He_n(x) = 2^{-n/2}\,H_n\left(\frac{x}{\sqrt{2}}\right)</math>
For further details, see Hermite polynomials.
Table of classical orthogonal polynomials
Name, and conventional symbol | Chebyshev, <math>\ T_n</math> | Chebyshev (second kind), <math>\ U_n</math> |
Legendre, <math>\ P_n</math> | Hermite, <math>\ H_n</math> |
---|---|---|---|---|
Limits of orthogonality | <math>-1, 1\,</math> | <math>-1, 1\,</math> | <math>-1, 1\,</math> | <math>-\infty, \infty</math> |
Weight, <math>W(x)\,</math> | <math>(1-x^2)^{-1/2}\,</math> | <math>(1-x^2)^{1/2}\,</math> | <math>1\,</math> | <math>e^{-x^2}</math> |
Standardization | <math>T_n(1)=1\,</math> | <math>U_n(1)=n+1\,</math> | <math>P_n(1)=1\,</math> | Lead term = <math>2^n\,</math> |
Square of norm, <math>h_n\,</math> | <math>\left\{
\begin{matrix} \pi &:~n=0 \\ \pi/2 &:~n\ne 0 \end{matrix}\right. </math> |
<math>\pi/2\,</math> | <math>\frac{2}{2n+1}</math> | <math>2^n\,n!\,\sqrt{\pi}</math> |
Leading term, <math>k_n\,</math> | <math>2^{n-1}\,</math> | <math>2^n\,</math> | <math>\frac{(2n)!}{2^n\,(n!)^2}\,</math> | <math>2^n\,</math> |
Second term, <math>k'_n\,</math> | <math>0\,</math> | <math>0\,</math> | <math>0\,</math> | <math>0\,</math> |
<math>Q\,</math> | <math>1-x^2\,</math> | <math>1-x^2\,</math> | <math>1-x^2\,</math> | <math>1\,</math> |
<math>L\,</math> | <math>-x\,</math> | <math>-3x\,</math> | <math>-2x\,</math> | <math>-2x\,</math> |
<math>R(x) =e^{\int \frac{L(x)}{Q(x)}\,dx}</math> | <math>(1-x^2)^{1/2}\,</math> | <math>(1-x^2)^{3/2}\,</math> | <math>1-x^2\,</math> | <math>e^{-x^2}\,</math> |
Constant in diff. equation, <math>{\lambda}_n\,</math> | <math>n^2\,</math> | <math>n(n+2)\,</math> | <math>n(n+1)\,</math> | <math>2n\,</math> |
Constant in Rodrigues formula, <math>e_n\,</math> | <math>(-2)^n\,\frac{\Gamma(n+1/2)}{\sqrt{\pi}}\,</math> | <math>2(-2)^n\,\frac{\Gamma(n+3/2)}{(n+1)\,\sqrt{\pi}}\,</math> | <math>(-2)^n\,n!\,</math> | <math>(-1)^n\,</math> |
Recurrence relation, <math>a_n\,</math> | <math>2\,</math> | <math>2\,</math> | <math>\frac{2n+1}{n+1}\,</math> | <math>2\,</math> |
Recurrence relation, <math>b_n\,</math> | <math>0\,</math> | <math>0\,</math> | <math>0\,</math> | <math>0\,</math> |
Recurrence relation, <math>c_n\,</math> | <math>1\,</math> | <math>1\,</math> | <math>\frac{n}{n+1}\,</math> | <math>2n\,</math> |
Name, and conventional symbol | Associated Laguerre, <math>L_n^{(\alpha)}</math> | Laguerre, <math>\ L_n</math> |
---|---|---|
Limits of orthogonality | <math>0, \infty\,</math> | <math>0, \infty\,</math> |
Weight, <math>W(x)\,</math> | <math>x^{\alpha}e^{-x}\,</math> | <math>e^{-x}\,</math> |
Standardization | Lead term = <math>\frac{(-1)^n}{n!}\,</math> | Lead term = <math>\frac{(-1)^n}{n!}\,</math> |
Square of norm, <math>h_n\,</math> | <math>1\,</math> | <math>1\,</math> |
Leading term, <math>k_n\,</math> | <math>\frac{(-1)^n}{n!}\,</math> | <math>\frac{(-1)^n}{n!}\,</math> |
Second term, <math>k'_n\,</math> | <math>\frac{(-1)^{n+1}(n+\alpha)}{(n-1)!}\,</math> | <math>\frac{(-1)^{n+1}n}{(n-1)!}\,</math> |
<math>Q\,</math> | <math>x\,</math> | <math>x\,</math> |
<math>L\,</math> | <math>\alpha+1-x\,</math> | <math>1-x\,</math> |
<math>R(x) =e^{\int \frac{L(x)}{Q(x)}\,dx}</math> | <math>x^{\alpha+1}\,e^{-x}\,</math> | <math>x\,e^{-x}\,</math> |
Constant in diff. equation, <math>{\lambda}_n\,</math> | <math>n\,</math> | <math>n\,</math> |
Constant in Rodrigues formula, <math>e_n\,</math> | <math>n!\,</math> | <math>n!\,</math> |
Recurrence relation, <math>a_n\,</math> | <math>\frac{-1}{n+1}\,</math> | <math>\frac{-1}{n+1}\,</math> |
Recurrence relation, <math>b_n\,</math> | <math>\frac{2n+1+\alpha}{n+1}\,</math> | <math>\frac{2n+1}{n+1}\,</math> |
Recurrence relation, <math>c_n\,</math> | <math>\frac{n+\alpha}{n+1}\,</math> | <math>\frac{n}{n+1}\,</math> |
Name, and conventional symbol | Gegenbauer, <math>C_n^{(\alpha)}</math> | Jacobi, <math>P_n^{(\alpha, \beta)}</math> |
---|---|---|
Limits of orthogonality | <math>-1, 1\,</math> | <math>-1, 1\,</math> |
Weight, <math>W(x)\,</math> | <math>(1-x^2)^{\alpha-1/2}\,</math> | <math>(1-x)^\alpha(1+x)^\beta\,</math> |
Standardization | <math>C_n^{(\alpha)}(1)=\frac{\Gamma(n+2\alpha)}{n!\,\Gamma(2\alpha)}\,</math> if <math>\alpha\ne0</math> | <math>P_n^{(\alpha, \beta)}(1)=\frac{\Gamma(n+1+\alpha)}{n!\,\Gamma(1+\alpha)}\,</math> |
Square of norm, <math>h_n\,</math> | <math>\frac{\pi\,2^{1-2\alpha}\Gamma(n+2\alpha)}{n!(n+\alpha)(\Gamma(\alpha))^2}</math> | <math>\frac{2^{\alpha+\beta+1}\,\Gamma(n\!+\!\alpha\!+\!1)\,\Gamma(n\!+\!\beta\!+\!1)} {n!(2n\!+\!\alpha\!+\!\beta\!+\!1)\Gamma(n\!+\!\alpha\!+\!\beta\!+\!1)}</math> |
Leading term, <math>k_n\,</math> | <math>\frac{\Gamma(2n+2\alpha)\Gamma(1/2+\alpha)}{n!\,2^n\,\Gamma(2\alpha)\Gamma(n+1/2+\alpha)}\,</math> | <math>\frac{\Gamma(2n+1+\alpha+\beta)}{n!\,2^n\,\Gamma(n+1+\alpha+\beta)}\,</math> |
Second term, <math>k'_n\,</math> | <math>0\,</math> | <math>\frac{(\alpha-\beta)\,\Gamma(2n+\alpha+\beta)}{(n-1)!\,2^n\,\Gamma(n+1+\alpha+\beta)}\,</math> |
<math>Q\,</math> | <math>1-x^2\,</math> | <math>1-x^2\,</math> |
<math>L\,</math> | <math>-(2\alpha+1)\,x\,</math> | <math>\beta-\alpha-(\alpha+\beta+2)\,x\,</math> |
<math>R(x) =e^{\int \frac{L(x)}{Q(x)}\,dx}</math> | <math>(1-x^2)^{\alpha+1/2}\,</math> | <math>(1-x)^{\alpha+1}(1+x)^{\beta+1}\,</math> |
Constant in diff. equation, <math>{\lambda}_n\,</math> | <math>n(n+2\alpha)\,</math> | <math>n(n+1+\alpha+\beta)\,</math> |
Constant in Rodrigues formula, <math>e_n\,</math> | <math>\frac{(-2)^n\,n!\,\Gamma(2\alpha)\,\Gamma(n\!+\!1/2\!+\!\alpha)} {\Gamma(n\!+\!2\alpha)\Gamma(\alpha\!+\!1/2)}</math> | <math>(-2)^n\,n!\,</math> |
Recurrence relation, <math>a_n\,</math> | <math>\frac{2(n+\alpha)}{n+1}\,</math> | <math>\frac{(2n+1+\alpha+\beta)(2n+2+\alpha+\beta)}{2(n+1)(n+1+\alpha+\beta)}</math> |
Recurrence relation, <math>b_n\,</math> | <math>0\,</math> | <math>\frac{({\alpha}^2-{\beta}^2)(2n+1+\alpha+\beta)}{2(n+1)(2n+\alpha+\beta)(n+1+\alpha+\beta)}</math> |
Recurrence relation, <math>c_n\,</math> | <math>\frac{n+2{\alpha}-1}{n+1}\,</math> | <math>\frac{(n+\alpha)(n+\beta)(2n+2+\alpha+\beta)}{(n+1)(n+1+\alpha+\beta)(2n+\alpha+\beta)}</math> |