Cramer's rule
Template:Use American English Template:Short description In linear algebra, Cramer's rule is an explicit formula for the solution of a system of linear equations with as many equations as unknowns, valid whenever the system has a unique solution. It expresses the solution in terms of the determinants of the (square) coefficient matrix and of matrices obtained from it by replacing one column by the column vector of right-sides of the equations. It is named after Gabriel Cramer, who published the rule for an arbitrary number of unknowns in 1750,<ref>Template:Cite web</ref><ref>Template:Cite journal</ref> although Colin Maclaurin also published special cases of the rule in 1748,<ref>Template:Cite book</ref> and possibly knew of it as early as 1729.<ref>Template:Cite book</ref><ref>Template:Cite book</ref><ref>Template:Cite journal</ref>
Cramer's rule, implemented in a naive way, is computationally inefficient for systems of more than two or three equations.<ref name="Poole2014">Template:Cite book</ref> In the case of Template:Mvar equations in Template:Mvar unknowns, it requires computation of Template:Math determinants, while Gaussian elimination produces the result with the same (up to a constant factor independent of Template:Tmath) computational complexity as the computation of a single determinant.<ref name="HoffmanFrankel2001">Template:Cite book</ref><ref name="Shores2007">Template:Cite book</ref> Moreover, Bareiss algorithm is a simple modification of Gaussian elimination that produces in a single computation a matrix whose nonzero entries are the determinants involved in Cramer's rule.
General case
[edit]Consider a system of Template:Mvar linear equations for Template:Mvar unknowns, represented in matrix multiplication form as follows:
- <math> A\mathbf{x} = \mathbf{b}</math>
where the Template:Math matrix Template:Mvar has a nonzero determinant, and the vector <math> \mathbf{x} = (x_1, \ldots, x_n)^\mathsf{T} </math> is the column vector of the variables. Then the theorem states that in this case the system has a unique solution, whose individual values for the unknowns are given by:
- <math> x_i = \frac{\det(A_i)}{\det(A)} \qquad i = 1, \ldots, n</math>
where <math> A_i </math> is the matrix formed by replacing the Template:Mvar-th column of Template:Mvar by the column vector Template:Math.
A more general version of Cramer's rule<ref>Template:Cite journal</ref> considers the matrix equation
- <math> AX = B</math>
where the Template:Math matrix Template:Mvar has a nonzero determinant, and Template:Mvar, Template:Mvar are Template:Math matrices. Given sequences <math> 1 \leq i_1 < i_2 < \cdots < i_k \leq n </math> and <math> 1 \leq j_1 < j_2 < \cdots < j_k \leq m </math>, let <math> X_{I,J} </math> be the Template:Math submatrix of Template:Mvar with rows in <math> I := (i_1, \ldots, i_k ) </math> and columns in <math> J := (j_1, \ldots, j_k ) </math>. Let <math> A_{B}(I,J) </math> be the Template:Math matrix formed by replacing the <math>i_s</math> column of Template:Mvar by the <math>j_s</math> column of Template:Mvar, for all <math> s = 1,\ldots, k </math>. Then
- <math> \det X_{I,J} = \frac{\det(A_{B}(I,J))}{\det(A)}. </math>
In the case <math> k = 1 </math>, this reduces to the normal Cramer's rule.
The rule holds for systems of equations with coefficients and unknowns in any field, not just in the real numbers.
Proof
[edit]The proof for Cramer's rule uses the following properties of the determinants: linearity with respect to any given column and the fact that the determinant is zero whenever two columns are equal, which is implied by the property that the sign of the determinant flips if you switch two columns.
Fix the index Template:Math of a column, and consider that the entries of the other columns have fixed values. This makes the determinant a function of the entries of the Template:Mvarth column. Linearity with respect to this column means that this function has the form
- <math>D_j(a_{1,j}, \ldots, a_{n,j})= C_{1,j}a_{1,j}+\cdots, C_{n,j}a_{n,j},</math>
where the <math>C_{i,j}</math> are coefficients that depend on the entries of Template:Mvar that are not in column Template:Mvar. So, one has
- <math>\det(A)=D_j(a_{1,j}, \ldots, a_{n,j})=C_{1,j}a_{1,j}+\cdots, C_{n,j}a_{n,j}</math>
(Laplace expansion provides a formula for computing the <math>C_{i,j}</math> but their expression is not important here.)
If the function <math>D_j</math> is applied to any other column Template:Math of Template:Mvar, then the result is the determinant of the matrix obtained from Template:Mvar by replacing column Template:Math by a copy of column Template:Math, so the resulting determinant is 0 (the case of two equal columns).
Now consider a system of Template:Mvar linear equations in Template:Mvar unknowns <math>x_1, \ldots,x_n</math>, whose coefficient matrix is Template:Mvar, with det(A) assumed to be nonzero:
- <math>\begin{matrix}
a_{11}x_1+a_{12}x_2+\cdots+a_{1n}x_n&=&b_1\\ a_{21}x_1+a_{22}x_2+\cdots+a_{2n}x_n&=&b_2\\ &\vdots&\\ a_{n1}x_1+a_{n2}x_2+\cdots+a_{nn}x_n&=&b_n. \end{matrix}</math>
If one combines these equations by taking Template:Math times the first equation, plus Template:Math times the second, and so forth until Template:Math times the last, then for every Template:Mvar the resulting coefficient of Template:Mvar becomes
- <math>D_j(a_{1,k},\ldots,a_{n,k}).</math>
So, all coefficients become zero, except the coefficient of <math>x_j</math> that becomes <math>\det(A).</math> Similarly, the constant coefficient becomes <math>D_j(b_1,\ldots,b_n),</math> and the resulting equation is thus
- <math>\det(A)x_j=D_j(b_1,\ldots, b_n),</math>
which gives the value of <math>x_j</math> as
- <math>x_j=\frac1{\det(A)}D_j(b_1,\ldots, b_n).</math>
As, by construction, the numerator is the determinant of the matrix obtained from Template:Mvar by replacing column Template:Math by Template:Math, we get the expression of Cramer's rule as a necessary condition for a solution.
It remains to prove that these values for the unknowns form a solution. Let Template:Mvar be the Template:Math matrix that has the coefficients of <math>D_j</math> as Template:Mvarth row, for <math>j=1,\ldots,n</math> (this is the adjugate matrix for Template:Mvar). Expressed in matrix terms, we have thus to prove that
- <math>\mathbf x = \frac1{\det(A)}M\mathbf b</math>
is a solution; that is, that
- <math>A\left(\frac1{\det(A)}M\right)\mathbf b=\mathbf b.</math>
For that, it suffices to prove that
- <math>A\,\left(\frac1{\det(A)}M\right)=I_n,</math>
where <math>I_n</math> is the identity matrix.
The above properties of the functions <math>D_j</math> show that one has Template:Math, and therefore,
- <math>\left(\frac1{\det(A)}M\right)\,A=I_n.</math>
This completes the proof, since a left inverse of a square matrix is also a right-inverse (see Invertible matrix theorem).
For other proofs, see below.
Finding inverse matrix
[edit]Template:Main article Let Template:Mvar be an Template:Math matrix with entries in a field Template:Math. Then
- <math>A\,\operatorname{adj}(A) = \operatorname{adj}(A)\,A=\det(A) I</math>
where Template:Math denotes the adjugate matrix, Template:Math is the determinant, and Template:Math is the identity matrix. If Template:Math is nonzero, then the inverse matrix of Template:Mvar is
- <math>A^{-1} = \frac{1}{\det(A)} \operatorname{adj}(A).</math>
This gives a formula for the inverse of Template:Mvar, provided Template:Math. In fact, this formula works whenever Template:Math is a commutative ring, provided that Template:Math is a unit. If Template:Math is not a unit, then Template:Mvar is not invertible over the ring (it may be invertible over a larger ring in which some non-unit elements of Template:Mvar may be invertible).
Applications
[edit]Explicit formulas for small systems
[edit]Consider the linear system
- <math>\left\{\begin{matrix}
a_1x + b_1y&= {\color{red}c_1}\\ a_2x + b_2y&= {\color{red}c_2} \end{matrix}\right.</math>
which in matrix format is
- <math>\begin{bmatrix} a_1 & b_1 \\ a_2 & b_2 \end{bmatrix}\begin{bmatrix} x \\ y \end{bmatrix}=\begin{bmatrix} {\color{red}c_1} \\ {\color{red}c_2} \end{bmatrix}.</math>
Assume Template:Math is nonzero. Then, with the help of determinants, Template:Mvar and Template:Mvar can be found with Cramer's rule as
- <math>\begin{align}
x &= \frac{\begin{vmatrix} {\color{red}{c_1}} & b_1 \\ {\color{red}{c_2}} & b_2 \end{vmatrix}}{\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}} = { {\color{red}c_1}b_2 - b_1{\color{red}c_2} \over a_1b_2 - b_1a_2}, \quad y = \frac{\begin{vmatrix} a_1 & {\color{red}{c_1}} \\ a_2 & {\color{red}{c_2}} \end{vmatrix}}{\begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}} = { a_1{\color{red}c_2} - {\color{red}c_1}a_2 \over a_1b_2 - b_1a_2} \end{align}.</math>
The rules for Template:Math matrices are similar. Given
- <math>\left\{\begin{matrix}
a_1x + b_1y + c_1z&= {\color{red}d_1}\\ a_2x + b_2y + c_2z&= {\color{red}d_2}\\ a_3x + b_3y + c_3z&= {\color{red}d_3} \end{matrix}\right.</math>
which in matrix format is
- <math>\begin{bmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{bmatrix}\begin{bmatrix} x \\ y \\ z \end{bmatrix}=\begin{bmatrix} {\color{red}d_1} \\ {\color{red}d_2} \\ {\color{red}d_3} \end{bmatrix}.</math>
Then the values of Template:Mvar and Template:Mvar can be found as follows:
- <math>x = \frac{\begin{vmatrix} {\color{red}d_1} & b_1 & c_1 \\ {\color{red}d_2} & b_2 & c_2 \\ {\color{red}d_3} & b_3 & c_3 \end{vmatrix} } { \begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}}, \quad
y = \frac {\begin{vmatrix} a_1 & {\color{red}d_1} & c_1 \\ a_2 & {\color{red}d_2} & c_2 \\ a_3 & {\color{red}d_3} & c_3 \end{vmatrix}} {\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix}}, \text{ and } z = \frac { \begin{vmatrix} a_1 & b_1 & {\color{red}d_1} \\ a_2 & b_2 & {\color{red}d_2} \\ a_3 & b_3 & {\color{red}d_3} \end{vmatrix}} {\begin{vmatrix} a_1 & b_1 & c_1 \\ a_2 & b_2 & c_2 \\ a_3 & b_3 & c_3 \end{vmatrix} }.</math>
Differential geometry
[edit]Ricci calculus
[edit]Cramer's rule is used in the Ricci calculus in various calculations involving the Christoffel symbols of the first and second kind.<ref>Template:Cite book</ref>
In particular, Cramer's rule can be used to prove that the divergence operator on a Riemannian manifold is invariant with respect to change of coordinates. We give a direct proof, suppressing the role of the Christoffel symbols. Let <math>(M,g)</math> be a Riemannian manifold equipped with local coordinates <math> (x^1, x^2, \dots, x^n)</math>. Let <math>A=A^i \frac{\partial}{\partial x^i}</math> be a vector field. We use the summation convention throughout.
- Theorem.
- The divergence of <math>A</math>,
- <math> \operatorname{div} A = \frac{1}{\sqrt{\det g}} \frac{\partial}{\partial x^i} \left( A^i \sqrt{\det g} \right),</math>
- is invariant under change of coordinates.
Template:Collapse top Let <math>(x^1,x^2,\ldots,x^n)\mapsto (\bar x^1,\ldots,\bar x^n)</math> be a coordinate transformation with non-singular Jacobian. Then the classical transformation laws imply that <math>A=\bar A^{k}\frac{\partial}{\partial\bar x^{k}}</math> where <math>\bar A^{k}=\frac{\partial \bar x^{k}}{\partial x^{j}}A^{j}</math>. Similarly, if <math>g=g_{mk}\,dx^{m}\otimes dx^{k}=\bar{g}_{ij}\,d\bar x^{i}\otimes d\bar x^{j}</math>, then <math>\bar{g}_{ij}=\,\frac{\partial x^{m}}{\partial\bar x^{i}}\frac{\partial x^{k}}{\partial \bar x^{j}}g_{mk}</math>. Writing this transformation law in terms of matrices yields <math>\bar g=\left(\frac{\partial x}{\partial\bar{x}}\right)^{\text{T}}g\left(\frac{\partial x}{\partial\bar{x}}\right)</math>, which implies <math>\det\bar g=\left(\det\left(\frac{\partial x}{\partial\bar{x}}\right)\right)^{2}\det g</math>.
Now one computes
- <math>\begin{align}
\operatorname{div} A &=\frac{1}{\sqrt{\det g}}\frac{\partial}{\partial x^{i}}\left( A^{i}\sqrt{\det g}\right)\\ &=\det\left(\frac{\partial x}{\partial\bar{x}}\right)\frac{1}{\sqrt{\det\bar g}}\frac{\partial \bar x^k}{\partial x^{i}}\frac{\partial}{\partial\bar x^{k}}\left(\frac{\partial x^{i}}{\partial \bar x^{\ell}}\bar{A}^{\ell}\det\!\left(\frac{\partial x}{\partial\bar{x}}\right)^{\!\!-1}\!\sqrt{\det\bar g}\right). \end{align}</math> In order to show that this equals <math>\frac{1}{\sqrt{\det\bar g}}\frac{\partial}{\partial\bar x^{k}}\left(\bar A^{k}\sqrt{\det\bar{g}}\right)</math>, it is necessary and sufficient to show that
- <math>\frac{\partial\bar x^{k}}{\partial x^{i}}\frac{\partial}{\partial\bar x^{k}}\left(\frac{\partial x^{i}}{\partial \bar x^{\ell}}\det\!\left(\frac{\partial x}{\partial\bar{x}}\right)^{\!\!\!-1}\right)=0\qquad\text{for all } \ell, </math>
which is equivalent to
- <math>\frac{\partial}{\partial \bar x^{\ell}}\det\left(\frac{\partial x}{\partial\bar{x}}\right)
=\det\left(\frac{\partial x}{\partial\bar{x}}\right)\frac{\partial\bar x^{k}}{\partial x^{i}}\frac{\partial^{2}x^{i}}{\partial\bar x^{k}\partial\bar x^{\ell}}. </math> Carrying out the differentiation on the left-hand side, we get:
- <math>\begin{align}
\frac{\partial}{\partial\bar x^{\ell}}\det\left(\frac{\partial x}{\partial\bar{x}}\right) &=(-1)^{i+j}\frac{\partial^{2}x^{i}}{\partial\bar x^{\ell}\partial\bar x^{j}}\det M(i|j)\\ &=\frac{\partial^{2}x^{i}}{\partial\bar x^{\ell}\partial\bar x^{j}}\det\left(\frac{\partial x}{\partial\bar{x}}\right)\frac{(-1)^{i+j}}{\det\left(\frac{\partial x}{\partial\bar{x}}\right)}\det M(i|j)=(\ast), \end{align}</math> where <math>M(i|j)</math> denotes the matrix obtained from <math>\left(\frac{\partial x}{\partial\bar{x}}\right)</math> by deleting the <math>i</math>th row and <math>j</math>th column. But Cramer's Rule says that
- <math>\frac{(-1)^{i+j}}{\det\left(\frac{\partial x}{\partial\bar{x}}\right)}\det M(i|j) </math>
is the <math>(j,i)</math>th entry of the matrix <math>\left(\frac{\partial \bar{x}}{\partial x}\right)</math>. Thus
- <math>(\ast)=\det\left(\frac{\partial x}{\partial\bar{x}}\right)\frac{\partial^{2}x^{i}}{\partial\bar x^{\ell}\partial\bar x^{j}}\frac{\partial\bar x^{j}}{\partial x^{i}},</math>
completing the proof.
Computing derivatives implicitly
[edit]Consider the two equations <math>F(x, y, u, v) = 0</math> and <math>G(x, y, u, v) = 0</math>. When u and v are independent variables, we can define <math>x = X(u, v)</math> and <math>y = Y(u, v).</math>
An equation for <math>\dfrac{\partial x}{\partial u}</math> can be found by applying Cramer's rule.
Template:Collapse top First, calculate the first derivatives of F, G, x, and y:
- <math>\begin{align}
dF &= \frac{\partial F}{\partial x} dx + \frac{\partial F}{\partial y} dy +\frac{\partial F}{\partial u} du +\frac{\partial F}{\partial v} dv = 0 \\[6pt] dG &= \frac{\partial G}{\partial x} dx + \frac{\partial G}{\partial y} dy +\frac{\partial G}{\partial u} du +\frac{\partial G}{\partial v} dv = 0 \\[6pt] dx &= \frac{\partial X}{\partial u} du + \frac{\partial X}{\partial v} dv \\[6pt] dy &= \frac{\partial Y}{\partial u} du + \frac{\partial Y}{\partial v} dv. \end{align}</math>
Substituting dx, dy into dF and dG, we have:
- <math>\begin{align}
dF &= \left(\frac{\partial F}{\partial x} \frac{\partial x}{\partial u} +\frac{\partial F}{\partial y} \frac{\partial y}{\partial u} + \frac{\partial F}{\partial u} \right) du + \left(\frac{\partial F}{\partial x} \frac{\partial x}{\partial v} +\frac{\partial F}{\partial y} \frac{\partial y}{\partial v} +\frac{\partial F}{\partial v} \right) dv = 0 \\ [6pt] dG &= \left(\frac{\partial G}{\partial x} \frac{\partial x}{\partial u} +\frac{\partial G}{\partial y} \frac{\partial y}{\partial u} +\frac{\partial G}{\partial u} \right) du + \left(\frac{\partial G}{\partial x} \frac{\partial x}{\partial v} +\frac{\partial G}{\partial y} \frac{\partial y}{\partial v} +\frac{\partial G}{\partial v} \right) dv = 0. \end{align}</math>
Since u, v are both independent, the coefficients of du, dv must be zero. So we can write out equations for the coefficients:
- <math>\begin{align}
\frac{\partial F}{\partial x} \frac{\partial x}{\partial u} +\frac{\partial F}{\partial y} \frac{\partial y}{\partial u} & = -\frac{\partial F}{\partial u} \\[6pt] \frac{\partial G}{\partial x} \frac{\partial x}{\partial u} +\frac{\partial G}{\partial y} \frac{\partial y}{\partial u} & = -\frac{\partial G}{\partial u} \\[6pt] \frac{\partial F}{\partial x} \frac{\partial x}{\partial v} +\frac{\partial F}{\partial y} \frac{\partial y}{\partial v} & = -\frac{\partial F}{\partial v} \\[6pt] \frac{\partial G}{\partial x} \frac{\partial x}{\partial v} +\frac{\partial G}{\partial y} \frac{\partial y}{\partial v} & = -\frac{\partial G}{\partial v}. \end{align}</math>
Now, by Cramer's rule, we see that:
- <math>\frac{\partial x}{\partial u} = \frac{\begin{vmatrix} -\frac{\partial F}{\partial u} & \frac{\partial F}{\partial y} \\ -\frac{\partial G}{\partial u} & \frac{\partial G}{\partial y}\end{vmatrix}}{\begin{vmatrix}\frac{\partial F}{\partial x} & \frac{\partial F}{\partial y} \\ \frac{\partial G}{\partial x} & \frac{\partial G}{\partial y}\end{vmatrix}}.</math>
This is now a formula in terms of two Jacobians:
- <math>\frac{\partial x}{\partial u} = -\frac{\left(\frac{\partial (F, G)}{\partial (u, y)}\right)}{\left(\frac{\partial (F, G)}{\partial(x, y)}\right)}.</math>
Similar formulas can be derived for <math>\frac{\partial x}{\partial v}, \frac{\partial y}{\partial u}, \frac{\partial y}{\partial v}.</math> Template:Collapse bottom
Integer programming
[edit]Cramer's rule can be used to prove that an integer programming problem whose constraint matrix is totally unimodular and whose right-hand side is integer, has integer basic solutions. This makes the integer program substantially easier to solve.
Ordinary differential equations
[edit]Cramer's rule is used to derive the general solution to an inhomogeneous linear differential equation by the method of variation of parameters.
Example
[edit]Consider the linear system
- <math>\begin{matrix}
12x + 3y&= 15\\ 2x - 3y&= 13 \end{matrix}</math>
Applying Cramer's Rule gives
- <math>\begin{align}
x &= \frac{\begin{vmatrix} 15 & 3 \\ {13} & -3 \end{vmatrix}}{\begin{vmatrix} 12 & 3 \\ 2 & -3 \end{vmatrix}} = { -84 \over -42} = {\color{red}2}, \quad y = \frac{\begin{vmatrix} 12 & 15 \\ 2 & {13} \end{vmatrix}}{\begin{vmatrix} 12 & 3 \\ 2 & -3 \end{vmatrix}} = -{ 126 \over 42} = {\color{red}-3} \end{align}.</math>
These values can be verified by substituting back into the original equations: <math display="block"> 12x + 3y = (12 \times {\color{red}2}) + (3 \times ({\color{red}-3})) = 24 - 9 = 15 </math> and <math display="block"> 2x - 3y = (2 \times {\color{red}2}) - (3 \times ({\color{red}-3})) = 4 - (-9) = 13, </math>
as required.
Geometric interpretation
[edit]Cramer's rule has a geometric interpretation that can be considered also a proof or simply giving insight about its geometric nature. These geometric arguments work in general and not only in the case of two equations with two unknowns presented here.
Given the system of equations
- <math>\begin{matrix}a_{11}x_1+a_{12}x_2&=b_1\\a_{21}x_1+a_{22}x_2&=b_2\end{matrix}</math>
it can be considered as an equation between vectors
- <math>x_1\binom{a_{11}}{a_{21}}+x_2\binom{a_{12}}{a_{22}}=\binom{b_1}{b_2}. </math>
The area of the parallelogram determined by <math>\binom{a_{11}}{a_{21}}</math> and <math>\binom{a_{12}}{a_{22}}</math> is given by the determinant of the system of equations:
- <math>\begin{vmatrix}a_{11}&a_{12}\\a_{21}&a_{22}\end{vmatrix}.</math>
In general, when there are more variables and equations, the determinant of Template:Mvar vectors of length Template:Mvar will give the volume of the parallelepiped determined by those vectors in the Template:Mvar-th dimensional Euclidean space.
Therefore, the area of the parallelogram determined by <math>x_1\binom{a_{11}}{a_{21}}</math> and <math>\binom{a_{12}}{a_{22}}</math> has to be <math>x_1</math> times the area of the first one since one of the sides has been multiplied by this factor. Now, this last parallelogram, by Cavalieri's principle, has the same area as the parallelogram determined by <math>\binom{b_1}{b_2}=x_1\binom{a_{11}}{a_{21}}+x_2\binom{a_{12}}{a_{22}}</math> and <math>\binom{a_{12}}{a_{22}}.</math>
Equating the areas of this last and the second parallelogram gives the equation
- <math>\begin{vmatrix}b_1&a_{12}\\b_2&a_{22}\end{vmatrix} = \begin{vmatrix}a_{11}x_1&a_{12}\\a_{21}x_1&a_{22}\end{vmatrix} =x_1 \begin{vmatrix}a_{11}&a_{12}\\a_{21}&a_{22}\end{vmatrix} </math>
from which Cramer's rule follows.
Other proofs
[edit]A proof by abstract linear algebra
[edit]This is a restatement of the proof above in abstract language.
Consider the map <math>\mathbf{x}=(x_1,\ldots, x_n) \mapsto \frac{1}{\det A} \left(\det (A_1),\ldots, \det(A_n)\right),</math> where <math>A_i</math> is the matrix <math>A</math> with <math>\mathbf{x}</math> substituted in the <math>i</math>th column, as in Cramer's rule. Because of linearity of determinant in every column, this map is linear. Observe that it sends the <math>i</math>th column of <math>A</math> to the <math>i</math>th basis vector <math>\mathbf{e}_i=(0,\ldots, 1, \ldots, 0) </math> (with 1 in the <math>i</math>th place), because determinant of a matrix with a repeated column is 0. So we have a linear map which agrees with the inverse of <math>A</math> on the column space; hence it agrees with <math>A^{-1}</math> on the span of the column space. Since <math>A</math> is invertible, the column vectors span all of <math>\mathbb{R}^n</math>, so our map really is the inverse of <math>A</math>. Cramer's rule follows.
A short proof
[edit]A short proof of Cramer's rule <ref>Template:Cite journal</ref> can be given by noticing that <math>x_1</math> is the determinant of the matrix
- <math>X_1=\begin{bmatrix}
x_1 & 0 & 0 & \cdots & 0\\ x_2 & 1 & 0 & \cdots & 0\\ x_3 & 0 & 1 & \cdots & 0\\ \vdots & \vdots & \vdots & \ddots &\vdots \\ x_n & 0 & 0 & \cdots & 1 \end{bmatrix}</math>
On the other hand, assuming that our original matrix Template:Mvar is invertible, this matrix <math>X_1</math> has columns <math>A^{-1}\mathbf{b}, A^{-1}\mathbf{v}_2, \ldots, A^{-1}\mathbf{v}_n </math>, where <math>\mathbf{v}_n</math> is the n-th column of the matrix Template:Mvar. Recall that the matrix <math>A_1</math> has columns <math>\mathbf{b}, \mathbf{v}_2, \ldots, \mathbf{v}_n </math>, and therefore <math>X_1=A^{-1}A_1</math>. Hence, by using that the determinant of the product of two matrices is the product of the determinants, we have
- <math> x_1= \det (X_1) = \det (A^{-1}) \det (A_1)= \frac{\det (A_1)}{\det (A)}.</math>
The proof for other <math>x_j</math> is similar.
Using Geometric Algebra
[edit]Inconsistent and indeterminate cases
[edit]A system of equations is said to be inconsistent when there are no solutions and it is called indeterminate when there is more than one solution. For linear equations, an indeterminate system will have infinitely many solutions (if it is over an infinite field), since the solutions can be expressed in terms of one or more parameters that can take arbitrary values.
Cramer's rule applies to the case where the coefficient determinant is nonzero. In the 2×2 case, if the coefficient determinant is zero, then the system is inconsistent if the numerator determinants are nonzero, or indeterminate if the numerator determinants are zero.
For 3×3 or higher systems, the only thing one can say when the coefficient determinant equals zero is that if any of the numerator determinants are nonzero, then the system must be inconsistent. However, having all determinants zero does not imply that the system is indeterminate. A simple example where all determinants vanish (equal zero) but the system is still inconsistent is the 3×3 system x+y+z=1, x+y+z=2, x+y+z=3.