Proof about system of linear equations in echelon form

In summary: It's not hard to see that the same thing happens if we interchange the order of the equations: ##x_1 = 4##, ##x_2 = 1##, and ##x_3 = 5##. So, the system of echelon equations is a reliable way to find solutions to systems of linear equations in any order.
  • #1
s3a
818
8

Homework Statement


Problem:
Consider a system of linear equations in echelon form with r equations and n unknowns.

Prove the following.:

(i) If r = n, then the system has a unique solution.

(ii) If r < n, then we can arbitrarily assign values to the n - r free variables and solve uniquely for the r pivot variables, obtaining a solution of the system.

Solution:
(i) Suppose r = n. Then we have a square system AX = B where the matrix A of coefficient sis (upper) triangular with nonzero diagonal elements. Thus, A is invertible. By Theorem 3.10, the system has a unique solution.

(ii) Assigning values to the n - r free variables yields a triangular system in the pivot variables, which, by (i), has a unique solution.

Statement of Theorem 3.10:
Suppose the field K is infinite. Then the system AX = B has: (a) a unique solution, (b) no solution, or (c) an infinite number of solutions.

Homework Equations


There is a theorem (that I found online) which states than an upper triangular matrix is invertible if and only if all of its diagonal elements are non zero.

The Attempt at a Solution


I understand the solution for part (i) up until and including "Thus, A is invertible.", but I don't get the part that says "By Theorem 3.10, the system has a unique solution.". How does one come to that conclusion from Theorem 3.10? In other words, how does one determine from Theorem 3.10 that the system has a unique solution as opposed to having no solution or an infinite number of solutions? I'm not asking for a general way to justify determining that the system has a unique solution; I'm asking for a way to justify that the system has a unique solution by specifically using Theorem 3.10.

If something I said is unclear, let me know.

Any help in understanding the solution of this proof would be greatly appreciated!
 
Physics news on Phys.org
  • #2
s3a said:

Homework Statement


Problem:
Consider a system of linear equations in echelon form with r equations and n unknowns.

Prove the following.:

(i) If r = n, then the system has a unique solution.

(ii) If r < n, then we can arbitrarily assign values to the n - r free variables and solve uniquely for the r pivot variables, obtaining a solution of the system.

Solution:
(i) Suppose r = n. Then we have a square system AX = B where the matrix A of coefficient sis (upper) triangular with nonzero diagonal elements. Thus, A is invertible. By Theorem 3.10, the system has a unique solution.

(ii) Assigning values to the n - r free variables yields a triangular system in the pivot variables, which, by (i), has a unique solution.

Statement of Theorem 3.10:
Suppose the field K is infinite. Then the system AX = B has: (a) a unique solution, (b) no solution, or (c) an infinite number of solutions.

Homework Equations


There is a theorem (that I found online) which states than an upper triangular matrix is invertible if and only if all of its diagonal elements are non zero.

The Attempt at a Solution


I understand the solution for part (i) up until and including "Thus, A is invertible.", but I don't get the part that says "By Theorem 3.10, the system has a unique solution.". How does one come to that conclusion from Theorem 3.10? In other words, how does one determine from Theorem 3.10 that the system has a unique solution as opposed to having no solution or an infinite number of solutions? I'm not asking for a general way to justify determining that the system has a unique solution; I'm asking for a way to justify that the system has a unique solution by specifically using Theorem 3.10.

If something I said is unclear, let me know.

Any help in understanding the solution of this proof would be greatly appreciated!

You don't need Theorem 3.10; in fact, appealing to the theorem without knowing what it means or where it comes from seems to me to be counterproductive. It is much more revealing to recall why we bother with echelon forms in the first place: they make solving the equations a snap.

For example, suppose your echelon form is
[tex] \begin{array}{ccc|c}
1 & 2 & 3 & 4\\
0 & 1 & 2 & 15 \\
0 & 0 & 1 & 6
\end{array} [/tex]
This is just a shorthand form for the system of equations
[tex]
\begin{array}{ccccccc}
x_1 & +& 2 x_2 & + &3x_3 & = & 4\\
& & x_2 & + & 2x_3 & =& 1 5 \\
& & & & x_3 & =& 6
\end{array} [/tex]
which has an obviously unique solution: first, ##x_3 = 6##, then put that value into the second equation and get the value of ##x_2 = 15 - 2x_3 = 15 - 12 = 3##. Then put the now-known values of ##x_2, x_3## into the first equation; that gives you a unique value for ##x_1##.

It works in exactly the same way for the general case of r = n.
 
  • #3
Thanks for the response, and I get the intuition (i.e., what you said), but I not only want to know how to express it formally (as if I had to prove it on an exam), but I actually do care about appealing to Theorem 3.10 (because when I see something that I don't get in my books, I don't like to move on :wink:).

So, could you please help me understand my book's proof (which makes use of Theorem 3.10)?
 
Last edited:
  • #4
Here's the augmented matrix that Ray showed. The 3 x 3 matrix at the left is in echelon form, and r (number of equations, or number of rows) = n (number of variables, or number of columns in matrix A).
$$ \begin{array}{ccc|c}
1 & 2 & 3 & 4\\
0 & 1 & 2 & 15 \\
0 & 0 & 1 & 6
\end{array} $$
Pretty clearly this matrix equation has a unique solution - you can solve for z, then for y, and finally for x.The unique solution represents a single point in R3.

Now suppose we ended up with this augmented matrix, which is in echelon form.
$$ \begin{array}{ccc|c}
1 & 2 & 3 & 4\\
0 & 1 & 2 & 15 \\
0 & 0 & 0 & 0 \end{array} $$
Here r < n (i.e., there are two rows with nonzero leading entries, so we can arbitrarily assign values to the n - r = 3 - 2 = 1 free variable, and solve for the r = 2 pivot variables in terms of the free variable. Since the free variable can take on any value, there are an infinite number of solutions.
Geometrically, the solution space is a line in R3, produced by all three planes intersecting in a line..

Last, suppose we ended with this augmented matrix, which is almost identical to the previous one, except for the nonzero value in the bottom row.
$$ \begin{array}{ccc|c}
1 & 2 & 3 & 4\\
0 & 1 & 2 & 15 \\
0 & 0 & 0 & 2 \end{array} $$
Again, we have r < n, but this time the last row means that the third equation is 0x + 0y + 0z = 2, which has no solution at all, no matter what we choose for x, y, or z. The system of equations that this matrix represents is inconsistent.

Geometrically, we could have a number of situations that cause this, including two of the planes being parallel, or all three being parallel. It's possible that each pair of planes intersect (in a line), but there is no point that is on all three planes.
 
  • #5
s3a said:
Thanks for the response, and I get the intuition (i.e., what you said), but I not only want to know how to express it formally (as if I had to prove it on an exam), but I actually do care about appealing to Theorem 3.10 (because when I see something that I don't get in my books, I don't like to move on :wink:).

So, could you please help me understand my book's proof (which makes use of Theorem 3.10)?

If the solution you wrote is the book's, then I don't understand it. Theorem 3.10 makes no mention of matrix inverses, so I don't see how that comes into the solution at all. In fact, the only proofs I have seen of theorems like 3.10 START from the echelon form, then go through manipulations like I did (and like Mark 44 did) and thereby PROVE the theorem as output from that reasoning process (not as input). There may be other, completely different proofs of Theorem 3.10 that I have never seen (it would not surprise me), but the usual presentation is exactly backwards from the one you seem to want. So, I am unable to help you.

I suggest you go to the library and check out another book on the subject; it may present an alternative point of view that clarifies these issues for you.
 

Related to Proof about system of linear equations in echelon form

1. What is echelon form in a system of linear equations?

Echelon form is a special form of a system of linear equations in which the coefficients of each variable are arranged in a triangular pattern, with the leading coefficients (non-zero numbers) increasing from left to right as you move down the rows.

2. How do you know if a system of linear equations is in echelon form?

A system of linear equations is in echelon form if it satisfies the following conditions:

  • The leading coefficient of each row is 1
  • The leading coefficient of each row is to the right of the leading coefficient of the row above it
  • All elements below a leading coefficient are 0

3. Why is it important to put a system of linear equations in echelon form?

Putting a system of linear equations in echelon form makes it easier to solve and allows you to quickly identify the number of solutions the system has. It also helps to identify the pivot variables (variables with leading coefficients) and free variables (variables without leading coefficients).

4. Can any system of linear equations be put in echelon form?

Yes, any system of linear equations can be put in echelon form by using elementary row operations. These operations include swapping rows, multiplying a row by a non-zero number, and adding a multiple of one row to another row.

5. How does echelon form relate to the solution of a system of linear equations?

In echelon form, the solution of a system of linear equations can be easily determined. If there are no rows with all zeros except for the rightmost column, the system has a unique solution. If there is a row with all zeros except for the last column, the system has infinitely many solutions. If there is a row with all zeros in both the coefficients and constants columns, the system either has no solution or an infinite number of solutions.

Similar threads

  • Calculus and Beyond Homework Help
Replies
2
Views
155
  • Calculus and Beyond Homework Help
Replies
7
Views
1K
  • Calculus and Beyond Homework Help
Replies
6
Views
989
  • Calculus and Beyond Homework Help
Replies
14
Views
2K
  • Calculus and Beyond Homework Help
Replies
4
Views
80
  • Calculus and Beyond Homework Help
Replies
2
Views
358
  • Calculus and Beyond Homework Help
Replies
6
Views
430
  • Calculus and Beyond Homework Help
Replies
2
Views
558
  • Calculus and Beyond Homework Help
Replies
4
Views
1K
  • Calculus and Beyond Homework Help
Replies
10
Views
2K
Back
Top