Why Use Gram-Schmidt to Make a Unitary Matrix?

  • #1
nomadreid
Gold Member
TL;DR Summary
Why does Gram-Schmidt turn a matrix into a unitary one?
I understand the rationale for using the Gram-Schmidt process to find an orthogonal (or orthonormal) basis from a given set of linearly independent vectors (e.g., eigenvectors of a Hermitian matrix). However, the rationale for using it on the columns of a matrix in order to get a unitary matrix (for example, if one diagonalizes a matrix and gets a ##P## in ##PMP^{-1}## which is not unitary) is not clear. (Simply normalizing the columns doesn't work for all matrices.)

An intuitive explanation would be super. Thanks.
 
  • #2
What is the relationship between the columns of ##P## and the eigenvectors of ##M##?
 
  • #3
Pasmith, as far as I know, they are the same. So after applying G-S, I get them to be orthogonal. How does this make the matrix unitary?
 
  • #4
Traditionally, the only difference between an orthogonal and a unitary matrix is whether you are working over the real or the complex numbers, so I'm a bit confused about what exactly you're stuck on.

Gram-Schmidt over the complex numbers has some complex conjugates when computing the inner product, which you might be forgetting to include if you're getting your fields mixed up.
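For instance, a minimal numpy sketch of that conjugation (the vectors here are made up for illustration; np.vdot conjugates its first argument):

[CODE]
import numpy as np

x = np.array([1 + 1j, 0])
y = np.array([1j, 2])

# np.vdot conjugates its FIRST argument: sum(conj(x_i) * y_i)
print(np.vdot(x, y))  # (1-1j)*1j + 0*2 = (1+1j)
# Forgetting the conjugate gives a different number entirely:
print(np.dot(x, y))   # (1+1j)*1j + 0*2 = (-1+1j)
[/CODE]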
 
  • #5
Sorry about the delay in the reply.

A side remark: whether one is working over the real or the complex numbers does not seem to be the whole difference, since even when working over the complex numbers, although an orthogonal matrix is unitary, not all unitary matrices are orthogonal. But anyway, I get your point that they are related, with similar definitions. My question was a bit different.

But I found a partial answer to my own question for the case I was working on: there was a typo that made me think that a particular matrix was normal, when it wasn't normal, and hence not unitarily diagonalizable. It was diagonalizable, but the diagonalizing matrix (is there a term for the ##P## in ##PMP^{-1}##?) was not unitary, and I was looking for a way to turn it into a unitary matrix. Somewhere (oops, didn't keep the source) I saw a suggestion to do so via the Gram-Schmidt process. However (given the incorrect matrix), that would've been a dead end, so the question up to that point became invalid.

However, this gives rise to another question. Suppose I have a normal matrix, and I diagonalize it. There are many ways to diagonalize it, and I presume that not all of them will yield a unitary matrix. Am I correct that, in order to get a unitary matrix from a non-unitary diagonalizing matrix, I just have to normalize it? Or is that too simple?

Thanks for your patience and your help.
 
  • #6
nomadreid said:
A side remark: whether one is working over the real or the complex numbers does not seem to be the whole difference, since even when working over the complex numbers, although an orthogonal matrix is unitary, not all unitary matrices are orthogonal. But anyway, I get your point that they are related, with similar definitions. My question was a bit different.

The unitary matrices that contain only real numbers are orthogonal :)

nomadreid said:
However, this gives rise to another question. Suppose I have a normal matrix, and I diagonalize it. There are many ways to diagonalize it, and I presume that not all of them will yield a unitary matrix. Am I correct that, in order to get a unitary matrix from a non-unitary diagonalizing matrix, I just have to normalize it? Or is that too simple?

Just to check, your specific question here is, if ##AA^*=A^*A##, and you find some ##P## such that ##PAP^{-1}## is diagonal, is it guaranteed that ##P## is unitary?

I guess I agree it doesn't have to be. ##P## can have as columns any eigenvectors, and it will only be unitary if you pick eigenvectors of unit length. (For a normal matrix, eigenvectors belonging to distinct eigenvalues are automatically orthogonal; if an eigenvalue is repeated, you also need to pick an orthonormal basis within that eigenspace.)
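A quick numpy illustration of this (the Hermitian matrix is made up for the example):

[CODE]
import numpy as np

# A normal (in fact Hermitian) matrix with distinct eigenvalues 1 and 3.
M = np.array([[2, 1j],
              [-1j, 2]])
assert np.allclose(M @ M.conj().T, M.conj().T @ M)  # normality check

w, P = np.linalg.eig(M)             # columns of P are eigenvectors
P = P / np.linalg.norm(P, axis=0)   # force unit-length columns
# (np.linalg.eig already returns unit-length eigenvectors, so this
# normalization is shown only for emphasis.)

print(np.allclose(P.conj().T @ P, np.eye(2)))  # True: P is unitary
[/CODE]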
 
  • #7
Thanks, Office_Shredder
Office_Shredder said:
The unitary matrices that contain only real numbers are orthogonal :)
Indeed, but I am a little confused by the diversity of the definitions of an orthogonal unitary matrix: in some places I see an orthogonal matrix defined as real from the start, whereas elsewhere I just see the condition that the columns have a dot product of 1 with themselves. The matrix
##\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}##
satisfies the latter, but not the former. Or, if the inner product of distinct columns must also be zero, then
##\begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}##
would work. So does the definition of orthogonal require that the matrix be real?

Office_Shredder said:
I guess I agree it doesn't have to be. ##P## can have as columns any eigenvectors, and it will only be unitary if you pick eigenvectors of unit length.
So if I just normalize the columns of P, resulting in P', is P' guaranteed to be unitary?
 
  • #8
nomadreid said:
Thanks, Office_Shredder

Indeed, but I am a little confused by the diversity of the definitions of an orthogonal unitary matrix: in some places I see an orthogonal matrix defined as real from the start, whereas elsewhere I just see the condition that the columns have a dot product of 1 with themselves. The matrix
##\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}##
satisfies the latter, but not the former. Or, if the inner product of distinct columns must also be zero, then
##\begin{pmatrix} 0 & i \\ i & 0 \end{pmatrix}##
would work. So does the definition of orthogonal require that the matrix be real?

I think the thing you're forgetting is that the dot product is the inner product for ##\mathbb{R}^n##, but not for ##\mathbb{C}^n##. Both orthogonal and unitary matrices satisfy that the columns form an orthonormal basis under the standard inner product of that vector space.

Typically a matrix which is referred to as orthogonal is *assumed* to have the field of the vector space be the real numbers, not the complex numbers.
nomadreid said:
So if I just normalize the columns of P, resulting in P', is P' guaranteed to be unitary?

I'm not sure what exactly it means to normalize the columns of P, but I suspect the answer is yes.
 
  • #9
Thanks for your help and your patience, Office_Shredder

Office_Shredder said:
I'm not sure what exactly it means to normalize the columns of P, but I suspect the answer is yes.
I mean to divide each column by its length so that the length of the resulting column vector is 1. For this to work, all of ##P##'s columns and rows (where ##P## is the matrix in the diagonalization ##PMP^{-1}##) would have to have the same length. My question here is whether the assumption of normality is necessary and sufficient to ensure this uniformity.

Office_Shredder said:
I think the thing you're forgetting is that the dot product is the inner product for ##\mathbb{R}^n##, but not for ##\mathbb{C}^n##. Both orthogonal and unitary matrices satisfy that the columns form an orthonormal basis under the standard inner product of that vector space.
Ah, yes, I should not have mentioned the dot product. My example
##\begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}##
works as a matrix that satisfies the basic definition: multiplying it by its Hermitian conjugate gives the identity, the columns and rows have length one, the two columns ##A## and ##B## have inner product ##\langle A|B\rangle = A^*B## [or ##B^*A## if you like] (where ##C^*## is the Hermitian conjugate of ##C##) equal to zero, and the absolute value of the determinant is one... in other words, it checks all the boxes except that it is not real.
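Those boxes can be checked numerically; a quick numpy sketch of the example above:

[CODE]
import numpy as np

U = np.array([[0, -1j],
              [1j, 0]])

print(np.allclose(U.conj().T @ U, np.eye(2)))  # True: unitary
print(abs(np.linalg.det(U)))                   # 1.0
print(np.allclose(U.T @ U, np.eye(2)))         # False: U^T U = -I here, so not real/orthogonal
[/CODE]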

Office_Shredder said:
Typically a matrix which is referred to as orthogonal is *assumed* to have the field of the vector space be the real numbers, not the complex numbers.
If this is not part of the definition of orthogonality, is there a reason (convenience?) for this assumption?
 
  • #10
nomadreid said:
If this is not part of the definition of orthogonality, is there a reason (convenience?) for this assumption?

I think it is part of the definition.
 
  • #11
Office_Shredder said:
I think it is part of the definition.
Ah. Interesting, since orthogonality of vectors does not have this restriction in its definition. I suppose this part of the definition for matrices serves some purpose, or maybe no one ever needed a matrix as in my example to be orthogonal.
 
  • #12
A square matrix over ##\mathbb C## is unitary if and only if its column vectors are pairwise orthonormal. The dot product over ##\mathbb C## is defined as ##\langle x,y \rangle = \sum x_i\overline{y}_i##.
 
  • #13
nuuskur said:
A square matrix over ##\mathbb C## is unitary if and only if its column vectors are pairwise orthonormal. The dot product over ##\mathbb C## is defined as ##\langle x,y \rangle = \sum x_i\overline{y}_i##.

Thanks for the note, nuuskur. Did you perhaps mean "inner product" instead of "dot product"?

The inner product definition you cited, I believe, is the "mathematician's version", whereas the "physicist's version", which is essentially the same, takes the complex conjugate of the first term rather than of the second. Or do I have those reversed?
 
  • #14
Dot product and inner product are synonyms in this context. Inner product is a more general term for a map ## V\times V\to F ## with some extra properties, where ##V## is a vector space over ##F##. In the case of ##\mathbb R^n## and ##\mathbb C^n## one usually says dot product. As far as I know the complex dot product is always regarded the way I pointed out earlier. I haven't checked what happens with ##\langle x,y \rangle = \sum \overline{x}_iy_i ##.
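The two conventions differ only by a complex conjugate of the result; a quick numpy check (vectors made up for illustration):

[CODE]
import numpy as np

x = np.array([1j, 2])
y = np.array([3, 1j])

physicist = np.vdot(x, y)               # sum(conj(x_i) * y_i)
mathematician = np.sum(x * np.conj(y))  # sum(x_i * conj(y_i))

print(physicist, mathematician)
print(np.isclose(physicist, np.conj(mathematician)))  # True
[/CODE]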
 
  • #15
nuuskur said:
I haven't checked what happens with ##\langle x,y \rangle = \sum \overline{x}_iy_i##.
Small typo: the line goes over the ##x_i##.
 

1. What is the purpose of using the Gram-Schmidt process in creating a unitary matrix?

The Gram-Schmidt process is used to orthogonalize a set of vectors in an inner product space, ensuring that they are mutually orthogonal. This process is crucial in creating a unitary matrix because a unitary matrix requires that its columns (or rows) are orthonormal vectors. The orthonormality ensures that the matrix preserves the length of vectors upon transformation, which is a key property in many applications including quantum mechanics and numerical analysis.

2. How does the Gram-Schmidt process work in the context of matrix operations?

In matrix operations, the Gram-Schmidt process takes a set of linearly independent vectors (the columns of a matrix) and orthogonalizes them. Starting with the first vector, each subsequent vector is adjusted by removing its projection onto the already orthogonalized vectors. This process continues until all vectors are orthogonal. Each vector is then normalized to have unit length. The resulting vectors form the columns of the unitary matrix.
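A minimal implementation sketch in numpy (assuming linearly independent columns; the helper name gram_schmidt_unitary is made up for illustration):

[CODE]
import numpy as np

def gram_schmidt_unitary(A):
    """Orthonormalize the columns of A with classical Gram-Schmidt.
    For a square A with linearly independent columns, the result is unitary."""
    A = np.asarray(A, dtype=complex)
    n, k = A.shape
    Q = np.zeros((n, k), dtype=complex)
    for j in range(k):
        v = A[:, j].copy()
        for i in range(j):
            # Remove the projection onto each already-built column;
            # np.vdot conjugates its first argument.
            v -= np.vdot(Q[:, i], A[:, j]) * Q[:, i]
        Q[:, j] = v / np.linalg.norm(v)   # normalize to unit length
    return Q

A = np.array([[1, 1],
              [1j, 1]])
Q = gram_schmidt_unitary(A)
print(np.allclose(Q.conj().T @ Q, np.eye(2)))  # True
[/CODE]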

3. What are the advantages of using a unitary matrix in computations?

Unitary matrices are extremely beneficial in computational mathematics and physics due to their properties of preserving vector norms and angles. This makes them ideal for stability in numerical algorithms, reducing errors in computations involving complex transformations. Additionally, unitary matrices are used in quantum mechanics where they describe systems that evolve in time without losing information. They are also pivotal in algorithms like the QR algorithm for eigenvalue computation.

4. Are there any limitations or challenges associated with the Gram-Schmidt process?

One of the main challenges with the Gram-Schmidt process is its numerical instability in certain situations, particularly when dealing with sets of vectors that are nearly linearly dependent. Small errors in computation can be greatly amplified, leading to inaccurate results. To mitigate these issues, modified versions of the Gram-Schmidt process, such as the Modified Gram-Schmidt process, are often used as they tend to be more numerically stable.
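A sketch of the modified variant (same idea as the classical version above, differing only in the order of the projections):

[CODE]
import numpy as np

def modified_gram_schmidt(A):
    """Modified Gram-Schmidt: as soon as a column is normalized, its
    component is removed from all LATER columns, which propagates
    rounding errors less than the classical version."""
    Q = np.array(A, dtype=complex)
    n, k = Q.shape
    for j in range(k):
        Q[:, j] /= np.linalg.norm(Q[:, j])
        for i in range(j + 1, k):
            Q[:, i] -= np.vdot(Q[:, j], Q[:, i]) * Q[:, j]
    return Q

rng = np.random.default_rng(0)
A = rng.normal(size=(4, 4)) + 1j * rng.normal(size=(4, 4))
Q = modified_gram_schmidt(A)
print(np.allclose(Q.conj().T @ Q, np.eye(4)))  # True
[/CODE]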

5. Can the Gram-Schmidt process be applied to any set of vectors?

The Gram-Schmidt process can be applied to any set of linearly independent vectors. If the vectors are linearly dependent, the process will fail at some point because it will encounter a vector that can be expressed as a linear combination of the previously orthogonalized vectors, leading to a zero vector upon orthogonalization. Therefore, it is crucial to start with a set of linearly independent vectors when using the Gram-Schmidt process to form a unitary matrix.
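To see the failure mode concretely (a made-up pair of dependent columns):

[CODE]
import numpy as np

# The second column is twice the first, so the columns are dependent.
A = np.array([[1, 2],
              [1, 2]], dtype=complex)

q0 = A[:, 0] / np.linalg.norm(A[:, 0])   # first orthonormal vector
v = A[:, 1] - np.vdot(q0, A[:, 1]) * q0  # remove the projection
print(np.linalg.norm(v))                 # ~0: the next normalization would divide by zero
[/CODE]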
