Proving the Gradient of f(x) in Matrix Notation

oxxiissiixxo · Oct 11, 2008

Homework Statement

f(x)=(1/2)*(x^T)*(A)*(x)-(x^T)*(b)

Show that the gradient of f(x) is (1/2)*[((A^T)+A)*x]-(b)

where x^transpose is transpose of x and A^transpose is transpose of A.

Note: A is real matrix n*n and b is a column matrix n

Homework Equations

The Attempt at a Solution

I need to kinda proof that.

Dick · Oct 11, 2008

Write it in index notation. (1/2)*x_i*A_ij*x_j+x_i*b_i. All indices summed. Now dx_i/dx_k=delta(i,k). You kinda knew that, right? Use the product rule on the A part.

oxxiissiixxo · Oct 11, 2008

Dick said:

Write it in index notation. (1/2)*x_i*A_ij*x_j+x_i*b_i. All indices summed. Now dx_i/dx_k=delta(i,k). You kinda knew that, right? Use the product rule on the A part.

All indices summed. Now dx_i/dx_k=delta(i,k).

what do you mean by that?

Dick · Oct 11, 2008

Can you write it in index notation? By dx_i/dx_k=delta(i,k) I mean the derivative is 1 if i=k and 0 if i is not equal k.

oxxiissiixxo · Oct 11, 2008

I think that I may be able to make the function into index notation form.
If I make the equation to be (1/2)*x_i*A_ij*x_j+x_i*b_i, I then take the derivative respect with x_k? How this will work?
If k=i, after taking the derivative the equation will become (1/2)*1*A_ij*x_j+1*b_i. What I will have to do for the A_ij? I have not learned how to do the product rule of a matrix yet.
Will k=i and j so that I will have to do 2 derivative of i and j? if so how this works?

Dick · Oct 11, 2008

You don't have a matrix anymore once you indexed everything out. There are two x's on the first term one on the right and one on the left which is why you are getting A+A^T.

oxxiissiixxo · Oct 12, 2008

would you mind to show me the steps to get from f(x)=(1/2)*(x^T)*(A)*(x)-(x^T)*(b) to (1/2)*[((A^T)+A)*x]-(b).

I did a lot of research about this problem these days. I found a relation: D[f(x)^Tg(x)] = g(x)^Tf'(x) + f(x)^Tg'(x). however i am not convinced. I am sorry this is the first time i do this kinda proof. I can't handle it.

Dick · Oct 12, 2008

I can't do that until you at least make a try at the problem. I've told you what to do. The gradient of f(x) is the vector (df(x)/dx_1,df(x)/dx_2,...df(x)/dx_n). Write out the expression as a summation over indices and take d/dx_k of it.

oxxiissiixxo · Oct 12, 2008

hi dick thankz i think that i got it YEAH! XD

I have another question if i want to find all the matrices for X that satisfies this equation A*X*B^T = C. How should I deal with this problem? should i also make that into index notation?

B^T is the transpose of B.

Dick · Oct 12, 2008

Are A or B invertible?

oxxiissiixxo · Oct 12, 2008

It doesn't really say. This is the original question:
Find all matrices X that satisfy the equation A*X*B^T = C, in terms of the LU
factorizations of A and B. State the precise conditions under which there are no
solutions.

Dick · Oct 12, 2008

I'm really not an expert on LU factorization stuff. I think you should post this in a new thread so other people can take a crack at it.

oxxiissiixxo · Oct 12, 2008

Thanks Man anyway. You are my big help.

Proving the Gradient of f(x) in Matrix Notation

Homework Statement

Homework Equations

The Attempt at a Solution

Related to Proving the Gradient of f(x) in Matrix Notation

What is the gradient of a function in matrix notation?

Why is it important to prove the gradient of a function in matrix notation?

What are the steps involved in proving the gradient of a function in matrix notation?

Can the gradient of a function in matrix notation be used to find the direction of maximum increase?

Are there any limitations to proving the gradient of a function in matrix notation?

Similar threads

Hot Threads

Recent Insights