Differentiation on banach spaces

In summary: CliowaIn summary, Cliowa was trying to derive the Leibniz (Product) Rule using the chain rule, but was having trouble. He found a mistake in the expression and was able to solve it using the tensor product of spaces.
  • #1
cliowa
191
0
Let [tex]L(A;B)[/tex] be the space of linear maps [tex]l:A\rightarrow B[/tex].

My goal is to derive the Leibniz (Product) Rule using the chain rule. Let [tex]f_i:U\subset E\rightarrow F_i, i=1,2[/tex] be differentiable maps and let [tex]B\in L(F_1,F_2;G)[/tex]. Then the mapping [tex]B(f_1,f_2)=B\circ (f_1\times f_2):U\subset E\rightarrow G[/tex] is differentiable.
Using the chain rule I find for [tex]u\in U[/tex]: [tex]D \{B\circ (f_1\times f_2)\}(u)=DB \{(f_1\times f_2)(u)\}\circ D(f_1\times f_2)(u)=B\circ D(f_1\times f_2)(u)\in L(E;G)[/tex], as B is a linear map. I could now go on and take the derivative of the cartesian product: [tex]...=B\circ(Df_1\times Df_2)(u)[/tex], but I feel it's all wrong. To me, there's no product rule in sight. Could you point out my error(s)?
Thanks alot. Best regards...Cliowa
 
Last edited:
Physics news on Phys.org
  • #2
What do you mean by [itex]U \subset E[/itex]?

I'm going to assume that by [itex]f_1 \times f_2[/itex], you mean the function defined by:

[tex](f_1 \times f_2)(a) = (f_1(a), f_2(a))[/tex]

(IMHO [itex](f_1, f_2)[/itex] is a better notation for this, and using [itex]f_1 \times f_2[/itex] to mean the map [itex](f_1 \times f_2)(a, b) = (f_1(a), f_2(b))[/itex])


There's a problem in this expression:

[tex]DB \{(f_1\times f_2)(u)\}\circ D(f_1\times f_2)(u)[/tex]

I can't figure out any way to parse this expression so that the composition operator makes sense.
 
  • #3
Hurkyl said:
What do you mean by [itex]U \subset E[/itex]?
U subset of E, I should have written that I meant an open subset U of E. That's simply in order to avoid trouble with closedness, bounds.

Hurkyl said:
I'm going to assume that by [itex]f_1 \times f_2[/itex], you mean the function defined by:

[tex](f_1 \times f_2)(a) = (f_1(a), f_2(a))[/tex]

(IMHO [itex](f_1, f_2)[/itex] is a better notation for this, and using [itex]f_1 \times f_2[/itex] to mean the map [itex](f_1 \times f_2)(a, b) = (f_1(a), f_2(b))[/itex])
I'm sorry, I thought that was the usual notation, but of course I meant the same thing as you wrote: the cartesian product.

Hurkyl said:
There's a problem in this expression:

[tex]DB \{(f_1\times f_2)(u)\}\circ D(f_1\times f_2)(u)[/tex]

I can't figure out any way to parse this expression so that the composition operator makes sense.

As [tex]B\in L(F_1,F_2;G)[/tex] it follows that [tex]DB(v,w)=B, \forall (v,w)\in F_1\times F_2[/tex], right?
As [tex](f_1\times f_2)=(f_1,f_2):U\subset E\rightarrow F_1\times F_2[/tex] i get [tex]D(f_1\times f_2)=D(f_1,f_2):E\rightarrow L(E;F_1,F_2)[/tex] so that the composition is perfectly fine:
[tex]L(F_1,F_2;G)\circ L(E;F_1,F_2) "="L(E;G)[/tex]. Please do correct me if I'm wrong.

If this stuff seems strange to you, how would you do it?
 
  • #4
cliowa said:
U subset of E, I should have written that I meant an open subset U of E. That's simply in order to avoid trouble with closedness, bounds.
I mainly found it confusing where it was being used. I figured that's what you meant, it just looked odd and wanted to make sure.

I'm sorry, I thought that was the usual notation, but of course I meant the same thing as you wrote: the cartesian product.
It might be in this particular subject -- I'm just showing my bias towards category theory. :smile:

so that the composition is perfectly fine:
You're right there, my mistake.


Oh, I see the problem. If B is something resembling multiplication, then it wouldn't be a linear map [itex]F_1 \times F_2 \rightarrow G[/itex], but instead a bilinear map.

It helps to look at special cases -- here, I was thinking about ordinary calculus. Here, we'd want something like B(a, b) = ab to recover the product rule, and that is clearly not linear on R².

If you want to consider linear maps instead of bilinear maps, you're going to have to use the tensor product of your spaces.
 
  • #5
Oh, I see the problem. If B is something resembling multiplication, then it wouldn't be a linear map [itex]F_1 \times F_2 \rightarrow G[/itex], but instead a bilinear map.

Hurkyl said:
It helps to look at special cases -- here, I was thinking about ordinary calculus. Here, we'd want something like B(a, b) = ab to recover the product rule, and that is clearly not linear on R².
That one is clear to me, that's also how I figured it would work.

Hurkyl said:
Oh, I see the problem. If B is something resembling multiplication, then it wouldn't be a linear map [itex]F_1 \times F_2 \rightarrow G[/itex], but instead a bilinear map.
Hurkyl said:
If you want to consider linear maps instead of bilinear maps, you're going to have to use the tensor product of your spaces.

I read all this in a book by J. Marsden and R. Abraham called "Manifolds, Tensor Analysis, and Applications", available at https://www.amazon.com/gp/product/0387967907/?tag=pfamazon01-20.
They introduce topology, spaces, then derivatives. I attached a picture with some extracts of their claims, so you can have a look at the important parts. However they are not talking about bilinear maps, nor have they (yet) introduced tensor products, so there must be some other way of doing it. But I can't see how!
(You should also have a look at the example they're doing, as this is how they apply their theory.)
Thanks a lot and best regards...Cliowa
 

Attachments

  • marsden.manifolds.jpeg
    marsden.manifolds.jpeg
    34.1 KB · Views: 535
  • example.of.usage.jpeg
    example.of.usage.jpeg
    20 KB · Views: 528
Last edited by a moderator:
  • #6
i have explained all this in an earlier response to a question about product rules and derivatives of vector valued functions.

anyway your whole setup is flawed foundationally. the derivative of a map in a banach space is not just a linear map but a continuous linear map.
 
  • #7
mathwonk said:
anyway your whole setup is flawed foundationally. the derivative of a map in a banach space is not just a linear map but a continuous linear map.
I feel I never claimed that, but let me get this straight: If [itex]f:E\rightarrow F[/itex], where E, F are Banach spaces, then the derivative function [itex]Df:E\rightarrow L(E,F)[/itex] must not be continuous, right? However, the differential at a point [itex]e\in E[/itex] is [itex]Df(e)\in L(E,F)[/itex] and therefore is continuous. Or what did you mean, by "foundationally flawed"?

mathwonk said:
i have explained all this in an earlier response to a question about product rules and derivatives of vector valued functions.
I guess you're referring to the thread https://www.physicsforums.com/showthread.php?t=124010". I have read you're response there, but I still have some questions: I understand perfectly fine your example for the bilinear map "multiplication in R". However, I can't quite follow when you proceed to bilinear maps in general: The proof you presented for multiplication makes use of the multiplication, how can one generalize then?

In a different thread, https://www.physicsforums.com/showthread.php?t=67268" you wrote:
mathwonk said:
I.e. a differential is a linear function on the tangent space. Since the tangent space to a curve is one dimensional, the space of linear functions is also one dimensional.

thus any two linear functionals are scalar multiples of each other, so their quotient is a scalar. this is not true for differentials on higher dimensional tangent spaces.

I cannot explain why this pont of view is prohibited in elementary calculus. perhaps they do not wish to do the work necessary to justify it.
As I am traditionally taught (and therefore quite confused): Could you explain this point of view a little more?
Thanks a jolly lot in advance.
Best regards...Cliowa
@mathwonk: BTW, thanks a lot for your lengthy and detailled explanations and valuable links in the de thread. That was very useful.
 
Last edited by a moderator:
  • #8
cliowa said:
However they are not talking about bilinear maps, nor have they (yet) introduced tensor products, so there must be some other way of doing it. But I can't see how!
It just struck me what might work -- you can "factor" the bilinear map into two linear maps.

If B is bilinear, then notice for each a that the map defined by:

[tex]B_a(b) := B(a, b)[/tex]

is a linear map. Also, the map [itex]a \rightarrow B_a[/itex] is linear. Can you work with this?
 
  • #9
Hurkyl said:
It just struck me what might work -- you can "factor" the bilinear map into two linear maps.

If B is bilinear, then notice for each a that the map defined by:

[tex]B_a(b) := B(a, b)[/tex]

is a linear map. Also, the map [itex]a \rightarrow B_a[/itex] is linear. Can you work with this?
Oh yes, that's a good idea. Basically this is using the natural isomorphism [itex]L(F_1,F_2;G)\simeq L(F_1;L(F_2;G))[/itex]. But how would you work that out in detail (the idea is perfectly clear, but how do I write this whole thing down nicely?)?
Best regards...Cliowa
 
  • #10
I am saying that, unles the years have clouded my miknd, that even a linear map is not differentiable unkless iot is continuous.

linearmaps are not always continuous on banach spaces. my apologies if this was not an issue for you.

similarly bilinear maps are not differentiable unless they are continuous and this is not always the case.
 
Last edited:
  • #11
mathwonk said:
I am saying that, unles the years have clouded my miknd, that even a linear map is not differentiable unkless iot is continuous.

linearmaps are not always continuous on banach spaces. my apologies if this was not an issue for you.

Thanks for pointing that out. I should have mentioned this, but it never seemed like a problem to me, because in my mind I was working in finite dimensions.
Could you give an example of a linear map between banach spaces which is not continuous?
I still don't get how you transfer the derivative stuff from linear to bilinear forms: Is it so simple you don't even think it's worth mentioning?
Best regards...Cliowa
 
  • #12
Your goal is impossible. You cannot derive the product rule using just the chain rule. You must exploit the fact that the derivative evaluated at a point is a linear transformation.

Proof: define the "fake derivative operator" Q by:

Q{f}(a) = the constant function whose value is f(a)

This clearly satisfies the chain rule, and Q is even a linear map! But Q does not satisfy the product rule.


Therefore, it is impossible to derive the product rule using only the chain rule.
 
  • #13
Hurkyl said:
Your goal is impossible. You cannot derive the product rule using just the chain rule. You must exploit the fact that the derivative evaluated at a point is a linear transformation.
That's alright, fine with me. I am willing to exploit this fact. I have seen how the product rule then follows for the bilinear map of multiplication.

But if you have a look at the excerpts from the book I mentioned you'll see that what they get is the Leibniz Rule in a general setting, i.e. where B is not necessarily multiplication but just some bilinear map. Now, it is this, that I do not understand. I don't know how to take this derivative:
[tex]D \{B\circ (f_1\times f_2)\}(u)[/tex]

In my first post I wrote down a few steps which don't lead to the right thing, as we have discussed. What I should find (if the book is correct here), is that
[tex]D \{B\circ (f_1\times f_2)\}(u)e=B(Df_1(u)e,f_2(u))+B(f_1(u),Df_2(u)e)[/tex]. How do I find that?

(P.S.: Your idea about splitting the bilinear map into to linear ones seems good to me, but I can't write it down, I can't formalize it. How would you do that?)
 
  • #14
It might help to stop thinking about [itex]f_1[/itex] and [itex]f_2[/itex], and just look at the derivative of B. What you need to show is that:

DB(a, b) . (c, d) = B(c, b) + B(a, d)

(where I use the notation `S.v' to signify the application of the linear transformation S to v, so that the expressions are easier for me to parse)

That's alright, fine with me. I am willing to exploit this fact. I have seen how the product rule then follows for the bilinear map of multiplication.
You try mimicing the proof, but replacing multiplication with an arbitrary bilinear form?
 
Last edited:
  • #15
Hurkyl said:
It might help to stop thinking about [itex]f_1[/itex] and [itex]f_2[/itex], and just look at the derivative of B. What you need to show is that:

DB(a, b) . (c, d) = B(c, b) + B(a, d)

(where I use the notation `S.v' to signify the application of the linear transformation S to v, so that the expressions are easier for me to parse)


You try mimicing the proof, but replacing multiplication with an arbitrary bilinear form?
Oh geez, I must have been blind! But how the hell could I be so... nevermind. Thankfully this spiritual nightmare has come to an end. Thanks alot.
Is there a second way of doing it? Does this factoring the bilinear maps into two linear ones lead anywhere?
humble regards...Cliowa
 
  • #16
Cliowa said:
Does this factoring the bilinear maps into two linear ones lead anywhere?
Probably. But to get there, I think you need to find the derivative of the evaluation map:

S x L(S, T) --> T

or maybe the composition map

L(U, V) x L(V, W) --> L(U, W)

and I didn't see an easy way to find that derivative.
 
  • #17
Hurkyl said:
and I didn't see an easy way to find that derivative.
as you already guessed: It's the same thing for me...
 

Related to Differentiation on banach spaces

1. What is differentiation on Banach spaces?

Differentiation on Banach spaces is a mathematical concept that involves calculating the rate of change of a function on a Banach space. It is similar to traditional differentiation in calculus, but takes into account the unique properties of Banach spaces, such as completeness and normed structure.

2. Why is differentiation on Banach spaces important?

Differentiation on Banach spaces is important because it allows us to study the behavior of functions on these spaces and make predictions about their rate of change. This is useful in many areas of mathematics and physics, such as in optimization problems and in the study of differential equations.

3. What are some common methods used for differentiation on Banach spaces?

Some common methods used for differentiation on Banach spaces include the Gateaux derivative, the Fréchet derivative, and the Lebesgue derivative. These methods take into account the specific structure of Banach spaces and allow for the calculation of derivatives in a rigorous and efficient manner.

4. Can differentiation on Banach spaces be extended to infinite-dimensional spaces?

Yes, differentiation on Banach spaces can be extended to infinite-dimensional spaces, such as Hilbert spaces and function spaces. This allows for the study of more complex functions and systems that cannot be represented in finite-dimensional spaces.

5. What are some applications of differentiation on Banach spaces?

Differentiation on Banach spaces has many applications in various fields of mathematics, including functional analysis, optimization, and partial differential equations. It is also used in physics and engineering for modeling and predicting the behavior of systems and processes.

Similar threads

Replies
3
Views
1K
Replies
1
Views
400
Replies
1
Views
1K
  • Differential Geometry
Replies
2
Views
685
  • Calculus and Beyond Homework Help
Replies
0
Views
490
  • Calculus
Replies
3
Views
1K
  • Calculus
Replies
9
Views
2K
Replies
9
Views
2K
Replies
4
Views
915
  • Calculus and Beyond Homework Help
Replies
1
Views
571
Back
Top