Lorentz transformation derivation. What exactly is wrong?

epovo · Jun 21, 2013

This is probably a stupid mistake I am making, but I can't figure it out. My apologies in advance...
I am familiar with the text-book derivation of the Lorentz transformation (I don't have any problem with it). It starts out stating:

x²+y²+z²-c²t² = x'² + y'²+z'²-c²t'²

meaning that a sphere of light radiating from the point where both coordinates coincide should have the same radius. Also, the assumption is made that x' and t' can be expressed as a linear combination of x and t, (x'=a₁x+a₂t and t'=b₁x + b₂t )while y=y' and z=z'. Doing some boring algebraic manipulation, a₁, a₂ ,b₁ and b₂ are found.

So I thought: why bother with y and z coordinates since they are the same?
So let's concentrate on events happening along the x and x' axis. I don't need the sphere, I just need to consider the ray of light along the axis and write instead:

x-ct =x'-ct'

But that is obviously different from

x²-c²t² = x'² - c²t'²

So of course it does not leave anywhere. My naive question is then: where's the flaw in my reasoning?

Bill_K · Jun 21, 2013

You need to consider both directions: x - ct = x' - ct' and x + ct = x' + ct'.

Multiply them together and you get (x - ct)(x + ct) = (x' - ct')(x' + ct') or x² - c²t² = x'² - c²t'².

epovo · Jun 21, 2013

Yes, but why does it all go wrong if I restrict my analysis to a light beam moving to the right? I'm confused :(

Bill_K · Jun 21, 2013

epovo said:

Yes, but why does it all go wrong if I restrict my analysis to a light beam moving to the right? I'm confused :(

You started out with a three-dimensional sphere enclosing the origin. If you restrict that to two dimensions, you get a two-dimensional "sphere" (a circle) enclosing the origin.

If you further restrict that to one dimension, you get a one-dimensional sphere.

The "sphere in one dimension" still encloses the origin, but it is disconnected - it consists of two points: one point lying along the positive and one point lying along the negative axis. You need to keep both of them, i.e. consider waves traveling in both directions.

If you fail to consider both, you really do get transformations more general than the Lorentz transformation. The result will not be reflection symmetric (x → -x) which is also an essential part of the Lorentz group.

epovo · Jun 21, 2013

Thank you for your response. I have done the math and I get the same transformation but with the idiotic value for γ:

γ = 1 / (1+v/c)

I still don't understand why. I just started with the assumption: "let there be a light ray moving along the x-axis in the positive direction". Then I consider that each of the observers see the ray moving at the same speed c, and everything follows from there...

Dale · Jun 21, 2013

Unless you post your derivation it will just be a guessing game.

epovo · Jun 21, 2013

Okay, there it goes:

I start with

x' = a₁x + a₂t [1]
t' = b₁x + b₂t [2]

but x'=0 is moving at speed v for the unprimed system. Substituting in [1]:

0 = a₁x + a₂t ∴ v = -a₂/a₁ [3]

Substituting into [1]:

x'=a₁ (x - vt) [4]

and x=0 is moving at speed -v for the primed system. Dividing [1] and [2] for x=0:

x'/t' = a₂/a₂ = -v

and because of [3] we get

a₁ = b₂ [5]

A photon moving along the positive x-axis will be measured at speed c for both:

x' - ct' = x - ct [6]

Substituting [4] and [5] into [6]

a₁ (x - vt) + c (b₁x + a₁)t = x - ct

Rearranging:

(a₁ - c b₁) x - (a₁v + a₁c)t = x - ct

This is an identity which holds true for every x, t so we can equate the coefficients:

a₁ - c b₁ = 1
a₁v + a₁c = c

So

a₁ = 1/(1+v/c) = γ
a₂ = -vγ
b₁ = γ v/c²
b₂ = γ

Which is the same transformation with the absurd value for γ:

γ = 1/(1+v/c)

Bill_K · Jun 21, 2013

Your Eq (6) is false. For a Lorentz transformation it is NOT true that x - ct = x' - ct'. It is only true that (for a light ray) x - ct = 0 if and only if x' - ct' = 0. In fact, one is a constant times the other: x - ct = (k)(x' -ct'). At the same time, using the relationship for left-going light rays we can conclude that (x + ct) = (1/k)(x' + ct'). Multiplying the two together, the k's cancel, and (x² - c²t²) = (x'² - c²t'²)

epovo · Jun 21, 2013

My Eq (6) was inspired in the same thought that is used in the text-book derivation, which imagines light being emitted at x=x'=t=t'=0. It is reasoned that both observers can express the fact that the light moves away at speed c in their own coordinates:

x² + y² + z² - c²t² = 0

and

x'² + y'² + z'² - c²t'² = 0

and therefore the derivation starts with:

x² + y² + z² - c²t² = x'² + y'² + z'² - c²t'²

I thought I was saying the same thing with my Eq (6), the difference being that it was simpler because my light ray moved along the x axis. We're both saying that each observer sees the light moving away at speed c.

What I don't quite get is why in the textbook derivation the concept expressed in the above equation is valid and my concept in [6] is not... I don't understand your k, because you say it comes from the Lorentz transformation. But the Lorentz transformation is precisely what I am trying to derive from first principles, so how can it be used as an argument in the derivation itself?

Bill_K · Jun 21, 2013

epovo said:

I don't understand your k, because you say it comes from the Lorentz transformation. But the Lorentz transformation is precisely what I am trying to derive from first principles, so how can it be used as an argument in the derivation itself?

You can't use it as an argument in the derivation, but you must not throw it away! Throwing it away (i.e.setting it equal to 1) is unjustified, and guaranteed to get you the wrong answer.

I just gave it a name k to avoid writing out its value, but it's easy to understand what it represents physically -

x - ct = (k)(x' - ct')

it's just the Doppler shift factor. And you can easily calculate its value from the Lorentz transformation:

x = γ(x' - vt')
t = γ(t' - vx'/c²)
implies
x - ct = γ(1 + v/c))(x' - ct') so k = γ(1 + v/c)

As I said, for the left-moving rays you get
x + ct = γ(1 - v/c)(x' + ct') and this time the factor is γ(1 - v/c), which happens to equal 1/k.

A more sophisticated way of understanding this is, if you define two null vectors v₁ and v₂ with components
v₁ = (1, 1, 0, 0)
v₂ = (1, -1, 0, 0)
they are the propagation vectors of the left- and right-going light rays, and they are eigenvectors of the Lorentz transformation, with eigenvalues k and 1/k. Under a Lorentz transformation v₁ is stretched by a factor k, while v₂ is reduced by the same factor.

pervect · Jun 21, 2013

As someone here pointed out to me a while ago, another way about thinking about 'k' is that light rays are the eigenvectors of the Lorentz transform, and k, the doppler shift, is the corresponding eigenvalue.

Thus null vectors, or light rays (x,t) must, by the Lorentz transform be mapped to other null vectors (x',t'). They don't have to be the same null vector, though, they can (and generally do) differ by a multiplicative constant. This is k, the dopper factor.

Because the Lorentz transformation is linear, other sorts of mapping other than the linear one (multiplying by k) aren't possible.

This may or may not help the OP, but I thought I'd mention it.

epovo · Jun 22, 2013

Thank you for your patience. I can see it clearly now. More to myself than to anyone else, I would explain it like this:
When the textbook considers the sphere of light, each observer can write:

x² + y² + z² - c²t² = 0
x'² + y'² + z'² - c²t'² = 0

These equations are both right for the photons moving away. However, when we write:

x² + y² + z² - c²t² = x'² + y'² + z'² - c²t'²

what we are doing is proposing that this will hold for every event, not just for that ray of light. We are proposing an invariant.

When I write for my right-moving light ray:

x - ct = 0 ; x' - ct'=0

That's fine, but when I say:

x' - ct' = x - ct

I am proposing x - ct as an invariant, which of course it's not. It does not even hold for the left-moving photon, as you said, let alone an arbitrary event!

This illustrates what the Lorentz transformation derivation really means. The derivation states a number of reasonable hypothesis, but it's not a derivation in a mathematical sense. There is no a-priori justification for introducing the invariant, only that the result is consistent and works. Just as assuming that x and t are a linear combination of x' and t' and that the y and z coordinates are not affected by the motion along x.

Fredrik · Jun 22, 2013

epovo said:

This illustrates what the Lorentz transformation derivation really means. The derivation states a number of reasonable hypothesis, but it's not a derivation in a mathematical sense.

I like to think of "derivations" of the Lorentz transformation as ways to find a theory in which there are theorems that can be thought of as mathematically precise statements of the loosely stated ideas we began with.

Lorentz transformation derivation. What exactly is wrong?

Related to Lorentz transformation derivation. What exactly is wrong?

1. What is a Lorentz transformation?

2. How does one derive the Lorentz transformation?

3. Why is the Lorentz transformation important?

4. What are some common misconceptions about the Lorentz transformation?

5. Are there any limitations to the Lorentz transformation?

Similar threads

Hot Threads

Recent Insights