Subtraction of normal distributed stochastic variables

In summary, the conversation discusses the concept of adding and subtracting stochastic variables and their associated means and standard deviations. It is explained that when the variables are independent, the means add or subtract, but the variances always add. The conversation also explores an example of washing and drying clothes and how the variability in the elapsed time can increase due to the separate processes being performed in sequence. The speaker still has some questions about proving the increase in variance in certain cases.
  • #1
Lobotomy
58
0
hello
if we have set of stochastic variables representing the random time it takes to do something: X,Y,Z,W and C where C is the sum of X Y Z W, thus the time it takes to do these things in sequence. If:
X: N(30,5)
Y: N(30,3)
Z: N(20,2)
W: N(40,7)

makes C adding these together right, mean plus mean and std dev + std dev?
C: N(30+30+20+40,5+3+2+7)=N(120,17) is this correct?

subtraction is the same, assuming we want to subtract W from C, naming it S we get:
S: N(120-40,17-7)=N(80,10)

is this also correct?

if yes, please link to a reliable source, i have googled but not found the proof other than the proof for addition.
would these operations also work for lognormally distributions?edit: it seems like, in the case when the variables are independent, that the std deviation is ADDED also when subtracting. i don't understand this, take this example
you have a washing machine and a drying machine, the time they take for doing a batch can be represented by a stochastic variable that are normally distributed and independent of each other.
time it takes to wash your clothes Normaldist 45min std dev 5min
time it takes to dry the clothes Normaldist 50min std dev 10min

time it takes to wash and dry the clothes N(45+50,5+10)

then if we want to subtract the time it takes to dry the clothes again we get N(45+50-50,5+10+5)=
N(45,20) this does not make sense, why would it suddenly be more variation when washing the clothes? we use the same washing machine as before. The thing here is that washing and drying are separate processes in a sequence and they are not mixed together.
 
Last edited:
Physics news on Phys.org
  • #2
When the variables are independent, the means add or subtract, just as the variables. However, when it comes to variability, [tex] the variances [/tex] always add.

If [itex] \sigma_X = 50, \quad \sigma_Y = 100 [/itex]

then

[tex]
\begin{align*}
Var(X + Y) & = 50^2 + 100^2 \\
Var(X - Y) & = 50^2 + 100^2
\end{align*}
[/tex]

Think about the contradiction you'd have if variances subtracted (as, admittedly, first exposure makes it seem they should). If they subtracted, what would you get for the variance of [itex] X - Y [/itex] in my example?
 
  • #3
statdad said:
When the variables are independent, the means add or subtract, just as the variables. However, when it comes to variability, [tex] the variances [/tex] always add.

If [itex] \sigma_X = 50, \quad \sigma_Y = 100 [/itex]

then

[tex]
\begin{align*}
Var(X + Y) & = 50^2 + 100^2 \\
Var(X - Y) & = 50^2 + 100^2
\end{align*}
[/tex]

Think about the contradiction you'd have if variances subtracted (as, admittedly, first exposure makes it seem they should). If they subtracted, what would you get for the variance of [itex] X - Y [/itex] in my example?


thanks i know this, but how do you explain my washing machine example then. how come that the washing machine suddenly have an increased variance?
 
  • #4
"thanks I know this."

Apparently not.
"it seems like, in the case when the variables are independent, that the std deviation is ADDED also when subtracting."
It is not the standard deviation that adds, it is the variance. They are not the same.

You repeat the error here.
"i don't understand this, take this example
you have a washing machine and a drying machine, the time they take for doing a batch can be represented by a stochastic variable that are normally distributed and independent of each other.
time it takes to wash your clothes Normaldist 45min std dev 5min
time it takes to dry the clothes Normaldist 50min std dev 10min

time it takes to wash and dry the clothes N(45+50,5+10)"

where you again add the standard deviations - incorrect. The time to wash and dry clothes would be

[tex]
N(40 + 50, \sqrt{5^2 + 10^2})
[/tex]

Finally, here
"then if we want to subtract the time it takes to dry the clothes again we get N(45+50-50,5+10+5)=
N(45,20) this does not make sense, why would it suddenly be more variation when washing the clothes? we use the same washing machine as before. The thing here is that washing and drying are separate processes in a sequence and they are not mixed together."

you do it again. The time would be

[tex]
N(40 + 50 - 50, \sqrt{5^2 + 10^2 + 5^2})
[/tex]

why does the variability increase? Your algebraic act of addition can't be viewed as purely mathematics: it would represent some other operation done to the clothes after the initial washing and drying: you aren't subtracting the physical act of drying. With another activity, the variability in elapsed time will increase.
 
  • #5
statdad said:
"thanks I know this."

Apparently not.
"it seems like, in the case when the variables are independent, that the std deviation is ADDED also when subtracting."
It is not the standard deviation that adds, it is the variance. They are not the same.

You repeat the error here.
"i don't understand this, take this example
you have a washing machine and a drying machine, the time they take for doing a batch can be represented by a stochastic variable that are normally distributed and independent of each other.
time it takes to wash your clothes Normaldist 45min std dev 5min
time it takes to dry the clothes Normaldist 50min std dev 10min

time it takes to wash and dry the clothes N(45+50,5+10)"

where you again add the standard deviations - incorrect. The time to wash and dry clothes would be

[tex]
N(40 + 50, \sqrt{5^2 + 10^2})
[/tex]

Finally, here
"then if we want to subtract the time it takes to dry the clothes again we get N(45+50-50,5+10+5)=
N(45,20) this does not make sense, why would it suddenly be more variation when washing the clothes? we use the same washing machine as before. The thing here is that washing and drying are separate processes in a sequence and they are not mixed together."

you do it again. The time would be

[tex]
N(40 + 50 - 50, \sqrt{5^2 + 10^2 + 5^2})
[/tex]

why does the variability increase? Your algebraic act of addition can't be viewed as purely mathematics: it would represent some other operation done to the clothes after the initial washing and drying: you aren't subtracting the physical act of drying. With another activity, the variability in elapsed time will increase.
ok thanks i understand a bit more. so arent there a way to calculate the "subtraction of the physical act" ?

the reason I am asking is related to the washing machine example.

im working in a factory and we have separate processes which each require a certain amount of time (similar to washing and drying machine). between the processes there is random waiting time

therefore the total production time is equal to the sum of each of the production times + the waiting times in between. all of these times I see as stochastic variables.

Now what i want to do is to remove the waiting times and physically removing some unnecessary processes through process integration to shorten lead time, but i also intuitively thought that the total time variation would decrease.

so what should i tell my boss. that removing waiting time and unnecessary processes will reduce the total lead time, but it will increase the variation in total lead time? is there a difference between removing the physical processes and removing the waiting time which are not as "physical"
 
Last edited:
  • #6
Lobotomy,

I think what you are calculating then is A + B + C - B = A + C so then you can use the regular formulas for A + C, if you know those.
 
  • #7
flat man said:
Lobotomy,

I think what you are calculating then is A + B + C - B = A + C so then you can use the regular formulas for A + C, if you know those.

ok in other words variance do decrease when removing processes physically? removing the process with 5^2 variance from the two processes which together has 5^2+10^2 then gives us 10^2 variance right? i mean this sounds perfectly logic to me anyways.

just for fun. can anyone give an example from the real world for instance using stochastic variables with time where the "normal" subtraction rule with addition is used... i don't understand how this could relate to anything concrete
 
  • #8
Suppose the total time of the process as it currently exists is [itex] N(\mu, \sigma_T) [/itex] (I use [itex] \sigma_T [/itex] to indicate the standard deviation of the Total time).

And, suppose you can break the current process down into its component parts (illustrated with 3 just for briefness) so that the total process can be thought of as Step 1, Step 2, Step 3, and

[tex]
\mu = \mu_1 + \mu_2 + \mu_3
[/tex]

[tex]
\sigma_T^2 = \sigma_1^2 + \sigma_2^2 + \sigma_3^2
[/tex]

If you can eliminate step 3, then the mean time for the reduced process is

[tex]
\mu_1 + \mu_2
[/tex]

and the variance is only

[tex]
\sigma_1^2 + \sigma_2^2
[/tex]

but - you have to have the original process broken into components as shown here in order to determine the amount by which mean time and variability decrease.

part of the confusion in the earlier give-and-take was a lack of precision in exactly what you wanted and lack of understanding on my part.
 
  • #9
Lobotomy,

Imagine if my car rattles for A minutes before it dies where A ~ N(15,3). Further imagine that it usually dies B minutes after start up, where B ~ N(45,4), then how long after start up will it start to rattle? C = B - A so, C ~ N(30,5).
 
  • #10
statdad said:
Suppose the total time of the process as it currently exists is [itex] N(\mu, \sigma_T) [/itex] (I use [itex] \sigma_T [/itex] to indicate the standard deviation of the Total time).

And, suppose you can break the current process down into its component parts (illustrated with 3 just for briefness) so that the total process can be thought of as Step 1, Step 2, Step 3, and

[tex]
\mu = \mu_1 + \mu_2 + \mu_3
[/tex]

[tex]
\sigma_T^2 = \sigma_1^2 + \sigma_2^2 + \sigma_3^2
[/tex]

If you can eliminate step 3, then the mean time for the reduced process is

[tex]
\mu_1 + \mu_2
[/tex]

and the variance is only

[tex]
\sigma_1^2 + \sigma_2^2
[/tex]

but - you have to have the original process broken into components as shown here in order to determine the amount by which mean time and variability decrease.

part of the confusion in the earlier give-and-take was a lack of precision in exactly what you wanted and lack of understanding on my part.


ok yeah this makes sense. i just got confused by that formula i found.
but even if i don't know the exact variation i know that individual physical processes have SOME KIND of variation, and reducing that process physically will reduce the total variation with THAT AMOUNT of variation. and i guess the same holds true for random waiting times...

how about dependency. adding and subtracting variation then also includes a factor of covariance. but i guess that won't make any difference for the general conclusion other than that the amount of variance reduction will be another
 

Related to Subtraction of normal distributed stochastic variables

What is the concept of subtraction of normal distributed stochastic variables?

The concept of subtraction of normal distributed stochastic variables involves calculating the difference between two random variables that follow a normal distribution. This is done by subtracting the mean values of the two variables and then taking the difference between their standard deviations.

What is the significance of subtracting normal distributed stochastic variables in scientific research?

Subtracting normal distributed stochastic variables is commonly used in scientific research to compare two sets of data or to analyze the effects of an intervention or treatment. It allows for the quantification of the difference between two variables and helps researchers to determine if there is a statistically significant difference between them.

How is the subtraction of normal distributed stochastic variables calculated?

The subtraction of normal distributed stochastic variables is calculated by first subtracting the mean values of the two variables and then taking the difference between their standard deviations. This can be represented by the formula: (X-Y) = (μx-μy) ± (σx-σy).

What are the assumptions that must be met for subtracting normal distributed stochastic variables?

In order to accurately subtract normal distributed stochastic variables, the following assumptions must be met:

  • The two variables being subtracted must follow a normal distribution
  • The two variables must be independent of each other
  • The standard deviations of the two variables must be known
If these assumptions are not met, alternative statistical methods may need to be used.

How can the results of subtracting normal distributed stochastic variables be interpreted?

The results of subtracting normal distributed stochastic variables can be interpreted as the difference between the two variables. This difference can be positive, negative, or zero, depending on the values of the two variables. The magnitude of the difference can also be used to determine the strength of the relationship between the two variables, with a larger difference indicating a stronger relationship.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
9
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
2K
Replies
12
Views
775
  • Classical Physics
Replies
0
Views
303
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
2K
Back
Top