Probability Events in Statistics

In summary: The statement "If A and B are the only sets, presumably they must be disjoint and together give the whole space." is true and the conclusion is false. It is not possible for P(A)+P(B)=1 and P(A)+P(B)>1.You could have said the same thing as my last sentence and it would have been fine. I was not trying to say anything about the probability of A and B being disjoint.When you say, "If P(A)+P(B)<1 then AUB is not the entire sample space." You could just as easily say "If P(A)+P(B)=1 then AUB is the entire sample space." and it is true. And
  • #1
eric.l
5
0

Homework Statement



Given two events, A and B, state why each of the following is not possible. Use formulas or equations to illustrate your answer.

d. P(B) = 0.24 and P(B|A) = 0.32

Homework Equations



P(A ∩ B) = P(A) * P(B|A)
P(B|A)= (P(A ∩ B))/P(A)
P(A|B)= (P(A ∩ B))/P(B)

The Attempt at a Solution



My teacher gave us the hint that P(B) ≥ P(A ∩ B), so:
P(B) ≥ P(A ∩ B)
P(B) ≥ P(A) * P(B|A)
P(B) ≥ P(A) * P(A ∩ B)/P(A), the P(A)'s cancel to get right back to
P(B) ≥ P(A ∩ B)
and here I am, lost. I've no idea if this is even remotely on the right track, but that was my attempt and fail.
 
Physics news on Phys.org
  • #2
eric.l said:

Homework Statement



Given two events, A and B, state why each of the following is not possible. Use formulas or equations to illustrate your answer.

d. P(B) = 0.24 and P(B|A) = 0.32

Homework Equations



P(A ∩ B) = P(A) * P(B|A)
P(B|A)= (P(A ∩ B))/P(A)
P(A|B)= (P(A ∩ B))/P(B)

The Attempt at a Solution



My teacher gave us the hint that P(B) ≥ P(A ∩ B), so:
P(B) ≥ P(A ∩ B)
P(B) ≥ P(A) * P(B|A)
P(B) ≥ P(A) * P(A ∩ B)/P(A), the P(A)'s cancel to get right back to
P(B) ≥ P(A ∩ B)
and here I am, lost. I've no idea if this is even remotely on the right track, but that was my attempt and fail.

Homework Statement


Homework Equations


The Attempt at a Solution


I really don't see any reason why that should be not possible. Suppose you have 1000 events. B contains 240 events. Suppose A contains 100 events and AnB contains 32 events. Then P(B)=0.24 and P(B|A) is 0.32. I don't see any problem with that. Do you?
 
  • #3
d. P(B) = 0.24 and P(B|A) = 0.32
Is there more information? - as written, this is possible because:
Dick said:
Suppose you have 1000 events. B contains 240 events. Suppose A contains 100 events and AnB contains 32 events. Then P(B)=0.24 and P(B|A) is 0.32.
OPs teacher supplied the following hint:
P(B) ≥ P(A ∩ B)
... in Dick's example, P(AnB)=32/1000=0.032 < P(B) - so it still fits... except ... Dick gives the system a superset of 1000 members of which A and B are subsets.

But the question only mentions two categories: A and B ... if A and B are all there is, (perhaps we are supposed to take this as implicit) then P(A) + P(B) ≥ 1

... and the argument goes like this:

P(B|A) = P(AnB)/P(A) ≤ P(B)/P(A)... since P(AnB) ≤ P(B)

Hence: P(A) ≤ P(B)/P(B|A) eg. P(A) ≤ 0.75

Since we know that P(B)=0.24, then P(A) + P(B) ≤ 0.99 < 1

Which contradicts the requirement that P(A) + P(B) ≥ 1
Hence: impossible.

[edit] I wouldn't normally do someones homework for them - but (a) there is the possibility I've made a mistake, and (b) it goes to an ambiguity in the question: I'm unsure how to encourage OP to the realization, and (c) I'd already hit save by accident :) ... so OP already got the email notification.
 
Last edited:
  • #4
Hah, hit save by accident. I've done that. Usually I try to delete as fast as possible and hope for the best. I think there might be a delay on the email thing, but I might be wrong. You raise a good point. If A and B are the only allowable outcomes then things change. I usually don't think of questions like this that way, but that would make sense.
 
Last edited:
  • #5
I don't normally read them that way either - I got there by wondering if there was some extra information left out ... perhaps implicit in the context. i.e. we have not seen the other "impossible" situations ... perhaps they all rely on A and B being the only sets?

When the numbers panned out so nicely I figured it was probably the case... even though the wording does leave room for there to be events that are neither A nor B. In fact - if you came across those statistics, you'd take it as evidence that there was at least one other possible event.
 
  • #6
Simon Bridge said:
I don't normally read them that way either - I got there by wondering if there was some extra information left out ... perhaps implicit in the context. i.e. we have not seen the other "impossible" situations ... perhaps they all rely on A and B being the only sets?

When the numbers panned out so nicely I figured it was probably the case... even though the wording does leave room for there to be events that are neither A nor B. In fact - if you came across those statistics, you'd take it as evidence that there was at least one other possible event.

If A and B are the only sets, presumably they must be disjoint and together give the whole space. But that would give them an empty intersection, so P(A|B) = P(B|A) = 0. On the other hand, if they are not disjoint we also have "other" sets, such as ## A - B, \; B - A, \; A \cap B. ## Presumably we would have ##A \cup B = \Omega## in this case, where Ω is the sample space. So, I cannot really make sense of the problem as stated.
 
  • #7
If AUB is the entire sample-space, the A and B are disjoint if P(A)+P(B)=1
They intersect if P(A)+P(B)>1

If P(A)+P(B)<1 then AUB is not the entire sample space.

For instance, I could have set A being "women" and set B being "scientists". AnB would be "women scientists". If I carefully filled the room so that all the men present were scientists then sets A and B would be all there was and there would be a non-empty intersection.
 
  • #8
Simon Bridge said:
If AUB is the entire sample-space, the A and B are disjoint if P(A)+P(B)=1
They intersect if P(A)+P(B)>1

If P(A)+P(B)<1 then AUB is not the entire sample space.

For instance, I could have set A being "women" and set B being "scientists". AnB would be "women scientists". If I carefully filled the room so that all the men present were scientists then sets A and B would be all there was and there would be a non-empty intersection.

Right. All I was saying was that if A and B are the "only" sets then they must be disjoint and their union must be the whole space, in which case P(A|B) = 0, etc. If they have a nonempty intersection then they cannot be the "only" sets---there are others (that can be constructed from them by set operations).
 
  • #9
In short, either the problem is wrong or eric.l did not specify the full problem; something's missing.

There certainly is nothing wrong with P(B) being very small and P(B|A) being large. That is essentially what automated fault detection is all about, looking for extremely low probability events (failures should be extremely unlikely) in light of evidence.
 
  • #10
In short, either the problem is wrong or eric.
Well the problem is supposed to be wrong - the question asks to explain how it is wrong.
All I was saying was that if A and B are the "only" sets then they must be disjoint and their union must be the whole space, in which case P(A|B) = 0, etc. If they have a nonempty intersection then they cannot be the "only" sets---there are others (that can be constructed from them by set operations).
The given A and B, in the problem, you mean?

Or perhaps I am not using the right terminology to describe what I have in mind?
 
  • #11
Basically everything above is what I ended up getting to and all of it seemed to be possible, but my teacher stated that it wasn't possible. Thank you so much for all the help though, at least I was on the right track with what I did afterwards
 
  • #12
eric.l said:
Basically everything above is what I ended up getting to and all of it seemed to be possible, but my teacher stated that it wasn't possible. Thank you so much for all the help though, at least I was on the right track with what I did afterwards

I'm sorry, but the two items "P(B) = 0.24 and P(B|A) = 0.32" ARE possible, so either your teacher is wrong or you have left out some conditions.
 
  • #13
Ray Vickson said:
the two items "P(B) = 0.24 and P(B|A) = 0.32" ARE possible
Exactly. DNA testing, drug testing, and all other kinds of statistical inferencing rely on this.
 
  • #14
Ray Vickson said:
I'm sorry, but the two items "P(B) = 0.24 and P(B|A) = 0.32" ARE possible, so either your teacher is wrong or you have left out some conditions.

Simon Bridge said:
... and the argument goes like this:

P(B|A) = P(AnB)/P(A) ≤ P(B)/P(A)... since P(AnB) ≤ P(B)

Hence: P(A) ≤ P(B)/P(B|A) eg. P(A) ≤ 0.75

Since we know that P(B)=0.24, then P(A) + P(B) ≤ 0.99 < 1

Which contradicts the requirement that P(A) + P(B) ≥ 1
Hence: impossible.

Showed the above to my teacher and he verified how that was the correct way to show that it was impossible. I double and triple checked before I even posted to make sure I didn't forget anything and I did not, and I'm almost positive the question came off a previous AP exam and I doubt that whoever issues the tests would give a question that doesn't go along with the instructions given.
 
  • #15
If this is supposed to be a generic statement, it's wrong. Flat out wrong.
 
  • #16
eric.l said:
Showed the above to my teacher and he verified how that was the correct way to show that it was impossible. I double and triple checked before I even posted to make sure I didn't forget anything and I did not, and I'm almost positive the question came off a previous AP exam and I doubt that whoever issues the tests would give a question that doesn't go along with the instructions given.

From the problem description, as you have written it, there is no reason why you need P(A) + P(B) ≥ 1. The problem talks about two events A and B and says nothing about "categories" or anything like that.
 
  • #17
If A and B are the only two events, then P(A)+P(B)≥1 ... this condition may not have been given explicitly. There may have been something in the original wording or the context of the lesson (perhaps the class has not considered the situation where there is a superset or a third event possible yet?)

The way the numbers pan out suggests this is the case. However - I'd be interested to see what the teacher in question has to say. If this was me, I'd present both Dick's example and the above for marking and maybe finesse a bonus mark ;)
 
  • #18
Simon Bridge said:
If A and B are the only two events, then P(A)+P(B)≥1 ... this condition may not have been given explicitly.
If A and B are the only two events then A is the complement of B. In that case, P(A)+P(B) is identically equal to one (not ≥1). Saying P(B|A) cannot be 0.32 in this case is a bit silly because P(B|A) is identically zero. The events are mutually exclusive.

If A is not the complement of B, there are (at least) four events: B, ~B, A, and ~A. Your P(A)+P(B)≥1 is not necessarily true.

An example: Suppose there is some boolean property P that pertains to each member of the population, and suppose there is a test Q for that condition. Set B comprises those members of the population for which property P is true. Set A comprises those members of the population who for which test Q yields a positive result. Note that P(B|A) is identically one if test Q has no false positives. (This does not mean that A=B. For example, the test can have false negatives.)

Suppose the test is not perfect; it yields false positives and false negatives. Suppose there are much more expensive tests than collectively are perfect, and suppose we subject one thousand members of the population to these definitive tests and to test Q, yielding the following confusion matrix:
Code:
      |  A    ~A  total
------+----------------
  B   | 220   20    240
 ~B   |   2  758    760
total | 222  778   1000

In this example, P(B) is 0.24 while P(B|A) is over 0.99. The test does a very good job of positively identifying members with the condition P, but it does so at the expense of a significant number of false negatives. A cheap screening test should have a low false negative rate. Suppose the confusion matrix is instead:
Code:
      |  A    ~A  total
------+----------------
  B   | 239    1    240
 ~B   |  26  734    760
total | 265  735   1000

This has a low number of false negatives (less than 0.5%) but at the expense of more significantly more false positives (almost 10%). Even with those false positives, P(B|A) is still over 90%.

Note that in these two examples, P(A)+P(B) is about 1/2.
 
Last edited:
  • #19
If A is not the complement of B, there are (at least) four events: B, ~B, A, and ~A. Your P(A)+P(B)≥1 is not necessarily true.
But you know what I'm trying to say?

I am imagining a ven diagram with two circles that overlap. There is nothing outside them.

This is a typical HS beginning example used when students are starting out... so I figure this is implicit the the class in question.
Supposing this and the question makes sense.

How would you put it?

I do understand and follow your example:
Suppose there is some boolean property P that pertains to each member of the population, and suppose there is a test Q for that condition. Set B comprises those members of the population for which property P is true. Set A comprises those members of the population who for which test Q yields a positive result.
But it has an implied superset which is the population ... there is a non-zero probability that a member of the populace is not in A and not in B isn't there? That means that A and B are not all there is.
 
  • #20
Simon Bridge said:
But you know what I'm trying to say?

I am imagining a ven diagram with two circles that overlap. There is nothing outside them.

This is a typical HS beginning example used when students are starting out... so I figure this is implicit the the class in question.
Supposing this and the question makes sense.

How would you put it?

I do understand and follow your example:But it has an implied superset which is the population ... there is a non-zero probability that a member of the populace is not in A and not in B isn't there? That means that A and B are not all there is.

I guess the implication is that we were supposed to know that ##A \cup B = \Omega## (the whole sample space). That does mean that A and B are not the only sets: there are also A-B,B-A and AB (the intersection). However, I suppose one can guess what is meant, and in that case the result is, indeed, true. If we let P(AB)=x, we have P(B) = x + P(B-A) = x + 1-P(A), so 0.24 = x + 1-P(A) ---> P(A) = x + 0.76. Thus, P(B|A) = x/(x + 0.76) = 0.32. so x = 152/425 = 0.3576. and that would make P(A) = 76/100+x = 19/17 > 1.
 

Related to Probability Events in Statistics

1. What is probability in statistics?

In statistics, probability is a measure of the likelihood of an event occurring. It is expressed as a number between 0 and 1, where 0 represents impossibility and 1 represents certainty.

2. How is probability calculated?

Probability is calculated by dividing the number of favorable outcomes by the total number of possible outcomes. This is known as the probability formula: P(A) = # of favorable outcomes / # of possible outcomes.

3. What is the difference between theoretical and experimental probability?

Theoretical probability is based on mathematical calculations and assumes that all outcomes are equally likely. Experimental probability, on the other hand, is based on actual trials or experiments and takes into account any variations or biases.

4. How does probability relate to statistics?

Probability is an essential concept in statistics because it helps us understand the likelihood of certain outcomes and make predictions based on data. It is used to analyze and interpret data, and to test hypotheses.

5. Can probability be used to predict the future?

While probability can be used to make predictions, it cannot guarantee the outcome of future events. Probability is based on past data and assumes that the same conditions will apply in the future, but it cannot account for unforeseen circumstances or random chance.

Similar threads

  • Calculus and Beyond Homework Help
Replies
4
Views
885
  • Calculus and Beyond Homework Help
Replies
4
Views
865
  • Calculus and Beyond Homework Help
Replies
4
Views
1K
  • Calculus and Beyond Homework Help
Replies
2
Views
1K
  • Calculus and Beyond Homework Help
Replies
3
Views
681
  • Calculus and Beyond Homework Help
Replies
1
Views
1K
  • Calculus and Beyond Homework Help
Replies
12
Views
1K
  • Calculus and Beyond Homework Help
Replies
7
Views
1K
  • Calculus and Beyond Homework Help
Replies
2
Views
2K
  • Calculus and Beyond Homework Help
Replies
9
Views
2K
Back
Top