Statistic: Pattern with context dilemma

  • Thread starter pyfgcr
  • Start date
  • Tags
    Statistic
In summary, the infinite-length binary text create from these patterns, with probability:"0": 0.8"11": 0.1"100": 0.05"101": 0.05
  • #1
pyfgcr
22
0

Homework Statement


An infinite-length binary text create from these patterns, with probability:
  • "0": 0.8
  • "11": 0.1
  • "100": 0.05
  • "101": 0.05
( For example: 0111001001010 ("0" "11" "100" "100" "101" "0") is a valid excerpt from the text)
I temporary call a,b,c,d for P("0"), P("11"), P("100"), P("101")
Problem: create a probability table for length-3 patterns (that is, find P(x): x="000","001","010",...,"111", x is excerpted at any point of the text)

Homework Equations


The Attempt at a Solution


Some of the patterns can be determined easily: 000, 011, 100, 101, 110 with

P(000)=a3=0.512
P(011)=P(110)=a*b=0.08

However, there are no evidence about 001,010,111. So I think of context, e.g., 001 is create with 2 "0" and anything start with 1 ("11","110","101") or 2 first zero from "100" and final 1 from anything start with 1, hence:

P(001)=a2*(b+c+d) + c*(b+c+d)=0.138

With this logic, we have:

P(010)=a*d+c*d+d*(b+c+d)=0.0525
P(111)=b*(b+c+d)+(b+d)*b=0.035

However, that logic must be applied for 000,011,100,101,110 as well:

P(000)=a3+c*a=0.552
P(011)=a*b+c*b+d*(b+c+d)=0.095
P(100)=c+(b+d)*a2=0.146
P(101)=d+(b+d)*a*(b+c+d)=0.11
P(110)=b*a=0.08

And after all, the sum of (1≤x≤8),P(x)=1.2085
I know that there is something wrong with taking the context to account, but I don't really know any other ways than considering the context.
 
Last edited:
Physics news on Phys.org
  • #2
For any bit in the sequence, there are 9 possibilities:
- the '0' in a '0' pattern
- one of the two bits in 11 pattern
- one the 3 in a 100 pattern
- one the 3 in a 101 pattern
Can you calculate the relative frequencies of these 9 cases?
Having done that, suppose you pick a bit as the first of a triplet. For each of the 9 possibilities, can you calculate the probabilities that the next two bits are 00, 01, 10, 11?
 
  • #3
Oh, now I understand my mistake: I don't take relative frequency in account. I calculate again and now the sum is 1. Thank you very much.
 
  • #4
pyfgcr said:
  • "0": 0.8
  • "11": 0.1
  • "100": 0.05
  • "101": 0.05

Some of the patterns can be determined easily: 000, 011, 100, 101, 110 with

P(000)=a3=0.512
P(011)=P(110)=a*b=0.08

This isn't right unless I have misunderstood.

000 could be generated by ("0" "0" "0") or ("100" "0")
011 could be generated by ("0" "11") or ("100" "11") or ("101" "100") or ("101" "11") or ...
 
  • #5
Yes, this was my mistake before, but I have calculated again with relative frequency by haruspex suggested and now the problem has been solve. The new result is 0.536, 0.134, 0.095, 0.085, 89/1500, 41/1500, 7/150, 1/60 of 000, 001, 010, 011, 100, 101, 110, 111 respectively
 

Related to Statistic: Pattern with context dilemma

1. What is the "pattern with context dilemma" in statistics?

The "pattern with context dilemma" is a common issue in statistics where there appears to be a significant relationship or pattern between two variables, but this relationship disappears when additional variables are taken into account. This can lead to incorrect conclusions and misinterpretation of data.

2. How does the "pattern with context dilemma" affect statistical analysis?

The "pattern with context dilemma" can significantly impact statistical analysis by misleading researchers into believing there is a strong relationship between variables when there isn't. This can result in faulty conclusions and incorrect decisions based on the data.

3. What can be done to address the "pattern with context dilemma" in statistical analysis?

To address the "pattern with context dilemma," it is important to carefully consider all relevant variables and their potential impact on the relationship between variables. This may involve conducting further research or using more advanced statistical techniques to control for confounding variables.

4. How can researchers avoid falling into the "pattern with context dilemma" trap?

Researchers can avoid the "pattern with context dilemma" by thoroughly understanding the context of their data and being aware of any potential confounding variables. It is also essential to use appropriate statistical methods and to interpret results carefully, taking into account all relevant factors.

5. Can the "pattern with context dilemma" be completely eliminated in statistical analysis?

While it may not be possible to completely eliminate the "pattern with context dilemma," researchers can minimize its impact by being diligent and thorough in their data analysis. Additionally, using advanced statistical techniques and considering all relevant variables can help to reduce the potential for this dilemma to occur.

Similar threads

  • Calculus and Beyond Homework Help
Replies
12
Views
1K
  • Precalculus Mathematics Homework Help
Replies
7
Views
862
  • Quantum Physics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • Engineering and Comp Sci Homework Help
Replies
1
Views
1K
  • Math Proof Training and Practice
2
Replies
61
Views
6K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
851
  • Engineering and Comp Sci Homework Help
Replies
24
Views
6K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
3K
  • Calculus and Beyond Homework Help
Replies
2
Views
2K
Back
Top