# Correlation to Winning question

#### vballer

##### New member
I have a bunch of data for volleyball and I am trying to figure out how correlated certain stats are to winning a game. Here is a small example of the data set.

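| Pts | Err | Diff | EFF% | Win |
|-----|-----|------|------|-----|
| 21 | 10 | 11 | .360 | 1 |
| 21 | 10 | 11 | .350 | 1 |
| 18 | 11 | 7 | .250 | 1 |
| 14 | 6 | 8 | .280 | 1 |
| 19 | 10 | 9 | .380 | 1 |
| 17 | 6 | 11 | .300 | 1 |
| 12 | 9 | 3 | .200 | 0 |
| 14 | 10 | 4 | .100 | 0 |
| 11 | 8 | 3 | .050 | 0 |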

I am trying to determine which of these stats are most correlated with winning (the last column), where the value is 1 for a win and 0 for a loss. In addition to finding the most useful stats for predicting a win, I would like to know what level each of the first four columns needs to reach in order to produce a win at a given confidence level.
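For anyone following along, here is a minimal sketch of one literal reading of "most correlated": the Pearson correlation between each stat and the 0/1 Win column (with a binary column this is the point-biserial correlation). It uses only the nine sample rows above. The names `rows`, `pearson`, `corr`, and `ranking` are illustrative and not part of the original post, and nine rows are far too few to draw firm conclusions.

```python
from math import sqrt

# The nine sample rows from the post: (Pts, Err, Diff, EFF%, Win)
rows = [
    (21, 10, 11, 0.360, 1),
    (21, 10, 11, 0.350, 1),
    (18, 11, 7, 0.250, 1),
    (14, 6, 8, 0.280, 1),
    (19, 10, 9, 0.380, 1),
    (17, 6, 11, 0.300, 1),
    (12, 9, 3, 0.200, 0),
    (14, 10, 4, 0.100, 0),
    (11, 8, 3, 0.050, 0),
]
names = ["Pts", "Err", "Diff", "EFF%"]


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


wins = [r[4] for r in rows]

# Correlation of each stat column with the Win column
corr = {name: pearson([r[i] for r in rows], wins) for i, name in enumerate(names)}

# Rank stats by the strength (absolute value) of their correlation
ranking = sorted(names, key=lambda name: abs(corr[name]), reverse=True)

for name in ranking:
    print(f"{name:5s} r = {corr[name]:+.3f}")
```

A correlation coefficient only measures linear association, so it does not by itself give the "level needed to win" asked about above.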

Any help with this is greatly appreciated.

Thanks
Jamie

#### CaptainBlack

##### Well-known member
I'm not sure that you strictly mean correlated. The best predictor of winning for this data is EFF%: EFF% >= 0.25 predicts a win, EFF% <= 0.20 predicts a loss, and anything between 0.20 and 0.25 is a no-man's land.

Diff is also a perfect predictor for this data (Diff >= 7 predicts a win and Diff <= 4 predicts a loss), but its no-man's land is relatively wider.
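One way such thresholds could be found is sketched below. For each stat it compares the largest value seen in a loss with the smallest value seen in a win; if the win values all sit above the loss values, the stat separates the sample perfectly, and the gap between the two is the no-man's land. This is only an illustration on the nine sample rows. The helper name `separation` is hypothetical, and the sketch assumes that higher values favour a win, which is not a sensible assumption for Err.

```python
# The nine sample rows from the first post: (Pts, Err, Diff, EFF%, Win)
rows = [
    (21, 10, 11, 0.360, 1),
    (21, 10, 11, 0.350, 1),
    (18, 11, 7, 0.250, 1),
    (14, 6, 8, 0.280, 1),
    (19, 10, 9, 0.380, 1),
    (17, 6, 11, 0.300, 1),
    (12, 9, 3, 0.200, 0),
    (14, 10, 4, 0.100, 0),
    (11, 8, 3, 0.050, 0),
]
names = ["Pts", "Err", "Diff", "EFF%"]


def separation(values, wins):
    """Return (max loss value, min win value, gap relative to the stat's range),
    or None if the wins and losses overlap.
    Assumption: higher values favour a win."""
    win_vals = [v for v, w in zip(values, wins) if w == 1]
    loss_vals = [v for v, w in zip(values, wins) if w == 0]
    highest_loss = max(loss_vals)
    lowest_win = min(win_vals)
    if lowest_win <= highest_loss:
        return None  # no single threshold separates wins from losses
    spread = max(values) - min(values)
    return highest_loss, lowest_win, (lowest_win - highest_loss) / spread


wins = [r[4] for r in rows]
gaps = {name: separation([r[i] for r in rows], wins) for i, name in enumerate(names)}

for name, gap in gaps.items():
    if gap is None:
        print(f"{name}: wins and losses overlap")
    else:
        lo, hi, rel = gap
        print(f"{name}: loss <= {lo}, win >= {hi}, no-man's land = {rel:.0%} of range")
```

Dividing the gap by the stat's overall range is just one possible way to compare the width of the no-man's land across stats measured on different scales.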

Without knowing what the real question is, there is little more worth saying.

CB

#### vballer

##### New member
Let me try this again with the actual data set attached.

Most importantly, I am not looking for the actual answer so much as for how to derive it. When you say "predictor", how did you determine that?

To recap, I have 7 columns of stats, and the 8th column is 1 if the game was won and 0 if it was lost. How do I determine which stats are the best predictors of a win?

Thanks

View attachment Stats.txt