What is Statistics: Definition and 998 Discussions

Statistics is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industrial, or social problem, it is conventional to begin with a statistical population or a statistical model to be studied. Populations can be diverse groups of people or objects such as "all people living in a country" or "every atom composing a crystal". Statistics deals with every aspect of data, including the planning of data collection in terms of the design of surveys and experiments.When census data cannot be collected, statisticians collect data by developing specific experiment designs and survey samples. Representative sampling assures that inferences and conclusions can reasonably extend from the sample to the population as a whole. An experimental study involves taking measurements of the system under study, manipulating the system, and then taking additional measurements using the same procedure to determine if the manipulation has modified the values of the measurements. In contrast, an observational study does not involve experimental manipulation.
Two main statistical methods are used in data analysis: descriptive statistics, which summarize data from a sample using indexes such as the mean or standard deviation, and inferential statistics, which draw conclusions from data that are subject to random variation (e.g., observational errors, sampling variation). Descriptive statistics are most often concerned with two sets of properties of a distribution (sample or population): central tendency (or location) seeks to characterize the distribution's central or typical value, while dispersion (or variability) characterizes the extent to which members of the distribution depart from its center and each other. Inferences on mathematical statistics are made under the framework of probability theory, which deals with the analysis of random phenomena.
A standard statistical procedure involves the collection of data leading to test of the relationship between two statistical data sets, or a data set and synthetic data drawn from an idealized model. A hypothesis is proposed for the statistical relationship between the two data sets, and this is compared as an alternative to an idealized null hypothesis of no relationship between two data sets. Rejecting or disproving the null hypothesis is done using statistical tests that quantify the sense in which the null can be proven false, given the data that are used in the test. Working from a null hypothesis, two basic forms of error are recognized: Type I errors (null hypothesis is falsely rejected giving a "false positive") and Type II errors (null hypothesis fails to be rejected and an actual relationship between populations is missed giving a "false negative"). Multiple problems have come to be associated with this framework, ranging from obtaining a sufficient sample size to specifying an adequate null hypothesis. Measurement processes that generate statistical data are also subject to error. Many of these errors are classified as random (noise) or systematic (bias), but other types of errors (e.g., blunder, such as when an analyst reports incorrect units) can also occur. The presence of missing data or censoring may result in biased estimates and specific techniques have been developed to address these problems.

View More On Wikipedia.org
  1. T

    Statistics: geometric distribution proof problem

    Statistics: geometric distribution "proof" problem Homework Statement If Y has a geometric distribution with success probability p, show that: P(Y = an odd integer) = \frac{p}{1-q^{2}} Homework Equations p(y)=p(q)^{2} The Attempt at a Solution p(1)=pq^0 p(3)=pq^2 p(5)=pq^4...
  2. X

    Statistics question (poisson distribution, multivariate )

    Homework Statement On average, 2.5 telephone calls per minute are received at a corporation's switchboard. Making appropriate assumptions about the distribution ( provide justification ), find the probability that at any given minute there will be more than 2 calls. Homework Equations No...
  3. L

    Probability of Peter Winning by 6 games to 3 or 6 games to 4 in a Tennis Match

    Peter and Alex plays tennis. Peter serves through out the first game, Alex serves throughout the second game. When Peter serves, the probability that he wins is 0.8. When Alex serves first the probability that Peter wins is 0.4. A game cannot be drawn. After 6 games Peter and Alex both have won...
  4. B

    Need help understanding how to interpret a statistics problem

    Homework Statement Let c be a constant, and define Y_{i}=1 if X_{i}\leq c and zero otherwise. Find a statistic that is a function of Y_{1},Y_{2},...Y_{n} and also unbiased for F_{X}(c)=\Phi\left(\dfrac{c-\mu}{\sigma}\right). Homework Equations The Attempt at a Solution not even...
  5. J

    Statistics - arrangement in a circle

    Homework Statement Delegates from the G8 are to be seated around a circular table. How many different seating arrangements are possible if the French and Canadian delegates are to be seated next to each other, but the Russian and Japanese are not to be next to each other? Homework...
  6. M

    Performing Statistics on Neuro-electrophysiology Data

    Hi all, I am working with long term potentiation (LTP) induction in hippocampal brain slices. I am wondering about how exactly to statistically analyze the data I am presented with. It looks like http://hmg.oxfordjournals.org/content/19/4/634/F5.small.gif Where each line of points...
  7. J

    Statistics - Within-subjects for categorical data

    Hello, I am struggling to find a suitable statistic test for one of my studies: I have 4 different conditions, and I want to test if my results differ on those conditions. When the dependent variable is continuous it is easy to find a statistical test; however when it is a categorical variable...
  8. T

    Statistics question (combinations/permutations)

    Homework Statement The eating club is hosting a make your own sundae at which the following are provided: Ice cream flavors: chocolate, cookies-n-cream, strawberry, vanilla Toppings: caramel, hot fudge, marshmellow, m & m's, nuts, strawberries a) How many sundaes are possible using...
  9. F

    Mastering Statistics: Calculating Fill Volumes and Probability Distributions

    To be honest I have no real idea how to do this. I know of the probability density function, and the cumulative probability function, but using them is another difficulty I'm trying to overcome. My questions are, Intravenous fluid bags are filled by an automated filling machine. Assume that...
  10. N

    Statistics - Hypothesis testing question

    Hello folks. This was in my exam and I wasn't sure how to solve it. Homework Statement A sample of 67 respondents was taken collectively by a population of 6,456 units. The number of those who said they were available to implement a development program were 35. To implement the program...
  11. X

    Transforming the data in order to create a linear relationship between x and y.

    Homework Statement A Fisher presents the following data, which give the logarithm to the base 10 of the volume occupied by algal cells on successive days, taken over a period for which the relative growth rate is approximately constant. Day 1 = 3.592, Day 2 = 3.823, Day 3 = 4.174, Day 4 =...
  12. M

    Statistics and Tchebysheff's theorum

    Homework Statement Let k\geq1. Show that, for any set of n measurements, the fraction included in the interval \overline{y}-ks to \overline{y}+ks is at least (1-1/k2). [Hint: s2 = 1/(n-1)[\sum(yi-\overline{y})2]. In this expression, replace all deviations for which the absolute value of...
  13. B

    Quantum statistics expectation value

    1. What is the expectation value, <x>, for the given distribution over the interval from – to + infinity of the function: f(x)=e^(-.5(x-mu)^2(sigma^-2)) 2. This is a statistics problem i think. I just need to know how this type of problem is worked out because it is relevant to my...
  14. pairofstrings

    What is the meaning of percentile in statistics?

    I want to know what is the meaning of percentile. what is the percentile : 23.79 means? Please give me explanation and also examples. Thanks!
  15. N

    How to quantize the ''mathematical'' fluctuation field in statistics?

    Please teach me this: The general effort is to quantize the fields of elementary particles and gravitons.But I wonder about ''mathematical'' fields such as the fluctuation fields in statistical physics.I think there may be many ''continuous'' fields in physics.Could the functional integral...
  16. 9

    Thermal Statistics - Microstate Probabilities

    Homework Statement For a system in equilibrium at temperature T, the probability of finding it in a microstate m is: P(m) = (1/Z)exp(-E/kT) where Z is the partition function. There are three accessible microstates, two with energy E_{a} and one with energy E_{b}. Two identical and...
  17. P

    Mathematical Statistics good for Computer Scientists?

    I am finished with the first bachelor year at Computer Science and the next courses are: Assembly, Linear Algebra, Statistics, Java. Many people from my class said that they will not take Statistics as it's not that helpful to them and I was thinking of how accurate that is. Is Statistics...
  18. I

    Programs Difficulty of a statistics major?

    Could anyone give an insight as to how difficult a statistics major is? My college doesn't offer any probability classes until the junior year, so I'd have no way of knowing what it's like until then. My fear is that I'll wind up taking all these prereqs and end up hating or not understanding...
  19. S

    What is the Probability and Risk Assessment for Motor Insurance Claims?

    Homework Statement (a) A survey on 50 claims for a particular class of customers of a motor insurance company found out that the average cost for car damage is £700 with a standard deviation of 400. The insurance manager believes instead that the average cost is £80 bigger. (i) Does the...
  20. B

    Statistics and integration problem

    Statistics and integration problem (urgent) can anyone help me with (6)?
  21. QuarkCharmer

    Gaussian Function in Statistics

    Homework Statement Not really a homework problem, but something that I was curious about. I was thinking about how a calculator finds the area under the standard normal distribution, and I started to assume that it most likely has the antiderivative for the function that makes the standard...
  22. G

    Studying Is studying a little bit of statistics worth it?

    I'm going to be a senior in high school next year, and I'm thinking about self-studying AP Statistics. I want to be some sort of mathematician or physicist, and I am thinking that having the equivalent of a first-year course in college statistics will significantly broaden my career choices and...
  23. K

    Programs Major in mathematics of statistics

    Hello, this is my first post in physics forum and I need some help in my future major selection. I plan to go into research in biology, and would like to go into computational biology in grad/med school (MD/PhD programs). I plan to double major in biochem and either math/statistics, and I will...
  24. B

    Beginners Guide to Probability & Statistics: Examples & Applications

    What book or course would you recommend to a beginner in Probability and Statistics? That is my question. However I would like to specify my intentions. Along my math courses I have observed (many people have) that mathematicians have a different way of thinking this common language...
  25. C

    Null hypothesis, alternative hypothesis, statistics

    Homework Statement Assume that the cholesterol level of college aged men nationwide is normally distributed with mean of 180 and standard deviation is 20. Twelve male MVCC students were randomly selected; the cholesterol level of each was determined: 250, 220, 145, 225, 170, 210, 200, 210...
  26. C

    Null Hypothesis/Alternative hypothesis, statistics help

    Null Hypothesis/Alternative hypothesis, statistics help! Homework Statement For each of the following questions, state(using symbols) the null and alternative hypotheses. a.)Does the average speed of vehicles on the Thruway exceed 65? b.) Is the mean height of 6th graders different...
  27. C

    Statistics, standard deviation help

    Homework Statement Suppose that the systolic blood pressure of 20 year old females is normally distributed with mean 120 and standard deviation 15. a.) What proportion of 20 year old females will have readings between 115 and 125? b.) If you selected a sample of 64 people from this...
  28. C

    Statistics help, Confidence interval

    Homework Statement You look at a random sample of 64 vehicles in one school parking lot and find that 24 are trucks. Determine the 80% confidence interval for the true population proportion of trucks in all the school lots. Homework Equations (work shown below) The Attempt at a...
  29. C

    Statistics, sample size determination

    Homework Statement Glucose level is normally distributed, standard deviation is known to be 25 units; determine the sample size needed to be 95% confident that the sample mean and the true population mean differ by at most 4 units. Homework Equations (work shown below) The Attempt...
  30. C

    What is the probability of a male living more than 86 years in the U.S.?

    Homework Statement Assume the life expectancy of U.S. males is normally distributed with a mean of 80 years and a standard deviation of 5 years. What is the probability that a randomly selected male lives more than 86 years? Homework Equations (work shown below) The Attempt at a...
  31. C

    Population mean help, Statistics

    Homework Statement Assume that the population is normal with some unknown mean and standard deviation of 25. A sample size 400 is taken. The resultant sample mean is 67. a.) What is your best estimate for the value of the population mean? b.) What is the value of the standard error...
  32. C

    Probability Interval, statistics help

    Probability Interval, statistics help! Homework Statement Assume that the population is normal with mean of 90 and standard deviation of 25. A sample of size 100 will be selected. a.) What is your best estimate for the value of the sample mean? b.) What is the value of the standard error...
  33. C

    Statistics, probability, basketball foul shots

    Homework Statement Records show that 72% of basketball foul shots attempted are successfully made during practice. Today, each of the 10 members of the team gets a practice shot. What is the probability that at least 8 team members make the shot successfully? Homework Equations...
  34. C

    What is the Probability of Owning Either a Microwave or CD Player?

    Homework Statement The probability that Joe owns a microwave oven is 75%; the probability that he owns a CD player is 45%; and the probability that he owns both is 40%. Find the probability that he owns either one but not both. Homework Equations (work posted below) The Attempt at...
  35. C

    Statistics, ping pong balls in bag probability question

    Homework Statement Twelve colored ping-pong balls are placed into a shopping bag and well mixed. There are two red balls, six blue balls and four green balls. One ball is selected at random, its color noted and then it is set aside. A second ball is then randomly selected and its color...
  36. C

    Statistics, Probability, OR equation

    Homework Statement An appliance store purchases electric ranges from two companies. From company A, 600 ranges are purchased, and 5% are found to be defective. From company B, 300 ranges are purchased, and 8% are found to be defective. Given that a range is defective, find the probability...
  37. C

    How Do We Calculate Expected Defects in Quality Control Samples?

    Homework Statement The number of defects in a sample of 20 parts is recorded for quality control purposes; over the last year the number of defects and their occurrence rate has been : 0 defects, 82%; 1 defect, 13%; 2 defects, 4%; 3 defects, 1%. Find the expected number of defects in the...
  38. C

    How Do You Calculate the Probability of Both Events Occurring?

    Homework Statement The probability of Mr. Jones being selected for jury duty is 70%. The probability that Mr. Smith will be selected is 40%. The probability that neither will be selected is 10%. What is the probability that both are selected? Homework Equations the "and" rule The...
  39. C

    How Do You Calculate the Probability of a Heart Attack in a High-Risk Group?

    Homework Statement In a certain high-risk group, the chance of a person having suffered a heart attack is 70%. If four persons are chosen from the group, find the probability that at least one will have had a heart attack. Homework Equations (work shown below) these would be dependent...
  40. K

    Statistics: paried observations

    Homework Statement To test the efficacy of a new cholesterol-lowering medication, 10 people are selected at random. Each has their LDL levels measured (shown below as Before), then take the medicine for 10 weeks, and then has their LDL levels measured again (After). Before After...
  41. C

    Calculating the Number of Ways to Answer True-False Exam Questions

    T/F, Please Help! Elementary Statistics! Homework Statement 5-If a true-false exam contains 10 questions, there are 20 different ways to answer all the questions. Homework Equations There are none for it is a true false question. The Attempt at a Solution I believe this is...
  42. C

    Statistics help Subjective Probability

    Homework Statement (T/F) Subjective probability based on intuition and experience has little use in the real world. Homework Equations (no equations needed for it is T/F) The Attempt at a Solution I believe this is False because subject probability is probability derived from...
  43. R

    Directional statistics - Entropy of wrapped normal (Jacobi theta) distribution

    If p(x) is a probability distribution on the real number line, the "wrapped" distribution around the unit circle is: p_w(\theta)=\sum_{n=-\infty}^\infty p(\theta+2\pi n) which is periodic with period 2π. The (differential) entropy is: H=-\int_\Gamma p_w(\theta)\ln[p_w(\theta)]\,d\theta where...
  44. R

    Calculate the standard deviance (statistics)

    (solved) Calculate the standard deviation (statistics) Homework Statement From this table of values, I'm supposed to calculate the standard deviance Homework Equations Up until this point, I've had no problem calculating the standard deviance with my calculator by going to STAT>...
  45. Borek

    Matura in Poland - exam results - skewed statistics

    This is ridiculous. I couldn't decide where to post it, as it is a little bit about politics and a little bit about stastics, so there are reasons to post in other forums as well. It can be even a brain teaser (see the question at the end of the post). For those of you who don't know -...
  46. T

    Need statistics help working with normal distribution

    Hello experts, Thanks to discussions with Stephen Tashi for getting me this far. See the problem statement in the attached PDF page 1. I need help solving for Qc in equation form, as a function of the other variables (N and C), preferably using erfc so I can program an accurate algorithm...
  47. K

    Statistics: normal distribution

    Homework Statement Suppose that X is normally distributed with mean 95 and standard deviation 17. A. What is the probability that is greater than 126.79? B. What value of X does only the top 16% exceed? Homework Equations z= (X-(mean of x))/standard deviation The...
  48. C

    Statistics, multiple choice, regression equation - 2nd question

    Statistics, multiple choice, regression equation -- 2nd question Homework Statement A regression equation was developed to predict gasoline mileage (mpg) for various car weights (pounds). The resultant equation was: Y = 35.2 - .0034 X. Which two answers can be concluded? A. The...
  49. C

    Statistics, multiple choice, correlation coefficient

    Homework Statement The correlation coefficient for each of two data sets is calculated. Set A has r = -0.75 while Set B has r = +0.55. Which set (A or B) has the stronger strength of relationship? Homework Equations I think -0.75? The Attempt at a Solution I think Set A r =...
  50. C

    Statistics help - Scatter plot, regression

    Statistics help -- Scatter plot, regression Homework Statement Consider the following data set for ten first grade students; the variables are the number of minutes spent learning a list of spelling words and the number wrong on the spelling test. [15 points total] (supposed to be a...
Back
Top