What is Statistics: Definition and 998 Discussions

Statistics is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industrial, or social problem, it is conventional to begin with a statistical population or a statistical model to be studied. Populations can be diverse groups of people or objects such as "all people living in a country" or "every atom composing a crystal". Statistics deals with every aspect of data, including the planning of data collection in terms of the design of surveys and experiments.
When census data cannot be collected, statisticians collect data by developing specific experiment designs and survey samples. Representative sampling assures that inferences and conclusions can reasonably extend from the sample to the population as a whole. An experimental study involves taking measurements of the system under study, manipulating the system, and then taking additional measurements using the same procedure to determine if the manipulation has modified the values of the measurements. In contrast, an observational study does not involve experimental manipulation.
Two main statistical methods are used in data analysis: descriptive statistics, which summarize data from a sample using indexes such as the mean or standard deviation, and inferential statistics, which draw conclusions from data that are subject to random variation (e.g., observational errors, sampling variation). Descriptive statistics are most often concerned with two sets of properties of a distribution (sample or population): central tendency (or location) seeks to characterize the distribution's central or typical value, while dispersion (or variability) characterizes the extent to which members of the distribution depart from its center and each other. Inferences on mathematical statistics are made under the framework of probability theory, which deals with the analysis of random phenomena.
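As a rough illustration of the two families of descriptive measures named above (central tendency and dispersion), here is a minimal sketch using only Python's standard `statistics` module. The data values are invented for the example and do not come from any thread on this page.

```python
import statistics

# Hypothetical sample, made up purely for illustration.
sample = [2.0, 4.0, 4.0, 4.0, 5.0, 5.0, 7.0, 9.0]

# Central tendency (location): where the typical value sits.
mean = statistics.mean(sample)
median = statistics.median(sample)

# Dispersion (variability): how far values spread around the center.
sample_sd = statistics.stdev(sample)    # divides by n - 1 (sample estimate)
pop_sd = statistics.pstdev(sample)      # divides by n (treats data as the whole population)
value_range = max(sample) - min(sample)

print(f"mean={mean}, median={median}")
print(f"sample sd={sample_sd:.4f}, population sd={pop_sd:.4f}, range={value_range}")
```

The choice between `stdev` and `pstdev` depends on whether the numbers are a sample from a larger population or the entire population of interest.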
A standard statistical procedure involves the collection of data leading to a test of the relationship between two statistical data sets, or a data set and synthetic data drawn from an idealized model. A hypothesis is proposed for the statistical relationship between the two data sets, and this is compared as an alternative to an idealized null hypothesis of no relationship between the two data sets. Rejecting or disproving the null hypothesis is done using statistical tests that quantify the sense in which the null can be proven false, given the data that are used in the test. Working from a null hypothesis, two basic forms of error are recognized: Type I errors (the null hypothesis is falsely rejected, giving a "false positive") and Type II errors (the null hypothesis fails to be rejected and an actual relationship between populations is missed, giving a "false negative"). Multiple problems have come to be associated with this framework, ranging from obtaining a sufficient sample size to specifying an adequate null hypothesis. Measurement processes that generate statistical data are also subject to error. Many of these errors are classified as random (noise) or systematic (bias), but other types of errors (e.g., blunder, such as when an analyst reports incorrect units) can also occur. The presence of missing data or censoring may result in biased estimates, and specific techniques have been developed to address these problems.
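The two error types can be made concrete with a small simulation. The sketch below is one possible setup, not a standard recipe: it assumes a two-sided z-test with known standard deviation, and the function names (`z_test_rejects`, `rejection_rate`), the sample size, the effect size of 0.5, and the seed are all hypothetical choices made for illustration.

```python
import random
from statistics import NormalDist, mean


def z_test_rejects(sample, mu0, sigma, alpha=0.05):
    """Two-sided z-test with known sigma: True if H0 (mean == mu0) is rejected."""
    n = len(sample)
    z = (mean(sample) - mu0) / (sigma / n ** 0.5)
    critical = NormalDist().inv_cdf(1 - alpha / 2)  # about 1.96 for alpha = 0.05
    return abs(z) > critical


def rejection_rate(true_mu, mu0=0.0, sigma=1.0, n=30, trials=2000, seed=0):
    """Fraction of simulated samples for which H0 is rejected."""
    rng = random.Random(seed)
    rejections = 0
    for _ in range(trials):
        sample = [rng.gauss(true_mu, sigma) for _ in range(n)]
        if z_test_rejects(sample, mu0, sigma):
            rejections += 1
    return rejections / trials


# H0 is true (true mean equals mu0): every rejection is a Type I error.
type_i_rate = rejection_rate(true_mu=0.0)

# H0 is false (true mean is 0.5): every failure to reject is a Type II error.
power = rejection_rate(true_mu=0.5)
type_ii_rate = 1 - power

print(f"Type I rate: {type_i_rate:.3f}, Type II rate: {type_ii_rate:.3f}")
```

Under these assumptions the Type I rate should land near the chosen significance level of 0.05, while the Type II rate depends on the sample size and on how far the true mean is from the null value.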

View More On Wikipedia.org
  1. A

    Statistics - Distribution Function Technique

    Homework Statement (From Probability and Statistical Inference, Hogg and Tanis, Eighth Edition, 5.1-5) The p.d.f. of X is f(x) = \theta x^{\theta - 1} for 0<x<1 and 0<\theta<\infty. Let Y = -2\theta \ln X. How is Y distributed? Homework Equations Um... Fundamental Theorem of...
  2. N

    Intro Statistics combination/probability problems.

    Hello. Am I counting right with these problems? I can't remember exactly how they were worded, but I remember what they were asking... 1) Given 20 men and 20 women, how many groups of 12 can you make with 6 men and 6 women. I said 20.Combination.6 * 20.Combination.6. 2) given S= {1, 2...
  3. S

    Simulation, distribution, statistics.

    I'm wondering how to find out something about distribution functions after I've calculated the skewness. I have 15 numbers, mean is 3.206, highest is 7.028 and lowest is 0.9209. Variance is 2.958195. I calculated the skewness and got 0.945. I'm wondering which of the following distributions is...
  4. X

    Calculating Least Squares: What is r and how do I use it?

    How do I calculate least squares? The way it's described is confusing to me. \hat{β} = r * sy/sx. sx and sy are the standard deviations of x and y, correct? So let's say I have sets of numbers such as (16,19), (18,17) etc... 16 and 18 would be x and 19 and 17 would be y. So I have to calculate the...
  5. B

    Software to extract statistics (per month, per year) from a time series

    Hi. I work with rain (precipitation) time series, and would like to extract time statistics: - precipitations per month - precipitations per year - precipitations per month and year - precipitations per hour - precipitations per month and hour - mean precipitation, deviation, ... I...
  6. C

    How Do You Calculate the Range for Airline Bag Weights with 95% Confidence?

    Homework Statement Suppose that the weights of airline passenger bags are normally distributed with a mean of 48.14 pounds and a standard deviation of 3.71 pounds. Let X represent the weight of a randomly selected bag. For what value of c is P(E(X) - c < X < E(X) + c)=0.95? Give your answer...
  7. I

    CS, Math, Statistics where to go from here?

    I'm about to finish my lower division math requirements, which will cover everything for all my lower division coursework for all three subjects, bar 3 programming courses, one that can be taken next quarter, and the other 2 which can be taken concurrently in the beginning of next quarter. If I do...
  8. G

    Proving Sum of 2 Indep. Cauchy RVs is Cauchy

    Given the fact that X and Y are independent Cauchy random variables, I want to show that Z = X+Y is also a Cauchy random variable. I am given that X and Y are independent and identically distributed (both Cauchy), with density function f(x) = 1/(π(1+x^2)). I also use the fact the...
  9. G

    Expected Value and Standard Deviation of A1 Computer's Rebate

    A1 Computer has 15 tablets stocked, but four of them were actually defectives. A client bought two tablets from A1 Computer. If both of them are good, things are fine. If the client gets one defective machine, A1 Computer will replace it and give a $100 rebate to the client. If the clients...
  10. G

    Probability Statistics Question

    In 2007, 52% of all immigrants to Canada were females, 25% were under 18 years old, and 12% were females under 18 years old. a. Find the probability that a randomly selected person who immigrated to Canada in 2007 was a female and over 18 years old. b. Find the probability that a randomly...
  11. B

    Statistics: Question about mild and extreme outliers.

    I am studying statistics, and have noticed the definitions of the mild and extreme outliers. Mild outlier: Between 1.5q and 3q away from the nearest quartile, where q denotes the interquartile range. Extreme outlier: More than 3q...
  12. M

    What Is the Help Variable in Swedish Regression Analysis?

    Homework Statement I've two questions not directly relating to a problem but just stats in general. i) In Swedish what is called the "help variable" for a regression equation: y=u'*B where u' is the vector of explanatory variables (x1,...,xn) can apparently be written in many ways. One...
  13. K

    [Statistics] Conditional Probability questions?

    Homework Statement I've attached both the problems into one image to make life easier since problem 1 has a diagram and the other does not. Homework Equations Bayes' Theorem: P(B|A) = P(A|B)P(B) / [P(A|B)P(B) + P(A|B')P(B')], where B' is the complement of B. The Attempt at a Solution Well, for the first one I don't...
  14. N

    Intro Statistics/ Probability help?

    Homework Statement 1) A computer firm presently has bids out on three projects. Let Ai = {awarded project i} for i = 1,2,3. Suppose that P(A1) = 0.20, P(A2) = 0.25, P(A3) = 0.28, P(A1∩A2) = 0.10, P(A1∩A3) = 0.06, P(A2∩A3) = 0.08, and P(A1∩A2∩A3) = 0.01. Compute the following...
  15. T

    Statistics basic question: Probability distribution

    Homework Statement A random variable X has three, and only three, possible values, 0, 2 and 4, with the following probability distribution: P(X = 0) = 0.7, P(X = 2) = 0.1, P(X = 4) = 0.2. Let X1 and X2 be two independent random variables with this distribution. (a)...
  16. M

    News Is Karl Denninger's Correct Adjustment Method for Analyzing BLS Data Valid?

    BLS has released data on unemployment for the month of January. http://www.bls.gov/news.release/pdf/empsit.pdf What I want to ask about is a blog from Karl Denninger on market-ticker. http://market-ticker.org/akcs-www?singlepost=2858099 He states that he used the "correct adjustment"...
  17. K

    Can the standard deviation calculation be generalized for other statistics?

    I've calculated the mean difference of my (normally distributed) data set. The mean difference is defined as: Now, I'm trying to calculate the "mean difference deviation" in order to generate a confidence interval for this quantity ( "95% of the differences in the set are greater than...
  18. M

    Statistics, how do you translate Bin(n,p) to say N(,) etc

    Hey guys, some hopefully easy stats questions if you will. I'm wondering about translating one statistical distribution to another, like going from: Bin(n,p) to N(np,sqrt(npq)) where q=1-p or that Po(au) is roughly equal to N(xu,sqrt(xu)). I'm mostly sitting scratching my head on which...
  19. D

    Programs Statistics of PhDs produced in each field?

    Does anyone have sources on how many PhDs in each field are being produced each year? I.e., how many physics PhDs graduate each year, how many in mathematics, etc.
  20. N

    [statistics] find the efficient estimator: where is my reasoning wrong?

    Homework Statement Find an efficient estimator for q(\lambda) := e^{-\lambda} in a Poisson model. (note: the efficient estimator is the one that attains the Cramér-Rao lower bound) Homework Equations The Cramér-Rao lower bound: Let T be an unbiased estimator for q (as defined above), then...
  21. alemsalem

    What are the natural generalizations (if any) to Bose and Fermi statistics?

    Fermions: 1 particle per state. Bosons: unlimited number of particles per state. Do people consider things in between, like states with a capacity n? Are there other generalizations of these statistics? Thanks!
  22. J

    Basic Statistics question: which test to run?

    Hi everyone! Lately I have been trying to improve my typing speed, and have been playing a game called Type Race, where you type various short passages (the passages are selected at random from a text bank) and your score in WPM is recorded. What I want to do is determine whether or not my...
  23. J

    Statistics Help Bell shaped distribution

    Suppose that IQ scores have a bell shaped distribution with a mean of 104 and a standard deviation of 14. Using the empirical rule what percentage of IQ scores are less than 76? Please do not round your answer. So far this is what I have: 76 - 104 = -28, and -28/14 = -2. *How do I...
  24. D

    Integrals of Expected Value For Normal Order Statistics

    1. Find the expected value of the largest order statistic in a random sample of size 4 from the standard normal distribution. Homework Equations E(X(4,4)) = 4∫ x f(x) (F(x))^3 dx (from minus infinity to plus infinity), where f(x) is the...
  25. I

    What is the difference between statistics and probability?

    Hello, I've just begun my first probability class, and let me tell you, it's a doozy. It reminds me a lot of my physics classes: There's a general sort of way to go about solving a problem, but each problem is completely different from the other. If this was the entire major, I'd stop here...
  26. P

    Probability vs Statistics for CS

    So I have the option of either taking a year long sequence in probability theory or a year long sequence in mathematical statistics. Both require real analysis so both will be at the 'measure theoretic' level. I'm interested in these classes as they relate to computer science and specifically...
  27. J

    Extreme value theory and limiting distributions for i.i.d. order statistics

    (This question was previously posted to sci.math.research. I only received one reply; sadly the advice therein conflicted with section 9.1 of H.A. David's "Order Statistics" - and probably with the fact that there was such a field of study as "r-extreme order statistics" - hence my reposting it...
  28. G

    Statistics: Null hypothesis/chi-squared question.

    Homework Statement http://img687.imageshack.us/img687/7963/screenshot20120102at185.png Homework Equations The Attempt at a Solution I understand how to compute a Chi-squared test, but I'm a bit confused about the wording of parts a) and c). Could someone possibly simplify it for...
  29. J

    Would you say that these salary by degree statistics are accurate?

    http://www.payscale.com/best-colleges/degrees.asp I know it's rude to ask people how much they make, but if you guys don't mind, could you at least tell me if these numbers are realistic, compared to what you've seen? This...
  30. kaniello

    Particle Statistics: Explaining Klimontovich's Formulas and Logic

    Hello, I posted this in General Math, and I decided to post it here also because this room seems more appropriate. The formulas and part of the text are quoted from "Klimontovich - Statistical theory of non-equilibrium processes in a plasma": Let N_{a}(\textbf{x},t)...
  31. L

    Statistics Z Score (I think) Help?

    Homework Statement So my HW is 27. Assume the heights of high school basketball players are normally distributed. For boys the mean is 74 inches with a standard deviation of 4.5 inches, while girl players have a mean height of 70 inches and standard deviation 3 inches. At a mixed 2-on-2...
  32. X

    Great formulary for statistics?

    I'm looking for a great formulary for statistics. Maybe along with a quick explanation for each formula... there are so many for different scenarios that I find it confusing. Like for: Conditional probability and independence, random variables and expectations, binomial and related...
  33. Z

    Evidence for fermion statistics among neutrinos

    Is there any evidence for quantum Fermi-Dirac distributions among neutrinos, besides the obvious fact about their spin? I was wondering how the Pauli exclusion principle would work with a neutrino 'gas', and what kind of quantum numbers they could have. It has been expected that if we ever did...
  34. I

    Statistics: Multivariate Dist/Functions of RVs

    Homework Statement 1) A point (X1, X2, X3) is chosen at random. X1, X2 and X3 are uniformly distributed on the interval [0,1]. Determine P[(X1-.5)^2 + (X2-.5)^2 + (X3-.5)^2 ≤ .25]. 2) X and Y are random variables with the joint pdf: f(x,y) = 2(x + y) for 0≤x≤y≤1, 0 otherwise. Find the pdf of Z = X + Y...
  35. C

    Regression help (basic statistics)

    Homework Statement Hello, I have some data from a statewide standardized exam, and I am trying to do a regression model, but am having a bit of trouble. (I'm a mathematician and not a statistician). Basically, I am trying to show some type of correlation between race and test scores...
  36. R

    Statistics of sinc function with normal argument

    Hi, Can anyone point me to the pdf, cdf, moments, etc of y = sinc(x) = sin(pi*x)/(pi*x) where x ~ N(0,s^2)? Thanks!
  37. P

    Statistics - Expected Value

    Hi, I have to work with Expected Values and I am extremely confused over the following: In the part of my book that teaches me about Probability Distribution, in order to calculate the Expected Value I have to: Let's say we toss a coin twice. We can get 0 Heads, 1 Heads or 2 Heads. I then draw...
  38. L

    What does non-parametric (statistics) actually mean?

    I see this all the time, but I just want a simple explanation of what parametric and nonparametric statistics mean! Thanks so much.
  39. Vanadium 50

    News How Accurately Do Income and Wealth Statistics Reflect Reality?

    I've read a lot of posts, and think it might be helpful to point out some facts that I think would help clarify people's arguments. 1. Income is not wealth. Using one as a proxy for the other is like using velocity as a proxy for position. A small disparity in income, acting over time...
  40. C

    Statistics: Boxplot and confidence interval

    Homework Statement I'm doing a past paper for my statistics course, and there's no provided solutions. I want to know if my attempt is correct. This is part of the paper: Homework Equations N/A The Attempt at a Solution i) Q0 = 1.06, Q1 = 1.335, Q2 = 2.04, Q3 = 2.275, Q4 = 2.64 ii) IQR = Q3...
  41. K

    Statistics - 95% Confidence Level

    Homework Statement Two different types of injection–moulding machines are used to form plastic parts. A part is considered defective if it has excessive shrinkage or is discoloured. Two random samples, each of size 300, are selected and 15 defective parts are found in the sample from machine...
  42. X

    Statistics - bivariate density calculations

    Homework Statement Consider the bivariate density f(x,y)=c(x+y) for 0<=x<1, 0<=y<1 a) Obtain the appropriate normalization constant c. b) Obtain the marginal densities for X and Y, and calculate their means and variances. c) Obtain the covariance between X and Y, and check whether the...
  43. S

    Statistics: Proofs and Problems for Random Variables and their Distributions

    Homework Statement Before I get started here I have one really quick basic question: Let's say I want the probability that an engine survives two hours, and that the probability an engine will fail in any given hour is .02. Then I can get 1 - .02 - .98(.02) = .9604. This is found by a geometric...
  44. T

    Tchebysheff's Theorem questions (statistics)

    I have two different problems involving Tchebysheff's Theorem. Hopefully there isn't a rule about asking two different questions in one post. Number 1 Homework Statement The US mint produces dimes with an average diameter of .5 inch and a standard deviation of .01. Using Tchebysheff's...
  45. M

    Statistics: sample mean of normal distribution

    Homework Statement The diameter of a shaft in an optical storage drive is normally distributed N(μ, σ^2). The drive specifies that the shaft be 0.2500 ± 0.0015 in. Suppose μ = 0.2508 in and σ = 0.0005 in. What fraction of shafts conform to the design specifications? The Attempt at a...
  46. S

    Use that relationship to answer the question.

    Homework Statement An airline flies the same route at the same time each day. The flight time varies according to a Normal distribution with unknown mean and standard deviation. On 15% of days, the flight takes more than an hour. On 3% of days, the flight lasts 75 minutes or more. Use this...
  47. L

    Statistics PDF and change of variable

    If the probability density of X is given by f(x) = 2(1 − x) for 0 ≤ x ≤ 1 and 0 otherwise, (a) find the probability density function of Y1 = 2X − 1. I do not know how to start this problem. Can someone please help? Is there a formula that I am missing from my notes to solve this problem?
  48. X

    Statistics question Continuous Random Variables

    Homework Statement 1) Let X have the p.d.f f(x) = 3(1-x)^2, 0≤x<1. Compute: a) P(0.1 < X < 0.5) etc... 2) Find the mean and variance, and determine the 90th percentile, of each of the distributions given by the following densities: a) f(x) = 2x, 0≤0<0 etc.. 3) Find the 50th...
  49. S

    Statistics Gamma Distribution question

    Homework Statement Let X have a gamma distribution with parameters α and β. Show that P(X ≥ 2αβ) ≤ (2/e)2 Homework Equations f(x) = pdf of a Gamma The Attempt at a Solution I began by solving for P(X ≥ 2αβ) by doing ∫ f(x) dx from 2αβ to ∞. I set y=x/β for substitution, and I...
  50. J

    Multiple regression analysis, econometrics, and statistics

    I am sooo lost in this class, please help. 1. Let the true (population) model be y = B0+B1x1+B2x2+u where u is an unobserved error term with u (conditional) x1, x2 and N(0, sigma^2). Hence, u is normally distributed with mean 0 and variance sigma^2 (i.e., E[u (conditional) x1, x2] = 0 and V...