What is Statistics: Definition and 998 Discussions

Statistics is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industrial, or social problem, it is conventional to begin with a statistical population or a statistical model to be studied. Populations can be diverse groups of people or objects such as "all people living in a country" or "every atom composing a crystal". Statistics deals with every aspect of data, including the planning of data collection in terms of the design of surveys and experiments.When census data cannot be collected, statisticians collect data by developing specific experiment designs and survey samples. Representative sampling assures that inferences and conclusions can reasonably extend from the sample to the population as a whole. An experimental study involves taking measurements of the system under study, manipulating the system, and then taking additional measurements using the same procedure to determine if the manipulation has modified the values of the measurements. In contrast, an observational study does not involve experimental manipulation.
Two main statistical methods are used in data analysis: descriptive statistics, which summarize data from a sample using indexes such as the mean or standard deviation, and inferential statistics, which draw conclusions from data that are subject to random variation (e.g., observational errors, sampling variation). Descriptive statistics are most often concerned with two sets of properties of a distribution (sample or population): central tendency (or location) seeks to characterize the distribution's central or typical value, while dispersion (or variability) characterizes the extent to which members of the distribution depart from its center and each other. Inferences on mathematical statistics are made under the framework of probability theory, which deals with the analysis of random phenomena.
A standard statistical procedure involves the collection of data leading to test of the relationship between two statistical data sets, or a data set and synthetic data drawn from an idealized model. A hypothesis is proposed for the statistical relationship between the two data sets, and this is compared as an alternative to an idealized null hypothesis of no relationship between two data sets. Rejecting or disproving the null hypothesis is done using statistical tests that quantify the sense in which the null can be proven false, given the data that are used in the test. Working from a null hypothesis, two basic forms of error are recognized: Type I errors (null hypothesis is falsely rejected giving a "false positive") and Type II errors (null hypothesis fails to be rejected and an actual relationship between populations is missed giving a "false negative"). Multiple problems have come to be associated with this framework, ranging from obtaining a sufficient sample size to specifying an adequate null hypothesis. Measurement processes that generate statistical data are also subject to error. Many of these errors are classified as random (noise) or systematic (bias), but other types of errors (e.g., blunder, such as when an analyst reports incorrect units) can also occur. The presence of missing data or censoring may result in biased estimates and specific techniques have been developed to address these problems.

View More On Wikipedia.org
  1. S

    [Statistics] Independent vs. Dependent Variables

    Hi everyone I have a quick question about independent and dependent variables. Homework Statement The following data set gives the number of miles traveled, and the travel time in hours for each of the 10 car's driving assignments. Miles Time 90 9 40 5 90 9 90...
  2. E

    Courses Taking Statistics vs Pure Math for Experimental Physics

    Currently I am studying a double major in Math and Physics, hopefully leading towards experimental physics, however come next semester I will have to pick my math "specialisations" from two of applied math (mostly chaos and optimisation etc.), pure math(mostly abstract algebra and analysis) and...
  3. Andreas C

    B A whole bunch of dice -- Statistics riddle

    Following the "enormous success" (it received a single like) of the last riddle, I decided to waste some bandwidth with a sequel! This one doesn't have an interesting concept or an unexpected solution, it's just hard to solve, so if you're looking for such a riddle, do yourself a favor and...
  4. Ackbach

    MHB Extremely Important Article on Statistics

    I came across this article, called "Ten Simple Rules for Effective Statistical Practice", and thought it was monumental in its importance for understanding statistics and using it practically, particularly in science. I hope you enjoy it!
  5. Greg Bernhardt

    Teaching statistics before calculus?

    Here is a short (and old) TED talk where a mathematics professor suggests we teach stats and probability in depth before teaching Calculus because it's math that is more relevant to a wider range of people. Have we got our math curriculum wrong? Thoughts?
  6. Ma Xie Er

    A Why Is the Equality in This Spectral Analysis Proof Correct?

    I'm reading "Time Series Analysis and Its Applications with R examples", 3rd edition, by Shumway and Stoffer, and I don't really understand a proof. This is not for homework, just my own edification. It goes like this: Σt=1n cos2(2πtj/n) = ¼ ∑t=1n (e2πitj/n - e2πitj/n)2 = ¼∑t=1ne4πtj/n + 1 + 1...
  7. J

    I Mandel-Q Parameter and Phonon Statistics

    Hi, I'm struggling to understand how the Mandel-Q parameter (MQ) can be used to evaluate the quantum dynamics of a single trapped ion. A trapped ion has a quantised degree of motional freedom so can be discussed in terms of the phonon. Im studying the dynamics of a trapped ion which is subject...
  8. micromass

    Challenge Micromass' big statistics challenge

    If we're having a thread about probability theory, then we must have one on statistics too! The following questions are all very open-ended and thus multiple answers may seem possible. Your goal is to find a strategy to find the answer to the questions. Furthermore, you must provide some kind of...
  9. T

    I How do you implement the Dickey-Fuller test?

    Hi there, I've recently start learning methods for determining whether or not time series are stationary. The first method I'm trying to learn is the 'Dickey-Fuller Test'. This test uses a time series modeled by an AR(1) process. The key is to find whether or not this process contains a unit...
  10. Z

    A Question about a particular paper on categorical data

    I am not sure this is the right forum for this -- I have a question about a particular paper: http://www-users.cs.umn.edu/~sboriah/PDFs/ChandolaCBK2009.pdf The authors describe 4 heuristics that can be derived from categorical data -- this is in order to map categorical data to numerical...
  11. E

    A Information contained in minimum value of truncated distribution

    Suppose that a given population is endowed with a pair of characteristics T and K. Let's think of these characteristics as random variables (T,K)∼BiNormal((μT,μS),(σT,σS),ρ) I observe the realisations of T for a sample consisting of those individuals with K<a, where the selection threshold a...
  12. dotsero

    Frequency Distribution Width & 'Rounding Up'

    Homework Statement I'm having trouble understanding setting up a frequency distribution. I am confident I am doing it right, but the book I'm using differs when calculating width. The problem gives a bunch of numbers representing the number of counties, divisions, or parishes for each of the...
  13. thegreengineer

    Binomial distribution problem

    Right now I'm having a problem with a statistics problem. More specifically with a binomial distribution problem. The problem says: There is a family composed by 8 children. Calculate the probability that 3 of them are girls As far as I know, binomial distribution formula says...
  14. CynicusRex

    Studying Guidelines to studying linear algebra and statistics.

    I'll try to be concise. I've been out of math for years and never truly learned to understand it. Until now. I want to put the growth mindset theory to the test and see if I can handle physics (or any STEM field) on a university difficulty. To verify if I'm up to it and even have the slightest...
  15. Mathman2013

    Statistics and Maple: Constructing a Team of 4 from 3 Groups"

    Homework Statement We have three groups, group 1 contains 21 people, group two contains 18 people and group 3 contain 50 people. First we need to construct a team of 4 people of three groups. How many ways can such a team be constructed? I use combinate such that it will be calculated in...
  16. chikou24i

    A Where Can I Find the Published Paper on Bose-Einstein Statistics by Einstein?

    I'm searching for Bose's paper/article that was published by Einstein. Can someone give me direct link to it ?
  17. Ryaners

    (Statistics) Blackbody spectrum in terms of wavelength?

    This is a question about transforming a probability distribution, using the blackbody spectrum as an example. Homework Statement An opaque, non-reflective body in thermal equilibrium emits blackbody radiation. The spectrum of this radiation is governed by B(f) = af3 / (ebf−1) , where a and b...
  18. kostoglotov

    Heading power side,do a unit in stats or computation?

    1st year, heading power side, should I do a unit in stats or "computational explorations"? My 2nd semester must needs consist of a foundations of EE unit, a foundations of ME unit, I've picked a unit for building IT systems (since a lot of power side these days involves SCADA and smart grids...
  19. A

    I Degeneracy in quantum statistics

    degeneracy,this word appears in my textbook many times,but i could not understand what it means in quantum statistics.also in my textbook it is said in bose-einstein statistics that " the deviation from perfect gas behaviour exhibited by bose-einstein gas is called gas degeneracy".but i can't...
  20. P

    MHB Help with Probability, Roulette, and Population Statistics

    Please help me, I will really appreciate it! 1) You are dealing a standard hand of 5 card stud, 1 card face down, 4 face up. Ignoring the cards dealt to all other players what is the probability of you drawing a 9, a heart, and a 9 of hearts assuming you have the 6, 7, 8 and 10 of hearts...
  21. J

    A Stats: would the sum of the variances be 1 in this case?

    Often in empirical studies you see statements that factor X explains some fraction of the variance in some other variable V, and thinking about what this means intuitively made me curious about the following question. Suppose you have a model where the values of some set of factors X1, X2, ...
  22. eskimotaro

    [University Introductory Statistics] DNA crime scene

    Hello everyone. I have been given a problem in my Introductory Mathematical Statistics class. Been thinking about this one for a while and I am simply stuck. 1. Homework Statement "There has been found a DNA of type S on a crime scene. We will assume a total population of N = 5000000 that are...
  23. SrVishi

    Prob/Stats Which Advanced Math Courses Prepare a Math Major for Statistics?

    Hello, I am looking to learn statistics. What would be a good rigorous statistics book for someone who is completely comfortable with proofs?
  24. M

    MHB Therefore, Kenny scores 31 points in the last test.

    After the fourth test, Kenny's average mark rises by 5 points, but after the fifth test, it drops by 9 points. If his total score in the last two tests is 122 points, How many points does he score in the last test?
  25. K

    MHB How Do You Calculate Binomial Probabilities for Survey Representation?

    Hello, I was hoping someone could help explain how to do this problem. I have been stuck on it for a while now. I know that you have to use a binomial with n=100, then n=1000 but I'm not sure how to set it up to solve for a range from 8-12%. Thanks! Any advice is appreciated. Also, for people...
  26. K

    MHB Statistics Question Probability

    Hello, I was hoping someone could help explain how to do this problem. I have been stuck on it for a while now. Thanks! Any advice is appreciated. Also, for people just out to block questions, I AM NOT ASKING FOR THE ANSWER - I AM NOT TRYING TO CHEAT. I just would like help. A hospital receives...
  27. D

    Rutherford Backscattering Spectroscopy, Counting Statistics

    Homework Statement I am given a numerical example (to be solved with pen, paper and calculator only) of an RBS spectrum of a AuAgCu-alloy on a glass-substrate. The question is "Can you get the composition? How accurate is the result?Homework Equations All the Rutherford Backscattering...
  28. Duncan R

    What are the interests and aspirations of a Cape Town math student?

    Hi there, I'm one of those rare undergrads happy with the idea of being a statistician, although I originally planned to focus on genetics. My mathematical ability and understanding is only at 1st year level, but it improves by the day! I am particularly interested in learning Mathematica and...
  29. samjohnny

    Rubber Band Boltzmann Statistics

    Homework Statement Homework Equations $$ Z(1) = \sum_{i=1}^{} e^{\frac{E_i}{K_bT}} $$ where ##E_i## is each of the possible energy states available to a single link (in this case the right and the left states). $$ P=\frac{\sum_{i=1}^{} e^{\frac{E_i}{K_bT}}}{Z} $$ The Attempt at a Solution...
  30. W

    Statistics of Voter Search by Candidates: Logistic Regressio

    Hi All, Just curious about the techniques used by current political candidates to determine potential voters. I can think of many-variable logistic regression, say, age, years of education, goes to church at least once a month (yes/no), etc. Do they use anything else? Thanks.
  31. O

    Do people who tend to be great at math suck at statistics?

    In my opinion, math is an exact science, while statistics is an art. Math: perfect Stats: estimation In my opinion, there has to be a difference between the population of mathematicians and statisticians. Mod note: Removed the poll, which isn't allowed in the technical sections.
  32. V

    On statistics of index what is Zoj

    Hi everybody, I have a statistical problem in interpreting the index The variables for different indicators are taken from different population distributions and they might be recorded in different units of measurement. The values of these indicators are not quite suitable for combined...
  33. F

    How Does Sampling Strategy Impact Measurement Accuracy in Statistics?

    Hello Forum, I am taking a lab and we are learning about measurement and uncertainty. Suppose we have to measure the length L of an object. Once the data has been collected we can calculate the mean (average) and the standard deviation s. The resulting measurement would be expressed as [ mean...
  34. R

    What is the Total Uncertainty in Measuring Force with a 0-10 lbf Load Cell?

    Say I have a 0-10 lbf load cell that can measure the force it takes to lift an object. The load cell is accurate to 1% of the full scale. I take 5 measurements and get the following readings: 5.2, 5.1, 4.9, 5.0, & 4.8, all in lbf. Now I am asked to give the mean with the associated...
  35. M

    How do I calculate the value of K using Tchebycheff's theorem for this data?

    Homework Statement Hello all, I have a one number stat: Xi 2 4 5 6 6 8 10 Tchebycheff : It ask me to find the value of K. Interval of confidence: 89% I know 1-1/k2 I have no idea how to calculate the value of K. Do I have to somehow use the interval of confidence? I checked my book and...
  36. B

    What is Excess Kurtosis and Why is it Important in Financial Analysis?

    I heard a guy mention in a debate that some math calculation didn't obey Gaussian statistics. It was a debate re: the economy (not important here, though). I was curious what was meant by "Gaussian statistics" and would appreciate if anyone could offer a sort of layman's definition. Thanks...
  37. S

    Statistics and Data Science applied to Physics

    I wasn't sure where to post this, but I figured this would be a topic under General Physics. I am aware that the next generation of observations, ranging from cosmological observations to post-LHC particle physics experiments, will produce overwhelmingly large and complex datasets, far larger...
  38. L

    The statistics of Deal or No Deal?

    Hi, I have a doubt on the statistics of this game, maybe more philosophical than mathematical. I assume most people are familiar with the game. Suppose you categorised the prizes into 'good' and 'bad'. The contestant plays a very lucky game, and halfway through the game is left with no bad...
  39. 24forChromium

    How to interpret the Pearson Correlation Index?

    Not sure if that's the technical name, but I refer the the number Excel give you between 0 and 1 when you use the "correl" command on two sets of numbers.
  40. Tone L

    Where Can I Find Physical Science Data Sets for Quick Statistical Analysis?

    Hello... I have a quick assignment for my statistics course, just calculate the mean, standard deviation etc.. We have our choice to calculate these statistics on any data set, anyone know of some cool physical science data, that would be quick to manipulate.. ASCII txt file please :D
  41. little neutrino

    France Unemployment Rate: MCQ Homework

    Homework Statement One month the actual unemployment rate in France was 13.4%. If during that month you took a SRS of 100 Frenchmen and constructed a confidence interval estimate of the unemployment rate, which of the following would have been true? A) The center of the interval was 13.4. B)...
  42. ElijahRockers

    Neurologist: What P-values should I be expecting?

    Inexperienced data analyst here with a real-world example, I have attached a zip-file with screenshots and p-values of the following data. The "reference regions" are Cerebellum White, Cerebellum Gray, and Temporal Cortex. The top-most graphs depict the curves in the indicated region for young...
  43. Aaron young

    Reworking of the Drude model using scattering statistics

    Homework Statement The problem I have been set is to rework the Drude model using clearly defined scattering statistics. Homework Equations The Drude model as we have been given it is in terms of momentum \vec{p}(t+dt)=(1-\frac{dt}{\tau})(\vec{p}(t)-q\vec{E}(t)dt)+(\frac{dt}{\tau})(0) Where...
  44. W

    I think this is about the Central Limit Theorem

    Homework Statement An engineer is measuring a quantity q. It is assumed that there is a random error in each measurement, so the engineer will take n measurements and reports the average of the measurements as the estimated value of q. Specifically, if Yi is the value that is obtained in the...
  45. W

    Cantelli's Inequality and Chebyshev's Inequality

    Homework Statement The number of customers visiting a store during a day is a random variable with mean EX=100and variance Var(X)=225. Using Chebyshev's inequality, find an upper bound for having more than 120 or less than 80customers in a day. That is, find an upper bound on P(X≤80 or X≥120)...
  46. W

    Markov's Inequality for Geometric Distribution.

    Homework Statement Let X∼Geometric(p). Using Markov's inequality find an upper bound for P(X≥a), for a positive integer a. Compare the upper bound with the real value of P(X≥a). Then, using Chebyshev's inequality, find an upper bound for P(|X - EX| ≥ b). Homework Equations P(X≥a) ≤ Ex / a...
  47. T

    Statistics expected values problem

    Homework Statement Let X1 and X2 be independent normal random variables, distributed as N(μ1,σ^2) and N(μ2,σ^2), respectively. Consider a random variable U = 2X1 − X2. (a) Find the mean of U. (b) Find the variance of U. (c) Find the distribution of U. The Attempt at a Solution a) E(U) =...
  48. little neutrino

    Finding k from Moment Generating Function at t=0

    Homework Statement If M[X(t)] = k (2 + 3e^t)^4 , what is the value of k Homework Equations M[X(t)] = integral ( e^tx * f(x) )dx if X is continuous The Attempt at a Solution I tried differentiating both sides to find f(x), but since it is a definite integral from negative infinity to infinity...
  49. little neutrino

    Calculating Probability of 3 Pennies in 30 Boxes Using Poisson Distribution

    One hundred pennies are being distributed independently and at random into 30 boxes, labeled 1, 2, ..., 30. What is the probability that there are exactly 3 pennies in box number 1? I tried using a Poisson distribution f(x) = (e^-λ)*(λ^x)/x! , with λ = 100/30 = 10/3 and x = 3. I got 0.22021 (5...
  50. W

    Conditional Expectation of Multiple Independent Random Varia

    Homework Statement Given X,Y,Z are 3 N(1,1) random variables, (1) Find E[ XY | Y + Z = 1] Homework EquationsThe Attempt at a Solution I'm honestly completely lost in statistics... I didn't quite grasp the intuitive aspect of expectation because my professor lives in the numbers side and...
Back
Top