What is Stats: Definition and 248 Discussions

Statistics is the discipline that concerns the collection, organization, analysis, interpretation, and presentation of data. In applying statistics to a scientific, industrial, or social problem, it is conventional to begin with a statistical population or a statistical model to be studied. Populations can be diverse groups of people or objects such as "all people living in a country" or "every atom composing a crystal". Statistics deals with every aspect of data, including the planning of data collection in terms of the design of surveys and experiments.When census data cannot be collected, statisticians collect data by developing specific experiment designs and survey samples. Representative sampling assures that inferences and conclusions can reasonably extend from the sample to the population as a whole. An experimental study involves taking measurements of the system under study, manipulating the system, and then taking additional measurements using the same procedure to determine if the manipulation has modified the values of the measurements. In contrast, an observational study does not involve experimental manipulation.
Two main statistical methods are used in data analysis: descriptive statistics, which summarize data from a sample using indexes such as the mean or standard deviation, and inferential statistics, which draw conclusions from data that are subject to random variation (e.g., observational errors, sampling variation). Descriptive statistics are most often concerned with two sets of properties of a distribution (sample or population): central tendency (or location) seeks to characterize the distribution's central or typical value, while dispersion (or variability) characterizes the extent to which members of the distribution depart from its center and each other. Inferences on mathematical statistics are made under the framework of probability theory, which deals with the analysis of random phenomena.
A standard statistical procedure involves the collection of data leading to test of the relationship between two statistical data sets, or a data set and synthetic data drawn from an idealized model. A hypothesis is proposed for the statistical relationship between the two data sets, and this is compared as an alternative to an idealized null hypothesis of no relationship between two data sets. Rejecting or disproving the null hypothesis is done using statistical tests that quantify the sense in which the null can be proven false, given the data that are used in the test. Working from a null hypothesis, two basic forms of error are recognized: Type I errors (null hypothesis is falsely rejected giving a "false positive") and Type II errors (null hypothesis fails to be rejected and an actual relationship between populations is missed giving a "false negative"). Multiple problems have come to be associated with this framework, ranging from obtaining a sufficient sample size to specifying an adequate null hypothesis. Measurement processes that generate statistical data are also subject to error. Many of these errors are classified as random (noise) or systematic (bias), but other types of errors (e.g., blunder, such as when an analyst reports incorrect units) can also occur. The presence of missing data or censoring may result in biased estimates and specific techniques have been developed to address these problems.

View More On Wikipedia.org
  1. W

    Deciphering Asteroid Stats to Divert an Impact on Earth

    I'm currently in grade 12 physics. For my summative project in the class (which is worth a large part of my final mark) I have to think of three separate ways to divert a large asteroid away from hitting earth. We were each given a real object and stats on the object. My first problem is that...
  2. V

    Can You Self-Learn Bayesian Statistics and R Programming?

    There isn't a course at my university for Bayesian stats, but I was wondering if it would be possible to teach myself. Does anyone know of any good intro texts or online resources for leaning Bayesian Statistics? Also I want to learn to use R some over the break. Does anyone know of any good...
  3. K

    Stats seating arrangement problem

    How many ways can five people, A, B, C, D, E, sit in a row at a movie theater if D & E will not sit next to each other? If everyone "would" sit next to each other, then it'd just be 5! or 120. However, without actually drawing out a picture, I'm not sure exactly how to work the problem with...
  4. B

    Calculating Probability for a Discrete Random Variable

    Homework Statement If X is a discrete random variable with p.d.f. f(x) = {cx if x=1,2,3,4 ; 0 otherwise} where c is a constant. Find P(1< X =< 3.5) Homework Equations I solved for c: f(x) = 1 = c(1+2+3+4) 10c = 1 c = 1/10 The Attempt at a Solution I have the following solution...
  5. Z

    Solve 4-Digit Numbers w/ Same Digit Repeated Twice & Thrice

    I have no experience in stats and I was wondering if this problem can be solved using stats instead of creating a java program. Can someone please help me. I am trying to find out how many 4 digits numbers exist that have the same digit repeated in that number twice. And how many 4 digits...
  6. Q

    Puzzling Prob Stats / Bayes problem

    Hello All, I have a problem that goes like this:I have one fair coin and one double heads coin. I pick one at random and flip the same coin three times. All three times it comes up heads. What is the probability that this same coin will come up heads a fourth time? I said that the prob...
  7. P

    Stats unknown variance hypothesis testing

    Homework Statement A machine produces metal rods used in an automobile suspension system. A random sample of 12 rods is selected and the diameter is measred.The sample mean is 8.28. and the significance level is 0.05. Is there strong evidence to indicate that mean rod diameter is not 8.20 mm...
  8. P

    Calculating P-Values for H_0: \mu =\mu_0

    Homework Statement Suppose that we are testing H_0: \mu =\mu_0 versus H_1: \mu \neq\mu_0. Calculate the P-value for the following observed values of the test statistic. a: z_0 =2.45 e: z_0=-0.25 Homework Equations none The Attempt at a Solution I got part a by using the...
  9. Saladsamurai

    How do I determine the class size for creating a histogram in statistics?

    I have a quick question regarding a stats problem in my girlfriend's text. It is pretty easy I suppose, but I am not quite sure; the text does not appear to give a "general form" of how to obtain the info. Feel free to correct me if I misuse words here, I am not familiar with the stats lingo...
  10. P

    How do these statistics courses compare to other schools?

    It's time for me to start deciding if I should head the applied mathematics route or statistics route. I know the applied mathematics program at my school is is on par with others but I'm not sure about the statistics program. I do not know much about other school's statistics programs, so I...
  11. D

    Covariance between data stats problem

    Hi. At the moment in class we are going over statistics Anyway, the formula I've been using for covariance between two sets of data is: s_{xy} = \frac{1}{n}\sum\limits_{i = 1}^n {x_i y_i } - \overline x \overline y Now, if i was to get a question such as: "If all the...
  12. R

    Searching for Calculus-Based Physics, Stats & Advanced Calc Books

    I'll get right to it. I've been looking for a Calculus based Physics textbook to self-study over summer, so far I've already taken an algebraic based AP Physics course (this book: https://www.amazon.com/dp/0136119719/?tag=pfamazon01-20 But I didn't like Giancoli's book a whole lot, it...
  13. I

    Why is there a b in the equation for mean?

    [Solved]Stats question regarding mean Homework Statement Yo.Exam marks,X've mean 70 and standard deviation 8.7.The marks need to be scaled using the formula Y=aX+b so that the scaled marks,Y've mean 55 and standard deviation 6.96.Find the values of a and b.*From the answer sheet -to form...
  14. L

    Help Visualisation and Stats for a Determinist

    Hello, I am a new user here. Before I go any further I would like to say "hi" to all members. Now on with the post (and I hope I have chosen the right forum). I am a bit of a philospoher and am a "determinist" ie. believe that my actions and thoughts etc are basically a product of the forces...
  15. D

    Mutually exclusive stats homework

    Homework Statement Suppose that P(A) = 0.8 and P(A or B) = 0.9 , determine P(B) If a) A and B are independent b) A and B are mutually exclusive Homework Equations for Independence P(A and B) = P(A) x P(B) for Mutually Exclusive P(A or B) = P(A) + P(B) The Attempt at a Solution...
  16. marcus

    ArXiv Research Output Trends 2006 - Stats Visualization

    http://arxiv.org/Stats/hcamonthly.html the arxiv stats graph research output trends by field. they just updated for yearend 2006
  17. H

    Qualitative Stats question (central limit theorem)

    3) Which of the following are consequences of the Central Limit Theorem? I) A SRS of resale house prices for 100 randomly selected transactions from all sale transactions in 2001 (in Toronto) will be obtained. Since the sample is large, we should expect the histogram for the sample to be...
  18. N

    Calculate P(x > 2): Binomial Distribution

    Suppose x is a discrete, binomial random variable Calculate P(x > 2), given trails n = 8, success probability p = 0.3 and given trails n = 6, success probability p = 0.1
  19. S

    Calculating Probability for Mean Time Between Events

    Ok, This problem sounds really easy, but I think I am doing something wrong. Question : If the mean time between some random event occurring is 6 months, what is the probablity that in one year the event does not happen. I think its like flipping a coin. There is a 0.5 chance of the...
  20. russ_watters

    Discover the Latest Airliner Autoland Statistics | Uncover the Truth Now!

    Does anyone have any stats on how often airliners land themselves? I googled a little and couldn't find any.
  21. R

    How Do You Calculate the Probability of Getting at Most One Brown M&M?

    Okay so I did this problem and got it wrong but I get one more chance to get it right. I tried using Binomial Dist to solve it but I failed. 30% of all M&Ms are brown. If 7 M&Ms are randomly selected, what is the probability that at most 1 is brown? I thought I would use 0 and 1 but I...
  22. P

    Solving Forearm Length Stats Problem

    Hi Everyone, I am new to the forum and really need so help before I have a test tomorrow. I have been going over my book and working through the problems but I am stuck on this problem. Would someone please help me? thank you very much for your time From a study average height of men...
  23. F

    Are There Unsolved Problems in Statistics?

    it seems like all the major problems in math/stats are only in math. why is that? is it because stats is a relatively new field of study (got started ~50yrs ago i think by florence nightingale?), not counting gauss' central limit theorem? or is it because statisticians only work with data...
  24. mattmns

    Stats - Independence (circuits)

    Hello, my book has this question, and no examples (very) similar to it, so I am wondering if I did it correct :smile: --------- The following circuit operates if and only if there is a path of functional devices from left to right. The probability that deach device functions is as shown...
  25. M

    1st year stats, empirical rule- range of values

    Hello I am taking 1st year stats at university, and I have lab questions I am supposed to answer. I am VERY confused. My data set is: 53, 33, 25, 63,26, 64, 32, 21, 45, 64, 38 I calculated the mean:42.182 the sample variance:272.147 The range:43 The percentile rank of the data value 45...
  26. I

    Help With Stats Concept, Please?

    Hello everyone, I just started taking Stats and I have a question that the book doesn't explain really well. I was hoping someone could explain how an answer was achieved. The question is about trimmed means -- not the means, I can do those, but specifically how you figure the percentages...
  27. marcus

    Arxiv stats for 2005 are ready

    http://arxiv.org/Stats/ they plot the average monthly posting rate for preprints in several categories: hep, astro, mathphys, condensedmatter the solid bar is the actual posting in that category, the blank part is crossposting from other categories so looking at the solid blue, for...
  28. D

    Probability of Meeting Italian Who Speaks English in Italy

    1 out of 5 italians speak english. 1 out of 5 people in italy are tourists. 1 out 2 tourists speaks english. You meet a english-speaking person in italy, what is the probability that this person is italian. The way I see the "population": P(I) = \frac{2}{10} are italians who speaks english...
  29. C

    Desperately seeking help with stats 1001

    hello, I'm a newcomer here :smile: i am taking statistics 1001 and am having a horrible time with this weeks homework. it basically has to do with regression. unfortunately my teacher barely speaks english and it was especially hard to understand her this week (is she speaking about z-scores -...
  30. M

    What Are the Probabilities Linked to Business and Financial Media Consumption?

    hi, need some help on these stats problems that i am confused about? 1. market research in a particular city indicated that during a week 18% of all adults watched a television program oriented to business and financial issues, 12% read a publication oriented to these issues, and 10% do...
  31. A

    Calculating Probability and Profit for Drilling Wells: A Binomial Approach

    QUESTION3: Alexander Da Costa & Priyanka Kapadia Petroleum, plans to drill 5 exploratory wells. Jenny Wong & Danny Tieu Consulting geologists give the probability that a well results in a discovery as 0.20 for the first 3 wells. If there is a discovery in the first 3 wells the geologist's...
  32. P

    Its deciding which Stats method to use that the problem

    Its deciding which Stats method to use that the problem... Hello, i hope you don't mind a biologist interloping on your boards :) I have a statistics problem that i could do with some help with if anyone can (please!). I'm taking 2 8ml samples from a volume of 200mls. Distributed evenly...
  33. M

    How Did Fuji's APS System Impact the Point and Shoot Camera Market?

    In the early 1990s, Fuji Photo Film, USA, joined forces with four of its rivals to create the Advanced Photo System (APS), which is hailed as the first major development in the film industry since 35 - millimeter technology was introduced. In February 1996, the new 24 millimeter system...
  34. F

    News Some humiliating stats on the USA

    first of all (just so people don't try to call me a hypocrite) if I saw a list like this showing such scandalous (to put it mildly) stats on Canada I would be mortified. (actually the homeless & poverty we've got here is pathetic given Canada's wealth, lack of enemies, etc) but look at what what...
  35. M

    Solve Stats Prob: Colgate Total US Market Results

    help please: "Within three months, Colgate grabbed the number one market share for toothpaste. Ten months later, 21% of all U.S. households had purchased Total (a product of Colgate) for the first time. During this same period, 43% of those who initially tried Total purchased it again...
  36. marcus

    Arxiv stats as of December 2004

    this page of stats is an update of one that Alejandro flagged some months back. It has a different URL now, and it continues the graphs up to December 04. http://arxiv.org/Stats/ Dont expect to see anything about quantum gravity or stringy research explicitly. this is only graphing the...
  37. A

    Stats regression problem, need hlep QUICK

    If the regression equation is y=2.3-1(x) and r^2 = 0.78, what is the value of the coefficient of correlation? my answer was 0.88 and i got it wrong, is it -0.88 because of the negative slope in the equation? please help fast.. thnx
  38. P

    Characterizing Volume in Equilibrium Thermodynamics

    from callen, equation 16.10 reads Z = sum(e^-BE) the text later says that F = -kT ln Z, and states that it gives the helmholtz potential as a function of B, V, N where B = 1/kT my question is, what part of this relationship characterizes the volume?
  39. M

    Are Independence and Disjoint Events the Same in Probability?

    I'm working on a question which requires you to understand the difference between Independence and disjoint events. The question is: Suppose 24% of a population have 4 years of college, and 15% are laborers/workers. From this, can you conclude that 0.24 x 0.15 = 0.036=3.6% of the population are...
  40. T

    Solving Probability Problems for A-Level Statistics Students

    "A" Level Stats Question I was once asked to calculate the probability, that someone will come of a lift, in a large tall building? What information do I require to answer this question correctly? What probability theory do I use to solve this problem? Once I have this information...
  41. T

    Can NHTSA Estimate Tire Failure Rates Within Budget?

    Destructive SAmpling, in which the test to determine whether an item is defective destroys the item, is generally exensive, and the high costs involved often prohibit large sample sizes. For exampl, suppose the National Highway TRaffic SAfety Administrator wishes to detrmine the proportion of...
  42. S

    Two Stats questions for Math nerds (std. deviation, mean, subsets)

    1. If a school offers 3200 separate courses and a survey of these courses determines that the class size is 50 with a standard deviation of 2, what would one expect for the average and standard deviation of a subset of 50 of these classes selected randomly? 2. In a survey to estimate the...
  43. B

    Seeking Help with Stats: Sample Size Guidance Needed

    Hi! I'm new to this forum. I took some stats in University but that was so long ago that I don't remember much. I'm trying to figure out how big of a sample size I need to generate some stats for the company I work for. I'm not sure what info is needed but if anyone is interested in helping...
  44. L

    Stats Question: Anemia, Flu, and Drug Analysis

    i've got a stats issue that i'd love some ruling on if anyone cares to help.. i've got a population of hospital patients with a diagnosis of anemia admitted in 2004 - n=191 the rate at which these 191 catch the flu while in stay is 39% (or 75 of them) during that same time (2004) 90 of...
  45. S

    Poisson stats: signal to noise

    A star was measured to have an apparent magnitude m=16 with S/N=10 integrated over a minute. What is the uncertainty in the measurement? signal=flux*area*time noise=sqrt(signal)=sqrt(fAt) So, S/N=sqrt(fAt) How can I find fA? m=-2.5logfAt+K 16=-2.5log(fAt)+K Hoping that K is arbitrary...
  46. Monique

    PF Birthdate Stats: Analysis of 590 Entries

    What to do when you have some extra time? Play with some numbers ofcourse :) I downloaded all the birthdays registered in PF and did a few spreadsheet calculations: In total there were 592 entries, of which one was of a 3 year old and another of a 101 year old person.. those were not...
Back
Top