Brief introduction to the use of statistics

In summary, the conversation revolves around a Medium article titled "The 5 Basic Statistics Concepts Data Scientists Need to Know" by George Seif. The participants discuss their thoughts on the article, with one person expressing dissatisfaction with the way the material is presented and another mentioning a website with problems to practice statistics concepts. They also discuss the importance of understanding basic concepts when learning new material.
  • #1
scottdave
Science Advisor
Homework Helper
Insights Author
Gold Member
1,989
964
I'm not really sure where to put this. I came across this nice article on Medium. “The 5 Basic Statistics Concepts Data Scientists Need to Know” by George Seif https://link.medium.com/APtnCOapOV
 
Physics news on Phys.org
  • #2
What do you like about that article? I haven't read it all (I gave up at "the first quartile is essentially the 25th percentile") but personally I disagree with the way almost all I have read so far is presented.
 
  • #3
Hi @pbuk I would certainly like to know if something I'm reading or Sharing is wrong. It has been years since I've taken a Statistics course, but I am taking a course which uses some. I was looking for something to brush up and came across this. I didn't notice anything glaringly wrong. Please let me know.
 
  • #4
What has worked for me is to combine applied and theoretical approaches. I used D.Montgomery's Statistics for Engineers and Schaum's for applied and my old Freund's textbook. Then go over a lot of sites with Stats content.
 
  • Informative
Likes scottdave
  • #5
Its not that I have noticed anything wrong (although as I say I haven't read it all), it's just the way the material is presented - for instance "[the] Median is used over the mean since it is more robust to outlier values". When is the median used in preference over the mean? What even is the mean? Or the median (it is explained as "the line in the middle!")? What does the symbol "IQR" on the chart mean (it is "inter-quartile range of course, and the author talks about the box-plot being "short" or "tall" (in relation to what?) without referring to this label). And then the wonderful tautology I have already quoted - "the first quartile is essentially the 25th percentile". If I don't know what a quartile is how on Earth am I going to know what a percentile is?

When I am looking for an introductory or refresher text, a key indicator is how well it explains the things I already know. If it does a good job, I am inclined to trust the author to explain new material. If it doesn't, then I turn elsewhere.
 
  • Like
Likes lomidrevo, WWGD and PeroK
  • #6
pbuk said:
Its not that I have noticed anything wrong (although as I say I haven't read it all), it's just the way the material is presented - for instance "[the] Median is used over the mean since it is more robust to outlier values". When is the median used in preference over the mean? What even is the mean? Or the median (it is explained as "the line in the middle!")? What does the symbol "IQR" on the chart mean (it is "inter-quartile range of course, and the author talks about the box-plot being "short" or "tall" (in relation to what?) without referring to this label). And then the wonderful tautology I have already quoted - "the first quartile is essentially the 25th percentile". If I don't know what a quartile is how on Earth am I going to know what a percentile is?

When I am looking for an introductory or refresher text, a key indicator is how well it explains the things I already know. If it does a good job, I am inclined to trust the author to explain new material. If it doesn't, then I turn elsewhere.
That makes sense. Thanks.
 
  • #7
Having just very quick view on the article, I found it quite vague. For example:
A Poisson Distribution is similar to the Normal but with an added factor of skewness.
This doesn't explain anything about Poisson distribution. And no single mention that it is a discrete distribution comparing to Normal, which is continuous.
 
  • #8
lomidrevo said:
Having just very quick view on the article, I found it quite vague. For example:

This doesn't explain anything about Poisson...
After rereading it, I agree. It seems like "here are some things to go learn more about."

In doing some searching I came across this site http://rosalind.info It has some problems to try to solve, related to bioinformatics, many of which use statistics concepts. To solve some, writing a program is helpful, as they give a 5 minute time limit, then the data changes. It kind of reminds me of Project Euler style, in a way.
 

Related to Brief introduction to the use of statistics

What is statistics?

Statistics is a branch of mathematics that involves collecting, organizing, analyzing, and interpreting data. It is used to make informed decisions, identify patterns and trends, and make predictions based on data.

Why is statistics important?

Statistics is important because it allows us to make sense of large amounts of data and draw meaningful conclusions. It is used in a variety of fields such as business, healthcare, social sciences, and more to inform decision-making and solve problems.

What are the main types of statistics?

The main types of statistics are descriptive statistics, which involves summarizing and describing data, and inferential statistics, which involves making predictions and generalizations about a larger population based on a sample of data.

What are some common statistical measures?

Some common statistical measures include mean, median, mode, range, standard deviation, and correlation coefficient. These measures help to describe and understand the data by providing information about central tendency, variability, and relationships between variables.

How is statistics used in research?

Statistics is used in research to analyze data collected from experiments or surveys and draw conclusions. It helps researchers to identify patterns and relationships between variables, test hypotheses, and make predictions. It also allows for the replication and verification of results.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
778
  • Set Theory, Logic, Probability, Statistics
Replies
21
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
13
Views
981
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
645
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
885
Back
Top