How cosmologists determine universe flatness

Buzz Bloom · Dec 19, 2015

What criteria are, or reasonably might be, used by cosmologists to decide whether or not the assumption that curvature equals zero produces a better cosmological model than one with a non-zero curvature?

I read (as best as I could) Section 6.2.4. Curvature, pp 37-39 of
http://planck.caltech.edu/pub/2015results/Planck_2015_Results_XIII_Cosmological_Parameters.pdf .
The following is a summary of what I interpreted Section 6.2.4 to be saying.

Several cosmological models were discussed, all of which seems to have the following parameters : H₀, Ω_m, Ω_Λ, and Ω_k. For some of the datasets there may also have been some additional model parameters that I did not understand. A variety of combined data sets were used to create the various models. I did not understand the labels used to describe the combinations of datasets :

1. Planck TT+lowP posterior (Figure 25)
2. Planck TT,TE,EE+lowP (Figure 26)
3. Planck TT,TE,EE+lowP+lensing (Figure 26)
4. Planck TT,TE,EE+lowP+lensing+BAO (Figure 26)

The text then gives the following respective values for Ω_k, each with a 2 sigma, 95% confidence level error range.

1. (Equation 47) -0.053 +0.049 -0.055
2. (Equation 48) -0.040 +0.038 -0.041
3. (Equation 49) +0.005 +0.016 -0.017
4. (Equation 50) +0.000 +/-0.005

The text adds, “We adopt Eq. (50) as our most reliable constraint on spatial curvature. Our universe appears to be spatially flat to an accuracy of 0.5%”

As best as I can tell, the text seems to be saying that the choice of Equation 50 is based on the fact that the error range is the smallest. I would appreciate it if someone can authoritatively say whether or not this is the case. If this is the case, I have some concerns about the protocol.

In my work for several years before I retired, I was involved in the development of data mining software, and became aware of the phenomenon of over-fitting a model to a dataset. This means continuing to “improve” a model's figure of merit past the optimum point of the model's ability to make good predictions on data that was not used to build the model. The concept is that although the figure of merit improved, what was happening was that the specifics of the training data set, including any statistical anomalies or outliers, were being modeled rather than the the general characteristics of the population from which the dataset was a sample.

A protocol commonly used to avoid over-fitting is to divide the dataset into two subsets. One subset is used to build the model, and the second is used to determine how good a predictor that model is. I am unable to tell whether such a protocol was used in developing the models described in the article.

Chalnoth · Dec 19, 2015

That was selected because that choice includes the most data. To get an accurate measurement of the spatial flatness, it's best to combine data over a very large range of distances. So data which combines the extremely far-away CMB with relatively nearby data is the best. The "TT+TE+EE" data is the far-away data set, while lowP, lensing ,and BAO are all different methods of measuring the distribution of structure in the (relatively) nearby universe.

The problem with overfitting is an issue that arises if you have too many parameters, which can become very easy to do if you start adding parameters. What they're doing here is the opposite: they're adding additional data to constrain the same number of parameters. When your model is a good one, you expect the addition of new data to reduce the errors.

Buzz Bloom · Dec 23, 2015

Hi @Chalnoth:

Thank you for your explanation. I understand that most issues regarding overfitting involve a large number parameters and an iterative process for adjusting parameter values, but overfitting can also occur with a relatively small number of parameters if the process of building a series of models involves adding parameters.

The earlier models were built to fit observations of brightness vs. red-shift. The later models involve fitting other observation that I confess I don't understand. How certain are you that when fitting a model to this different data for CMB observations and lensing observations that additional parameters were not needed so apples could be fitted along with oranges?

Regards,
Buzz

Chalnoth · Dec 23, 2015

Buzz Bloom said:

Hi @Chalnoth:

Thank you for your explanation. I understand that most issues regarding overfitting involve a large number parameters and an iterative process for adjusting parameter values, but overfitting can also occur with a relatively small number of parameters if the process of building a series of models involves adding parameters.

The earlier models were built to fit observations of brightness vs. red-shift. The later models involve fitting other observation that I confess I don't understand. How certain are you that when fitting a model to this different data for CMB observations and lensing observations that additional parameters were not needed so apples could be fitted along with oranges?

Regards,
Buzz

Pretty confident. One of the nice things about working with large-scale astrophysics observations is that there are generally independent ways of checking the data. For example, with the CMB, if you get the calibration of the detectors off, you'll tend to have huge obvious stripes in the data that run along the path that the telescope traces as it scans the sky. If you get the average value of the calibration off, then some methods to extract the foregrounds from the CMB will utterly fail. Then you can check the result against the results of other experiments.

Data mining is usually dealing with far, far more complicated systems (human systems), where simple models to describe the system can never be accurate. It becomes much harder to separate assumption from measurement, as changes in some assumptions can change the data pretty dramatically.

With large-scale astrophysics data, for the most part assumptions only change the data by about one standard deviation or so, which isn't enough to change the overall meaning (since we expect one-standard-deviation differences anyway).

This breaks down when we start looking at smaller-scale data, such as galaxies and galaxy clusters. The physics there gets complicated enough that there are pretty big uncertainties due to assumptions made (typically approximations performed to make the calculations tractable). But at large scales, all those niggling details average out and we get a very clean result that is highly independent of these kind of assumptions.

Buzz Bloom · Dec 23, 2015

Chalnoth said:

Pretty confident

Hi @Chalnoth:

Thank you very much for your thoughtful and informative answers to my questions.

Regards,
Buzz

How cosmologists determine universe flatness

Related to How cosmologists determine universe flatness

1. How do cosmologists determine if the universe is flat?

2. What evidence supports the idea of a flat universe?

3. What does it mean for the universe to be flat?

4. How does the concept of dark energy affect the determination of universe flatness?

5. Are there any competing theories to the idea of a flat universe?

Similar threads

Hot Threads

Recent Insights