Fitting Highly Non-linear Data

In summary, the conversation is about an individual trying to find a best fit equation for radial velocity data of a planet. They are using a maximum likelihood method but are struggling to achieve a good fit. Other methods, such as mapping the parameter space and using Markov Chain Monte Carlo, are suggested as possible solutions. Additionally, suggestions for specific math models and software are given to help with the fitting process.
  • #1
NoobixCube
155
0
Hi all,
At the moment I am trying to find a best fit equation to radial velocity data vs. time of a planet HD17156b. The paper with the Authors fit parameters (I am trying to mimic the fit) has the arXiv ref number of :0704.1191v2
From this paper I extract their data, which is on the final few pages and take note of their fit parameters.
I use the relevant equations to make a continuous curve using there fit parameters. They say in the paper they achieve a fit with a normalised Chi squared value = 1.17
When I plot the data, and the continuous curve I note that the fit is definitely not close to 1. I believe this is because of the nature of the equations used are highly non-linear and the error in their fit parameters are throwing off the continuous curve.
When I try to find a best fit using the Newton-Raphson method in conjunction with the Levenberg-Marquardt Algorithm I use their best fit parameters as my initial starting point. Assuming surely this would find a good fit to the data. But I achieve a horrible fit. Anybody have any ideas that could help me achieve a better fit similar to the original Authors fit?
Below is a plot of the original Data and the Authors best fit parameters leading to the continuous curve on the plot.
 

Attachments

  • AuthorvData.JPG
    AuthorvData.JPG
    14.1 KB · Views: 379
Astronomy news on Phys.org
  • #2
It sounds like you are using a 'downhill' maximum likelyhood (aka 'best fit') method. In the case of a real parameter space, with real noisy data and, as you say, non-linear equations, it is quite possible that the parameter space (i.e. Chi squared as a function of the fit parameters) is not a smooth function with a single global minima. You may be stuck in a local minima, like rolling a marble down a rocky slope and the marble gets stuck in a hole before it reaches the bottom.

Maximum likelyhood fitting is IMHO a bad idea. It works sometimes but you never know in advance when it will and when it will not. A better approach is to map the parameter space and see how the solution behaves. The most general approach is a Markov Chain Monte Carlo random walk through the parameter space.

Rather than losing it in the re-telling, have a look at http://adsabs.harvard.edu/abs/2005ApJ...631.1198G" paper. It is a marvelous case study into the pitfalls of maximum likelyhood fitting of extrasolar planet data. It shows that the maximum likelyhood fit of Tinney et al. 2003 for the same data was actually the least likely of 3 possible orbits. I would suggest reading this paper, and others by the same author, and implementing the kind of fitting algorithm they use.
 
Last edited by a moderator:
  • #3
Thanks Wallace I will have a look! :approve:
 
  • #4
Hmm, actually I think the paper I described is a different one, though by the same author. I think it is Gregory 2005a that is mentioned in the second paragraph of the paper I linked to. It might be better to read that one first since the one I linked I think is an extension. In any case that paper should still be very useful, but the actual algorithm is probably described in more detail elsewhere. Play follow the citations and you should find it!
 
  • #5
Is there an arXiv's link to the first one. I found it on the Chicago Journals website but that requires a subscription.
Also could you list some texts that may be of some use too. Cheers
 
  • #6
If you search on ADS by a known title and author it will give you a link to the arxiv version of a paper.
 
  • #7
What math models have you or others tried on these data points?

A Lorentz function or damped sine series might do a good job. There is a curve-fit program named http://www.digitalCalculus.com/demo/curvfit.html" that has these and other math models to fit ones data.
 
Last edited by a moderator:

Related to Fitting Highly Non-linear Data

What is non-linear data?

Non-linear data is data that does not follow a linear pattern or relationship. This means that the relationship between the variables in the data cannot be represented by a straight line on a graph.

Why is it important to fit non-linear data?

Fitting non-linear data allows us to better understand the relationship between variables and make predictions based on the data. It can also help us identify patterns and trends that may not be apparent when looking at the raw data.

What techniques are used to fit highly non-linear data?

There are various techniques that can be used to fit highly non-linear data, including polynomial regression, non-linear least squares, and neural networks. The choice of technique depends on the specific data and the desired outcome.

What challenges are associated with fitting highly non-linear data?

Fitting highly non-linear data can be challenging because there may not be a clear relationship between the variables, and the data may have a lot of noise or outliers. It can also be difficult to determine the best model to fit the data and avoid overfitting.

How do you evaluate the accuracy of a non-linear model?

The accuracy of a non-linear model can be evaluated by looking at the residuals, which are the differences between the predicted values and the actual values. Other metrics, such as mean squared error and R-squared, can also be used to assess the performance of the model.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
11
Views
958
  • Calculus and Beyond Homework Help
Replies
6
Views
703
  • Astronomy and Astrophysics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
1K
  • STEM Educators and Teaching
Replies
5
Views
836
  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
1K
  • Programming and Computer Science
Replies
4
Views
808
  • General Math
Replies
6
Views
841
  • MATLAB, Maple, Mathematica, LaTeX
Replies
2
Views
1K
Back
Top