Goodness of Fit Test Without Uncertainties

In summary, you can fit a gaussian distribution to the data without any uncertainties, but you need to specify the y values for each bin with an uncertainty.
  • #1
nicholls
94
0
In what way can one do a goodness of fit test on a set of data that contains no uncertainties?

I am doing a simulated data analysis assignment for a physics lab course and they provided me with data for a supposed xray spectrum for some material. The data consists of energy bins with a count of photon detections for each bin (or something like that). There is no uncertainty on these values.

I can fit a couple gaussian distributions to it, but without uncertainties I am confused as to how to approach a goodness of fit test. I do not have much statistics knowledge so any help would be great!
 
Physics news on Phys.org
  • #2
It isn't clear what you mean by "uncertainties". It also isn't clear why you can fit a couple of gaussian distributions to the data. Did you use two different methods of fitting?

The use of "bins" in a statistics problem hints at using a test like Pearson's Chi-Squared test. However, since this is a physics class, it would be wise to consult your course materials to see what they have in mind.
 
  • #3
Basically, the data I am given is a histogram where the x values are energy bins of width 0.05 KeV, and the y values are the counts of photon detections per energy bin. There is no uncertainity given for the y values.

The data consists of two peaks, both gaussian in shape. Thus, I have managed to fit using python, two gaussian functions to the two peaks, along with a linear function describing the background noise.

I would now like to perform a goodness of fit test on this fit. However, without explicit y uncertainties I am not sure how to proceed. Someone told me, although they did not give me a good explanation, that for the y uncertainty I can use 1/sqrt(N) where N is the count or y value for each bin.

Does this make more sense?
 
  • #4
You still haven't managed to define what "uncertainty" is. It may be obvious to physicists what it means in this context, but this is the math section and it isn't obvious to me what it means in the context of statistical tests. Are you talking about "standard deviation"? "probability"? "precision of measurement"?

If you have fit a distribution (which in your case is a mixture of distributions) to the data then the distribution you fit can be used to do the Pearson Chi-squared test or any other test which relies on computing the probability that [itex] y_i [/itex] values fell in bin [itex] i [/itex].
 
  • #5
What I mean by uncertainty is that there are no error bars on the y values. I'm not exactly sure but I suppose you would just call them standard deviations. Thus with 0 uncertainty, the chi-squared value of the fit tends to infinite.
 
  • #6
Look at the article on Pearson's Chi-square test in the wikipedia: http://en.wikipedia.org/wiki/Pearson's_chi-squared_test

The test statistic is [itex] \chi^2 = \sum_{i=1}^n \frac { (O_i - E_i)^2} { E_i} [/itex]

There is nothing in the computation of that test statistic that involves error bars, so I don't undestand why it would tend to be infinite.
 
  • #7
One thing we need to clarify is physical model for how the data is generated.

Are we assuming that photon energy is a random variable and that each photon energy measurement is an independent realization of that random variable?

Or is the model some deterministic process?
 
  • #8
Pearson's chi squared seems to do the trick. Thanks for showing me that.

I'm not quite sure whether to say that the photon energy is a random variable although I am inclined to say yes.

You could have a quick read of the outline of the experiment if you want:

http://www.physics.utoronto.ca/~phy326/simdata/SDA%20Assignment.pdf"

I am doing experiment A which is on the second page.
 
Last edited by a moderator:

Related to Goodness of Fit Test Without Uncertainties

What is a "Goodness of Fit Test Without Uncertainties"?

A goodness of fit test without uncertainties is a statistical test used to determine if a sample of data fits a particular distribution or model. It is used to test the hypothesis that the observed data is drawn from a specific probability distribution.

Why is a "Goodness of Fit Test Without Uncertainties" important?

A goodness of fit test without uncertainties is important because it allows scientists to determine if their data supports their hypothesis or if it is just due to chance. It helps to ensure that the conclusions drawn from the data are reliable and can be used to make accurate predictions.

What are the assumptions of a "Goodness of Fit Test Without Uncertainties"?

The main assumptions of a goodness of fit test without uncertainties are that the data is independent, normally distributed, and the sample size is large enough. The test also assumes that the data is randomly selected from the population and that the categories being tested are mutually exclusive.

How is a "Goodness of Fit Test Without Uncertainties" performed?

To perform a goodness of fit test without uncertainties, the observed data is compared to the expected values from the hypothesized distribution. The test statistic, such as the chi-square statistic, is calculated and compared to a critical value from the appropriate distribution. If the test statistic is greater than the critical value, the null hypothesis of a good fit is rejected.

What are some limitations of a "Goodness of Fit Test Without Uncertainties"?

One limitation of a goodness of fit test without uncertainties is that it only tests for a specific distribution and cannot determine if the data fits any other distributions. Additionally, the test is sensitive to sample size and may give misleading results if the sample size is too small. Finally, the test assumes that the data is independent and normally distributed, which may not always be the case.

Similar threads

  • Set Theory, Logic, Probability, Statistics
Replies
30
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
16
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
2
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
28
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
1
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
4
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
5
Views
3K
  • Set Theory, Logic, Probability, Statistics
Replies
6
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
665
  • Set Theory, Logic, Probability, Statistics
Replies
30
Views
2K
Back
Top