Statistical significance of a ML model...

In summary, determining if a ML model is statistically significant involves using tests such as t-tests or F-tests for models like linear regression or logistic regression. However, for other models like decision trees, SVM, or neural nets, there is a subfield called uncertainty quantification that is actively developing methods to determine statistical significance. It is important to set aside a part of the input data for testing purposes to avoid bias in the results.
  • #1
fog37
1,568
108
TL;DR Summary
Determining if a ML model is statistically significant...
Hello,

How do we check if a ML model is statistically significant? For models like linear regression, logistic regression, etc. there are tests (t-tests, F-tests, etc.) that will tell us if the model, trained on some dataset, is statistically significant or not.

But in the case of ML models, like decision trees, SVM, or neural nets, how do we determine if the model is statistically significant? I have not seen any specific test to do that...

Thank you!
 
Technology news on Phys.org
  • #2
There is a whole subfield on this called UQ - uncertainty quantification. It is an area or active development.
 
  • #3
fog37 said:
TL;DR Summary: Determining if a ML model is statistically significant...

But in the case of ML models, like decision trees, SVM, or neural nets, how do we determine if the model is statistically significant? I have not seen any specific test to do that...
The t test will work with any predictive model. You're supposed to set aside a part of the input data, and not use it in your model and use it for testing later. (Because predicting your input data with a ML model is cheating). For a yes/no model, you can score a 1 for correct, and 0 for wrong, and you can compare it other ways to predict the outcomes (or random guessing),
 

Similar threads

  • Programming and Computer Science
Replies
4
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
14
Views
309
  • Programming and Computer Science
Replies
28
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
3
Views
914
  • Programming and Computer Science
Replies
7
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
30
Views
2K
  • Programming and Computer Science
Replies
22
Views
951
  • Set Theory, Logic, Probability, Statistics
Replies
23
Views
2K
  • Set Theory, Logic, Probability, Statistics
Replies
8
Views
1K
  • Set Theory, Logic, Probability, Statistics
Replies
7
Views
544
Back
Top