Model Evaluation
28 articles
AUC (Area Under the ROC Curve)
Classification, Machine Learning
Accuracy
Classification, Machine Learning
Baseline
Machine Learning
Classification Threshold
Classification, Machine Learning
Confusion Matrix
Classification, Machine Learning
Cross-Validation
Machine Learning
Decision Threshold
Classification, Machine Learning
Fairness Metric
AI Fairness, Ethics, Machine Learning
False Negative (FN)
Classification, Machine Learning
False Negative Rate (FNR)
Classification, Machine Learning, Statistics
False Positive (FP)
Classification, Machine Learning
False Positive Rate (FPR)
Classification, Machine Learning, Statistics
Feature Importances
Interpretability, Machine Learning
Generalization
Deep Learning, Machine Learning
Generalization Curve
Learning Theory, Machine Learning
Interpretability
AI Ethics, Machine Learning
Loss Curve
Deep Learning, Machine Learning, Training
Mean Absolute Error (MAE)
Machine Learning, Statistics
Mean Squared Error (MSE)
Machine Learning, Statistics
Model Capacity
Machine Learning
Overfitting
Deep Learning, Machine Learning
Precision
Classification, Machine Learning
Prediction Bias
Machine Learning
Process Reward Model (PRM)
AI Safety, Machine Learning, Reinforcement Learning
Recall (metric)
Classification, Machine Learning
Terminal-Bench
AI Agents, AI Benchmarks, AI Code Generation
Test Set
Machine Learning
Validation Set
Machine Learning