Data-centric AI (DCAI): Difference between revisions

no edit summary
No edit summary
No edit summary
Line 35: Line 35:


==Reasons for Data-centric AI==
==Reasons for Data-centric AI==
*Data quality issues are costing the U.S. alone an estimated $3 Trillion annually.
*Data quality issues alone are estimated to cost the United States $3 Trillion annually.
*Automated methods and systematic engineering principles are now needed to ensure ML models are trained with clean data.
*Automated methods, systematic engineering principles and automated methods are required to ensure that ML models are trained using clean data.
*Recent research on image classification with noisily labeled data revealed simple methods which adaptively change the dataset can lead to more accurate models than sophisticated modeling strategies.
*Recent research has shown that simple methods that adapt to changing data can produce more accurate models than complex modeling strategies.