Interface administrators, Administrators (Semantic MediaWiki), Curators (Semantic MediaWiki), Editors (Semantic MediaWiki), Suppressors, Administrators
7,785
edits
No edit summary |
No edit summary |
||
Line 6: | Line 6: | ||
For instance, in a dataset of housing prices, attributes might include the number of bedrooms, square footage of the home, its neighborhood location and age. These details are then utilized to train a machine learning model that predicts the price tag on an unseen new home based on those attributes. | For instance, in a dataset of housing prices, attributes might include the number of bedrooms, square footage of the home, its neighborhood location and age. These details are then utilized to train a machine learning model that predicts the price tag on an unseen new home based on those attributes. | ||
==Types of Attributes== | ==Types of Attributes== | ||
Line 22: | Line 13: | ||
Categorical attributes, on the other hand, take on a limited set of possible values. For instance, neighborhood is such an attribute. Values can either be nominal (meaning no inherent order exists) or ordinal (meaning there is one inherent order present). | Categorical attributes, on the other hand, take on a limited set of possible values. For instance, neighborhood is such an attribute. Values can either be nominal (meaning no inherent order exists) or ordinal (meaning there is one inherent order present). | ||
Additionally, attributes can be binary in nature - taking one of two possible values such as true/false or yes/no. | |||
==Importance of Attributes== | ==Importance of Attributes== | ||
Line 27: | Line 20: | ||
Selecting the correct attributes is essential in building an accurate predictive model. In some cases, certain attributes may be more pertinent than others and it's essential to comprehend their relationship with your target variable in order to select those that are most pertinent. | Selecting the correct attributes is essential in building an accurate predictive model. In some cases, certain attributes may be more pertinent than others and it's essential to comprehend their relationship with your target variable in order to select those that are most pertinent. | ||
==Preprocessing Attributes== | ==Preprocessing Attributes== | ||
Before attributes can be utilized in a machine learning model, they typically need to go through preprocessing. This usually involves altering the data in various ways so that it is in an accessible format for the model to use. | Before attributes can be utilized in a machine learning model, they typically need to go through preprocessing. This usually involves altering the data in various ways so that it is in an accessible format for the model to use. | ||
Categorical attributes must be encoded as numerical values, such as by converting the neighborhood attribute into a set of binary variables (known as one-hot encoding). Numerical attributes may need to be scaled so they have similar dimensions since some machine learning algorithms require similar input data. | Categorical attributes must be encoded as numerical values, such as by converting the neighborhood attribute into a set of binary variables (known as one-hot encoding). Numerical attributes may need to be normalized or scaled to a specific range so they have similar dimensions since some machine learning algorithms require similar input data. | ||
==Explain Like I'm 5 (ELI5)== | ==Explain Like I'm 5 (ELI5)== |