written 2.6 years ago by |
Type of Attributes
• This is the First step of Data Data-preprocessing. We differentiate between different types of attributes and then preprocess the data.
• So here is description of attribute types.
Qualitative (Nominal (N), Ordinal (O), Binary(B)).
Quantitative (Discrete, Continuous)
Nominal Attributes - related to names
• The values of a Nominal attribute are name of things, some kind of symbols.
• Values of Nominal attributes represents some category or state and that's why nominal attribute also referred as categorical attributes and there is no order (rank, position) among values of nominal attribute.
Attribute | Values |
---|---|
Colours | Black, Brown, White |
Categorical Data | Lecturer, Professor, Assistant Professor |
Binary Attributes
Binary data has only 2 values/states. For Example yes or no, affected or unaffected, true or false.
i. Symmetric : Both values are equally important (Gender).
ii. Asymmetric : Both values are not equally important (Result).
Attribute | Values |
---|---|
Cancer Detected | Yes, No |
Result | Pass, Fail |
Gender | Male, Female |
Ordinal Attributes
• The Ordinal Attributes contains values that have a meaningful sequence or ranking(order) between them, but the magnitude between values is not actually known, the order of values that shows what is important but don't indicate how important it is
Attribute | Values |
---|---|
Grade | A,B,C,D,E,F |
Basic Pay Scale | 16,17,18 |
Quantitative Attributes
Numeric : A numeric attribute is quantitative because, it is a measurable quantity, represented in integer or real values.
• Numerical attributes are of 2 types, interval and ratio.
i. An interval-scaled attribute has values, whose differences are interpretable, but the numerical attributes do not have the correct reference point or we can call zero point.
• Data can be added and subtracted at interval scale but can not be multiplied or divided.
Consider a example of temperature in degrees Centigrade.
• If a days temperature of one day is twice than the other da we cannot say that one day is twice as hot as another day.
ii. A ratio-scaled attribute is a numeric attribute with an fix zero-point.
If a measurement is ratio-scaled, we can say of a value as being a multiple (or ratio) of another value.
• The values are ordered, and we can also compute the difference between values, and the mean, median, mode, Quantile-range and Five number summary can be given.
Discrete : Discrete data have finite values it can be numerical and can also be in categorical form.
• These attributes has finite or countable infinite set of values
Attribute | Values |
---|---|
Profession | Teacher, Business man, Peon |
ZIP Code | 301701, 110040 |
Continuous :
Continuous data have infinite no of states. Continuous data is of float type. There can be many values between 2 and 3.
Attribute | Values |
---|---|
Height | 5.4, 6.2 ... etc |
Weight | 50, 33, ... etc |