If a dataset contains rows with individual people and columns for year of birth, country, and income, how many of the columns are continuous and how many are categorical?
- 1 continuous and 2 categorical
- 3 categorical
- 3 continuous
- 2 continuous and 1 categorical
Answer(s): D
Explanation:
The columns can be grouped into two types--categorical and continuous columns:
A column is called categorical if its value can only be one of the categories in a finite set. For example, the native country of a person (U.S., India, Japan, etc.) or the education level (high school, college, etc.) are categorical columns.
A column is called continuous if its value can be any numerical value in a continuous range. For example, the capital gain of a person (e.g. $14,084) is a continuous column.
Year of birth and income are continuous columns. Country is a categorical column.
You could use bucketization to turn year of birth and/or income into categorical features, but the raw columns are continuous.
Reference:
https://www.tensorflow.org/tutorials/wide#reading_the_census_data
Reveal Solution Next Question