Question 11

Domain 2: Explore data and run experiments

You are analyzing a numerical dataset that contains missing values in several columns. You need to clean the missing values using an appropriate operation without changing the dimensionality of the feature set, and you want to preserve the full dataset as much as possible. A proposed solution is to remove the entire column that contains the missing data point. Does this solution meet the goal?

A. No; removing an entire column changes the feature set dimensionality and discards potentially useful data instead of preserving the full dataset. B. Yes; dropping the column is always the best way to handle missing values because it keeps the remaining data unchanged.

Previous Next

Question 11

Explanation

Why each option is right or wrong