
What is the first step a data analyst should take to clean their data?

(a) Merge duplicate records

(b) Validate the data

(c) Remove all outliers

(d) Impute missing data

1 Answer

Best answer

(c) Remove all outliers

Data cleaning starts with data profiling, which lets us identify outlier values and other problems in the collected data. Once a field has been profiled, it is normalized and de-duplicated, and obsolete information is removed, among other steps.
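As a rough illustration of what profiling a single field can look like, here is a minimal sketch in plain Python. The function name `profile_field`, the sample `ages` column, and the 1.5 × IQR rule for flagging outliers are all assumptions made for this example, not something the question prescribes. Note that it only reports problems; deciding what to do with them comes afterwards.

```python
from collections import Counter
from statistics import quantiles


def profile_field(values):
    """Summarize one column: missing values, repeated values, and
    values outside the 1.5 * IQR fences (an assumed outlier rule)."""
    present = [v for v in values if v is not None]

    # Count how many entries repeat a value that was already seen.
    counts = Counter(present)
    duplicates = sum(c - 1 for c in counts.values() if c > 1)

    # Quartiles of the non-missing values, then the usual IQR fences.
    q1, _, q3 = quantiles(present, n=4)
    spread = q3 - q1
    low, high = q1 - 1.5 * spread, q3 + 1.5 * spread
    outliers = [v for v in present if v < low or v > high]

    return {
        "missing": len(values) - len(present),
        "duplicates": duplicates,
        "outliers": outliers,
    }


# Hypothetical sample column with one gap, one repeat, one implausible value.
ages = [34, 29, None, 41, 29, 38, 250, 33]
report = profile_field(ages)
print(report)
```

The report flags the gap, the repeated value, and the implausible age without changing the data, so the later normalization and de-duplication steps can be chosen with the full picture in view.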

...