Free CompTIA DataX DY0-001 Actual Exam Questions - Question 7 Discussion
A data scientist is working with a data set that covers a two-year period for a large number of machines. The data set contains:
Machine system ID numbers
Sensor measurement values
Daily time stamps for each machine
The data scientist needs to plot the total measurements from all the machines over the entire time period. Which of the following is the best way to present this data?
B imo, because the data spans over time and we want to see trends or changes in totals. Scatter plots or histograms don’t show that progression as clearly. Box plots don’t capture daily totals well here.
A/B? Scatter plots (A) are usually better for showing individual data points or relationships, but since we want to see total measurements over time, a line plot (B) seems more straightforward. Histograms (C) don’t really fit here because they show frequency distributions, not trends over time. Box plots (D) summarize data distributions but won’t show how totals change day by day. So between A and B, B feels like the best choice to capture how measurements evolve across the time period.
Makes sense to go with B here. A line plot is best for showing how total measurements change day by day over the two years, which fits the time-series nature of this data. Scatter plots (A) are good for seeing relationships but not trends over time, and histograms (C) just show frequency distribution, so they miss the time factor. Box plots (D) summarize spread but won’t help visualize totals across the whole period either. So yeah, line plot (B) is the way to go.
Maybe D could work if the goal was to summarize the distribution of total measurements per day rather than showing trends over time. But since the question specifically asks for totals over the entire period, a line plot (B) probably makes more sense to track changes day by day. Scatter plots and histograms don’t capture the time series aspect well here.
It’s B because we want to see how the total measurements change across the two years, so a line plot makes the trend clear. Histograms don’t show time progression well for this kind of data.
It’s definitely not A or D because a scatter plot is better for showing relationships between two variables and box plots are for distributions, not trends over time. Between B and C, a histogram breaks data into bins but doesn’t show change over time, which is the key here. So B is the best choice since you want to see the total measurements day by day across two years.
Maybe B. Since the data covers daily timestamps over two years, a line plot would clearly show how total measurements change over time, which seems to be the main goal here.
B/C? A line plot (B) shows trends over time, but a histogram (C) could also work if you want to see the distribution of total measurements instead of the trend. Depends on focus: trend vs distribution.
Actually, I think option B makes the most sense here because you want to see how measurements change over time. A line plot is great for showing trends across a continuous time axis, especially with daily timestamps. Scatter plots (A) would be too cluttered with so many machines and dates. Histograms (C) are for distributions, not trends, and box plots (D) summarize distributions but don’t show changes over time. So, line plot (B) gives a clear overall view of total measurements through the two years.
Maybe B, since it’s a time series and line plots show trends well.