Amazon Exam MLS-C01 Topic 4 Question 101 Discussion

Actual exam question for Amazon's MLS-C01 exam
Question #: 101
Topic #: 4

A data scientist is building a forecasting model for a retail company by using the most recent 5 years of sales records that are stored in a data warehouse. The dataset contains sales records for each of the company's stores across five commercial regions. The data scientist creates a working dataset with StoreID, Region, Date, and Sales Amount as columns. The data scientist wants to analyze yearly average sales for each region. The scientist also wants to compare how each region performed compared to average sales across all commercial regions.

Which visualization will help the data scientist better understand the data trend?

Suggested Answer: D

The best visualization for this task is a bar plot of average sales for each region, faceted by year, with a horizontal line in each facet that marks average sales across all regions. Within a facet, the data scientist can compare the regions against one another and against the overall average for that year; reading across facets shows the trend over time. The other options are less effective because they either do not show the yearly trends, do not show the overall average sales, or do not group the data by region.
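The plot described above could be sketched roughly as follows with pandas and Matplotlib. This is only an illustration under stated assumptions: the data is synthetic, the column names `StoreID`, `Region`, `Date`, and `SalesAmount` and the region labels are placeholders for the working dataset, and the horizontal line is taken to be the mean of all sales records in that year (the question does not say exactly how the overall average is computed).

```python
import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import numpy as np
import pandas as pd

# Synthetic stand-in for the working dataset; all values are made up.
rng = np.random.default_rng(0)
dates = pd.date_range("2019-01-01", "2023-12-31", freq="MS")  # 5 years, monthly
regions = ["North", "South", "East", "West", "Central"]       # hypothetical names
records = [
    {"StoreID": f"{region[0]}{store}", "Region": region, "Date": date,
     "SalesAmount": rng.uniform(50, 150)}
    for region in regions
    for store in range(3)
    for date in dates
]
df = pd.DataFrame(records)
df["Year"] = df["Date"].dt.year

# Yearly average sales per region (rows: year, columns: region).
regional_avg = (
    df.groupby(["Year", "Region"])["SalesAmount"].mean().unstack("Region")
)
# Assumed definition of the reference line: mean of all sales in each year.
overall_avg = df.groupby("Year")["SalesAmount"].mean()

# One facet per year: a bar per region plus a horizontal line for the average.
years = list(regional_avg.index)
fig, axes = plt.subplots(1, len(years), figsize=(3 * len(years), 3), sharey=True)
for ax, year in zip(axes, years):
    ax.bar(list(regional_avg.columns), regional_avg.loc[year].to_numpy())
    ax.axhline(overall_avg.loc[year], color="red", linestyle="--")
    ax.set_title(str(year))
    ax.tick_params(axis="x", rotation=90)
axes[0].set_ylabel("Average sales")
fig.tight_layout()
```

Calling `fig.savefig(...)` (or `plt.show()` with an interactive backend) renders the figure. The shared y-axis is a design choice that keeps bar heights comparable across years.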

References:

pandas.DataFrame.groupby --- pandas 2.1.4 documentation

pandas.DataFrame.plot.bar --- pandas 2.1.4 documentation

Matplotlib - Bar Plot - Online Tutorials Library


Contribute your Thoughts:

Xuan
10 days ago
Personally, I'd go with option D. It's clean, it's clear, and it gives the data scientist exactly what they need to understand the regional sales trends. Can't go wrong with that.
upvoted 0 times
...
Devora
17 days ago
Haha, option A with the extra bar in each facet to represent the average? That's like a visual dad joke. I mean, it might work, but it seems a little gimmicky.
upvoted 0 times
Dean
8 days ago
I think option A is a bit gimmicky with the extra bar for average sales.
upvoted 0 times
...
...
Tenesha
25 days ago
I agree with Joseph, option D seems like the most straightforward way to get the insights the data scientist is looking for. Plus, the horizontal line to represent the overall average is a nice touch.
upvoted 0 times
Luis
6 days ago
Definitely, the horizontal line will make it easier to see how each region performed relative to the average.
upvoted 0 times
...
Berry
7 days ago
I agree, having the average sales for each region compared to the overall average will provide a clear comparison.
upvoted 0 times
...
Cassi
8 days ago
I think option D is the best choice for visualizing the data trend.
upvoted 0 times
...
...
Marvel
30 days ago
I think option D is the most suitable as it shows average sales for each region faceted by year, providing a clear comparison.
upvoted 0 times
...
Phuong
1 month ago
Hmm, I'm not sure. Option B looks like it could be useful too, with the bar plot colored by region and faceted by year. That might help the data scientist see how each region is trending over time compared to the others.
upvoted 0 times
Amira
3 days ago
True, Option B could provide a clear comparison of how each region is performing compared to the others.
upvoted 0 times
...
Stefany
5 days ago
But Option B also seems interesting, with the bar plot colored by region to compare performance across regions.
upvoted 0 times
...
Lauran
6 days ago
I agree, Option A seems like a good choice to analyze yearly average sales for each store.
upvoted 0 times
...
Sarina
14 days ago
I think Option A could work well, with the bar plot faceted by year for each store.
upvoted 0 times
...
...
Joseph
1 month ago
I think option D is the way to go. It'll give a clear visualization of the yearly average sales for each region, with the added horizontal line to show the overall average. This should make it easy to spot any regions that are performing above or below the average.
upvoted 0 times
Dorthy
19 days ago
Yeah, I think having the yearly average sales for each region with the overall average line will provide a good comparison.
upvoted 0 times
...
Clorinda
30 days ago
I agree, option D seems like the best choice for this analysis.
upvoted 0 times
...
...
Barney
1 month ago
I prefer option C because it focuses on average sales for each region, which is more relevant for comparison.
upvoted 0 times
...
Truman
1 month ago
I disagree, I believe option B is better as it shows average sales for each store colored by region.
upvoted 0 times
...
Margery
1 month ago
I think option A is the best choice because it shows average sales for each store faceted by year.
upvoted 0 times
...
