Google Exam Professional Data Engineer Topic 3 Question 85 Discussion

Actual exam question for Google's Professional Data Engineer exam

Question #: 85
Topic #: 3

You operate a database that stores stock trades and an application that retrieves average stock price for a given company over an adjustable window of time. The data is stored in Cloud Bigtable where the datetime of the stock trade is the beginning of the row key. Your application has thousands of concurrent users, and you notice that performance is starting to degrade as more stocks are added. What should you do to improve the performance of your application?

A. Change the row key syntax in your Cloud Bigtable table to begin with the stock symbol.

B. Change the row key syntax in your Cloud Bigtable table to begin with a random number per second.

C. Change the data pipeline to use BigQuery for storing stock trades, and update your application.

D. Use Cloud Dataflow to write a summary of each day's stock trades to an Avro file on Cloud Storage. Update your application to read from Cloud Storage and Cloud Bigtable to compute the responses.

Suggested Answer: A

Why A is correct:

Hotspotting: Cloud Bigtable stores rows sorted lexicographically by row key. When the key begins with the trade datetime, new writes all land in the same narrow part of the key space, so a small number of nodes absorb most of the load.

Scattered reads: With a time-first key, one company's trades are interleaved with every other company's trades. Computing an average for a single stock over a window means scanning rows for all stocks in that window, which gets slower as more stocks are added. This matches the symptom in the question.

Symbol-first key: Beginning the row key with the stock symbol, followed by the datetime, spreads writes across the key space and stores each company's trades contiguously in time order. A query for one company over an adjustable window becomes a single contiguous range scan.

Why not B: A random prefix spreads writes, but one company's window can no longer be read as a single range.

Why not C: BigQuery is an analytical warehouse and is generally a poor fit for low-latency lookups from thousands of concurrent users.

Why not D: Daily summaries add moving parts without fixing the row key, and they do not serve an arbitrary window directly.
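For option A in the question, a minimal sketch of a symbol-first row key may help. It does not call the Bigtable client. A sorted Python list stands in for the table, and the key layout SYMBOL#YYYYMMDDHHMMSS, the helper names (trade_row_key, average_price, utc), and the sample prices are all assumptions made for illustration, not anything given in the question.

```python
from bisect import bisect_left
from datetime import datetime, timezone


def trade_row_key(symbol: str, traded_at: datetime) -> bytes:
    """Build a symbol-first key such as b'GOOG#20240102100000' (assumed layout)."""
    ts = traded_at.astimezone(timezone.utc).strftime("%Y%m%d%H%M%S")
    return f"{symbol}#{ts}".encode("utf-8")


def average_price(rows, symbol, start, end):
    """Average prices for one symbol over [start, end).

    rows is a list of (row_key, price) sorted by row_key, which mimics the
    lexicographic ordering Bigtable keeps. Because the key starts with the
    symbol, the whole window is one contiguous slice of that list.
    """
    keys = [key for key, _ in rows]
    lo = bisect_left(keys, trade_row_key(symbol, start))
    hi = bisect_left(keys, trade_row_key(symbol, end))
    prices = [price for _, price in rows[lo:hi]]
    return sum(prices) / len(prices) if prices else None


def utc(day: int, hour: int) -> datetime:
    """Timezone-aware sample timestamp in January 2024 (illustrative only)."""
    return datetime(2024, 1, day, hour, tzinfo=timezone.utc)


# Made-up trades for two symbols, sorted by key as a stand-in for the table.
rows = sorted([
    (trade_row_key("GOOG", utc(2, 10)), 140.0),
    (trade_row_key("AAPL", utc(2, 10)), 185.0),
    (trade_row_key("GOOG", utc(2, 11)), 142.0),
    (trade_row_key("AAPL", utc(2, 11)), 187.0),
    (trade_row_key("GOOG", utc(3, 10)), 150.0),
])

goog_avg = average_price(rows, "GOOG", utc(2, 0), utc(3, 0))
print(goog_avg)  # 141.0
```

With a datetime-first key, the same window would interleave AAPL and GOOG rows, so the read would have to scan and filter every symbol's trades. With a random prefix, as in option B, the window would be split across as many ranges as there are prefixes.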

by Yolando at Jun 02, 2024, 11:48 AM

2 months ago

D) Combining Cloud Dataflow, Cloud Storage, and Cloud Bigtable sounds like a lot of moving parts. Might be overkill for this use case.

upvoted 0 times

Clarence

12 days ago

A) Change the row key syntax in your Cloud Bigtable table to begin with the stock symbol.

upvoted 0 times

Stephanie

14 days ago

D) Combining Cloud Dataflow, Cloud Storage, and Cloud Bigtable sounds like a lot of moving parts. Might be overkill for this use case.

upvoted 0 times

Delpha

15 days ago

A) Change the row key syntax in your Cloud Bigtable table to begin with the stock symbol.

upvoted 0 times

3 months ago

B) A random number per second? That's just crazy! How are we supposed to query that efficiently? Definitely not the way to go.

upvoted 0 times

Emeline

1 month ago

User 2: That sounds like a good idea, it would definitely help with querying efficiently.

upvoted 0 times

Regenia

2 months ago

User 1: I think we should change the row key syntax to begin with the stock symbol.

upvoted 0 times

Ammie

3 months ago

A) Changing the row key syntax to start with the stock symbol sounds like a good idea. That way, we can group all the trades for a single stock together and improve query performance.

upvoted 0 times

Jaime

3 months ago

I disagree. I believe option D), using Cloud Dataflow to write a summary of each day's stock trades to an Avro file, is the best approach.

upvoted 0 times

Tiera

3 months ago

I think we should go with option A) Change the row key syntax to begin with the stock symbol.

upvoted 0 times
