Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Google Exam Associate Data Practitioner Topic 1 Question 6 Discussion

Actual exam question for Google's Associate Data Practitioner exam
Question #: 6
Topic #: 1
[All Associate Data Practitioner Questions]

Your team is building several data pipelines that contain a collection of complex tasks and dependencies that you want to execute on a schedule, in a specific order. The tasks and dependencies consist of files in Cloud Storage, Apache Spark jobs, and data in BigQuery. You need to design a system that can schedule and automate these data processing tasks using a fully managed approach. What should you do?

Show Suggested Answer Hide Answer
Suggested Answer: C

Using Cloud Composer to create Directed Acyclic Graphs (DAGs) is the best solution because it is a fully managed, scalable workflow orchestration service based on Apache Airflow. Cloud Composer allows you to define complex task dependencies and schedules while integrating seamlessly with Google Cloud services such as Cloud Storage, BigQuery, and Dataproc for Apache Spark jobs. This approach minimizes operational overhead, supports scheduling and automation, and provides an efficient and fully managed way to orchestrate your data pipelines.


Contribute your Thoughts:

Trevor
13 days ago
If I had a dollar for every time I heard 'directed acyclic graph' in a cloud exam, I'd be a millionaire by now. But seriously, C or D are the best options here.
upvoted 0 times
...
Tawanna
14 days ago
Haha, I bet the exam writer is trying to trick us. They're probably looking for the fully managed solution - Cloud Composer all the way!
upvoted 0 times
...
Willodean
18 days ago
Hmm, option B might be easier to set up, but I'm not sure it can handle the level of complexity in the question. I'd go with C or D.
upvoted 0 times
...
Reta
19 days ago
I think using Apache Airflow deployed on Google Kubernetes Engine is the best choice for this scenario.
upvoted 0 times
...
France
22 days ago
I prefer creating DAGs in Cloud Composer with the appropriate operators.
upvoted 0 times
...
Deeanna
22 days ago
I think option D is the way to go. Airflow on GKE gives you more flexibility and control over your pipeline orchestration.
upvoted 0 times
Margurite
8 days ago
Yeah, Airflow on GKE allows for more customization and scalability compared to the other options.
upvoted 0 times
...
Tandra
15 days ago
I agree, option D with Apache Airflow on GKE seems like the best choice for complex data pipelines.
upvoted 0 times
...
...
Almeta
23 days ago
I agree with Ammie, Cloud Scheduler seems like a good option.
upvoted 0 times
...
Ammie
1 months ago
I think we should use Cloud Scheduler to schedule the jobs.
upvoted 0 times
...
Lizbeth
1 months ago
Definitely going with option C. Cloud Composer is the perfect tool for managing complex data pipelines with dependencies.
upvoted 0 times
Pamella
17 days ago
I agree, Cloud Composer with DAGs is the way to go for scheduling and automating data processing tasks.
upvoted 0 times
...
Wayne
19 days ago
Option C sounds like the best choice. Cloud Composer is designed for managing complex data pipelines.
upvoted 0 times
...
...

Save Cancel