Databricks Exam Databricks Certified Generative AI Engineer Associate Topic 5 Question 4 Discussion

Actual exam question for Databricks's Databricks Certified Generative AI Engineer Associate exam

Question #: 4
Topic #: 5

[All Databricks Certified Generative AI Engineer Associate Questions]

A Generative AI Engineer developed an LLM application using the provisioned throughput Foundation Model API. Now that the application is ready to be deployed, they realize their volume of requests are not sufficiently high enough to create their own provisioned throughput endpoint. They want to choose a strategy that ensures the best cost-effectiveness for their application.

What strategy should the Generative AI Engineer use?

ASwitch to using External Models instead

BDeploy the model using pay-per-token throughput as it comes with cost guarantees

CChange to a model with a fewer number of parameters in order to reduce hardware constraint issues

DThrottle the incoming batch of requests manually to avoid rate limiting issues

Show Suggested Answer

Suggested Answer: B

Problem Context: The engineer needs a cost-effective deployment strategy for an LLM application with relatively low request volume.

Explanation of Options:

Option A: Switching to external models may not provide the required control or integration necessary for specific application needs.

Option B: Using a pay-per-token model is cost-effective, especially for applications with variable or low request volumes, as it aligns costs directly with usage.

Option C: Changing to a model with fewer parameters could reduce costs, but might also impact the performance and capabilities of the application.

Option D: Manually throttling requests is a less efficient and potentially error-prone strategy for managing costs.

Option B is ideal, offering flexibility and cost control, aligning expenses directly with the application's usage patterns.

by Izetta at Oct 23, 2024, 02:14 AM

Limited Time Offer

25%

Off

Get Premium Databricks Certified Generative AI Engineer Associate Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Fernanda

7 months ago

Ah, the joys of scaling AI applications. I'd say option B is the way to go, but maybe they should also consider a backup plan just in case. You know, like a Plan B.

upvoted 0 times

...