Deal of The Day! Hurry Up, Grab the Special Discount - Save 25% - Ends In 00:00:00 Coupon code: SAVE25
Welcome to Pass4Success

- Free Preparation Discussions

Databricks Exam Databricks-Certified-Professional-Data-Scientist Topic 4 Question 73 Discussion

Actual exam question for Databricks's Databricks Certified Professional Data Scientist Exam exam
Question #: 73
Topic #: 4
[All Databricks Certified Professional Data Scientist Exam Questions]

What is the best way to evaluate the quality of the model found by an unsupervised algorithm like k-means clustering, given metrics for the cost of the clustering (how well it fits the data) and its stability (how similar the clusters are across multiple runs over the same data)?

Show Suggested Answer Hide Answer
Suggested Answer: A

Contribute your Thoughts:

Stefanie
3 days ago
Option C all the way. I'd rather have a super stable model, even if it's not the absolute lowest cost. Stability is key in unsupervised learning!
upvoted 0 times
...
Kris
4 days ago
I believe the optimal approach is to set a stability threshold and select the model that achieves the lowest cost above that threshold. This way we balance cost and stability effectively.
upvoted 0 times
...
Sharen
5 days ago
Definitely option A. The stability of the clusters is just as important as the cost, so we need to consider both factors.
upvoted 0 times
...
Gearldine
6 days ago
I agree, but we also need to consider stability. Maybe the most stable clustering subject to a minimal cost constraint?
upvoted 0 times
...
Rhea
7 days ago
I think the best way to evaluate the quality of the model is to find the lowest cost clustering subject to a stability constraint. That seems like the most balanced approach to me.
upvoted 0 times
...
Leila
8 days ago
This is a tough one, but I'm going with the lowest cost clustering subject to a stability constraint. Ain't no point in having super stable clusters if they don't fit the data well, am I right?
upvoted 0 times
...
Hana
11 days ago
I think the best way is to choose the lowest cost clustering.
upvoted 0 times
...
Casie
11 days ago
Hold up, what about that tradeoff though? I reckon the best answer is the one that balances cost and stability, like the question says. Gotta find that sweet spot, you know?
upvoted 0 times
...
Bettina
20 days ago
Nah man, you gotta consider stability too. The most stable clustering is the way to go, even if the cost is a bit higher. You don't want your clusters changing all the time, that's just confusing.
upvoted 0 times
...
Sheldon
21 days ago
I think the best way is to go with the lowest cost clustering, because that's the whole point of k-means, right? Stability is overrated. Just give me the clusters that fit the data the best!
upvoted 0 times
Daron
5 days ago
There is a tradeoff between cost and stability in unsupervised learning. The more tightly you fit the data, the less stable the model will be, and vice versa. The idea is to find a good balance with more weight given to the cost. Typically a good approach is to set a stability threshold and select the model that achieves the lowest cost above the stability threshold.
upvoted 0 times
...
Deane
9 days ago
A) The lowest cost clustering
upvoted 0 times
...
...

Save Cancel