Amazon MLS-C01 Exam - Topic 2 Question 99 Discussion

Actual exam question for Amazon's MLS-C01 exam

Question #: 99
Topic #: 2

A Machine Learning Specialist is deciding between building a naive Bayesian model or a full Bayesian network for a classification problem. The Specialist computes the Pearson correlation coefficients between each feature and finds that their absolute values range between 0.1 to 0.95.

Which model describes the underlying data in this situation?

AA naive Bayesian model, since the features are all conditionally independent.

BA full Bayesian network, since the features are all conditionally independent.

CA naive Bayesian model, since some of the features are statistically dependent.

DA full Bayesian network, since some of the features are statistically dependent.

Show Suggested Answer

Suggested Answer: A

The AnalyzeDocument API action is the best option to generate a confidence score for each page of each contract. This API action analyzes an input document for relationships between detected items. The input document can be an image file in JPEG or PNG format, or a PDF file. The output is a JSON structure that contains the extracted data from the document. The FeatureTypes parameter specifies the types of analysis to perform on the document. The available feature types are TABLES, FORMS, and SIGNATURES. By setting the FeatureTypes parameter to SIGNATURES, the API action will detect and extract information about signatures from the document. The output will include a list of SignatureDetection objects, each containing information about a detected signature, such as its location and confidence score. The confidence score is a value between 0 and 100 that indicates the probability that the detected signature is correct. The output will also include a list of Block objects, each representing a document page. Each Block object will have a Page attribute that contains the page number and a Confidence attribute that contains the confidence score for the page. The confidence score for the page is the average of the confidence scores of the blocks that are detected on the page. The law firm can use the AnalyzeDocument API action to generate a confidence score for each page of each contract by using the SIGNATURES feature type and returning the confidence scores from the SignatureDetection and Block objects.

The other options are not suitable for generating a confidence score for each page of each contract. The Prediction API call is not an Amazon Textract API action, but a generic term for making inference requests to a machine learning model. The StartDocumentAnalysis API action is used to start an asynchronous job to analyze a document. The output is a job identifier (JobId) that is used to get the results of the analysis with the GetDocumentAnalysis API action. The GetDocumentAnalysis API action is used to get the results of a document analysis started by the StartDocumentAnalysis API action. The output is a JSON structure that contains the extracted data from the document. However, both the StartDocumentAnalysis and the GetDocumentAnalysis API actions do not support the SIGNATURES feature type, and therefore cannot detect signatures or provide confidence scores for them.

References:

* AnalyzeDocument

* SignatureDetection

* Block

* Amazon Textract launches the ability to detect signatures on any document

by Slyvia at Aug 07, 2024, 05:37 AM

Limited Time Offer

25%

Off

Get Premium MLS-C01 Questions as Interactive Web-Based Practice Test or PDF

Contribute your Thoughts:

Submit Cancel

Kate

3 months ago

Full Bayesian networks are more flexible with dependencies, so D makes sense.

upvoted 0 times

...

Cruz

3 months ago

I thought naive Bayes was the go-to for this kind of data!

upvoted 0 times

...

Daren

4 months ago

Wait, aren't naive Bayes models for independent features?

upvoted 0 times

...

Evangelina

4 months ago

Definitely leaning towards option D here.

upvoted 0 times

...

Mollie

4 months ago

The absolute values of correlation coefficients suggest some dependencies.

upvoted 0 times

...

Daniel

4 months ago

I recall that a naive Bayesian model is typically used when features are independent, but with some dependencies here, I might lean towards the full Bayesian network.

upvoted 0 times

...

Pamella

5 months ago

I’m a bit confused. If the features have high correlation, does that mean they can’t be independent? I feel like I need to review this more.

upvoted 0 times

...

Kimberely

5 months ago

I think I practiced a similar question where the dependencies mattered, so maybe a full Bayesian network is the way to go since some features are dependent.

upvoted 0 times

...

Pura

5 months ago

I remember that naive Bayes assumes features are independent, but with those correlation values, I'm not sure if that's the case here.

upvoted 0 times

...

Shayne

5 months ago

I'm a bit confused on this one. The wording about "conditionally independent" features is throwing me off. I'll need to revisit the differences between naive Bayesian and full Bayesian network models to make the right call.

upvoted 0 times

...

Louann

5 months ago

Okay, I've got this. Since the features are not completely independent, a full Bayesian network is the better choice here. The conditional independence assumption for the naive Bayesian model is not met.

upvoted 0 times

...

Cassi

5 months ago

Hmm, the question mentions that the features are "conditionally independent", which makes me think a naive Bayesian model might be the way to go. But the range of correlation coefficients is concerning. I'll need to review the assumptions for each model to decide.

upvoted 0 times

...

Karol

5 months ago

This is a tricky one. The Pearson correlation coefficients range from 0.1 to 0.95, which suggests that the features are not completely independent. I'm leaning towards a full Bayesian network, but I'll need to think it through carefully.

upvoted 0 times

...

Tora

5 months ago

This question is testing our understanding of the Inventory Management feature. I'll need to think through the options logically.

upvoted 0 times

...

Domonique

5 months ago

I'm leaning towards the Varying Attribute Dimension as well. That seems like the best way to capture the month-to-month changes in customer status. The other options don't seem as well-suited for this use case.

upvoted 0 times

...

Ilona

10 months ago

A Bayesian network, huh? Sounds like a great way to get caught in a web of probability distributions. Maybe we should just flip a coin and call it a day.

upvoted 0 times

Brittni

9 months ago

It's important to choose the model that best fits the underlying data to ensure accurate predictions.

upvoted 0 times

...

Mabel

9 months ago

Let's consider the complexity of the relationships in the data before making a decision.

upvoted 0 times

...

Lorenza

9 months ago

Full Bayesian network would be more suitable for complex relationships with higher correlation coefficients.

upvoted 0 times

...

Lourdes

9 months ago

Naive Bayesian model would be simpler to implement with lower correlation coefficients.

upvoted 0 times

...

Daniela

10 months ago

Let's not get too Bayes-ic here. I'd go with the naïve approach, unless you want to get tangled up in all those conditional probabilities.

upvoted 0 times

Paulina

9 months ago

D) A full Bayesian network, since some of the features are statistically dependent.

upvoted 0 times

...

Lenna

9 months ago

I agree, keeping it simple with the naive approach makes sense.

upvoted 0 times

...

Delisa

9 months ago

A) A naive Bayesian model, since the features are all conditionally independent.

upvoted 0 times

...

Lera

10 months ago

Hmm, I'm not sure. If the features are all over the place, from 0.1 to 0.95 correlation, that sounds like a job for a full Bayesian network to me.

upvoted 0 times

Claudia

9 months ago

B) A full Bayesian network, since some of the features are statistically dependent.

upvoted 0 times

...

Jeannine

10 months ago

A) A naive Bayesian model, since the features are all conditionally independent.

upvoted 0 times

...

Magda

11 months ago

Hold up, if the features are conditionally independent, wouldn't a full Bayesian network be overkill? Just keep it simple with a naive Bayes.

upvoted 0 times

Reed

9 months ago

C) A naive Bayesian model, since some of the features are statistically dependent.

upvoted 0 times

...

Ryan

9 months ago

Hold up, if the features are conditionally independent, wouldn't a full Bayesian network be overkill? Just keep it simple with a naive Bayes.

upvoted 0 times

...

Krissy

10 months ago

A) A naive Bayesian model, since the features are all conditionally independent.

upvoted 0 times

...

Cecily

11 months ago

A naive Bayesian model would be the better choice here since the features have varying degrees of correlation, indicating some statistical dependence.

upvoted 0 times

...