Free Microsoft DP-100 Exam Dumps

This page collects free questions for the Microsoft Designing and Implementing a Data Science Solution on Azure (DP-100) exam. It also links to recently updated premium files that you can use to practice for the actual exam. The premium versions are provided as DP-100 practice tests, both as desktop software and as a browser-based application, so you can use whichever suits your style. Feel free to try the premium files for free. Good luck with your exam.

Question No: 31

MultipleChoice

A set of CSV files contains sales records. All the CSV files have the same data schema.

Each CSV file contains the sales records for a particular month and has the filename sales.csv. Each file is stored in a folder that indicates the month and year when the data was recorded. The folders are in an Azure blob container for which a datastore has been defined in an Azure Machine Learning workspace. The folders are organized in a parent folder named sales to create the following hierarchical structure:

At the end of each month, a new folder with that month's sales file is added to the sales folder.

You plan to use the sales data to train a machine learning model based on the following requirements:

* You must define a dataset that loads all of the sales data to date into a structure that can be easily converted to a dataframe.

* You must be able to create experiments that use only data that was created before a specific previous month, ignoring any data that was added after that month.

* You must register the minimum number of datasets possible.

You need to register the sales data as a dataset in the Azure Machine Learning service workspace.

What should you do?

Options

A. Create a tabular dataset that references the datastore and explicitly specifies each 'sales/mm-yyyy/sales.csv' file every month. Register the dataset with the name sales_dataset each month, replacing the existing dataset and specifying a tag named month indicating the month and year it was registered. Use this dataset for all experiments.

B. Create a tabular dataset that references the datastore and specifies the path 'sales/*/sales.csv', register the dataset with the name sales_dataset and a tag named month indicating the month and year it was registered, and use this dataset for all experiments.

C. Create a new tabular dataset that references the datastore and explicitly specifies each 'sales/mm-yyyy/sales.csv' file every month. Register the dataset with the name sales_dataset_MM-YYYY each month with appropriate MM and YYYY values for the month and year. Use the appropriate month-specific dataset for experiments.

D. Create a tabular dataset that references the datastore and explicitly specifies each 'sales/mm-yyyy/sales.csv' file. Register the dataset with the name sales_dataset each month as a new version and with a tag named month indicating the month and year it was registered. Use this dataset for all experiments, identifying the version to be used based on the month tag as necessary.

Answer
B

Explanation

Specify the path with a wildcard ('sales/*/sales.csv') so that a single registered dataset covers every monthly folder.

Example:

The following code gets the existing workspace and the desired datastore by name. It then passes the datastore and file locations to the path parameter to create a new TabularDataset, weather_ds.

from azureml.core import Workspace, Datastore, Dataset

datastore_name = 'your datastore name'

# get existing workspace
workspace = Workspace.from_config()

# retrieve an existing datastore in the workspace by name
datastore = Datastore.get(workspace, datastore_name)

# create a TabularDataset from 3 file paths in datastore
datastore_paths = [(datastore, 'weather/2018/11.csv'),
                   (datastore, 'weather/2018/12.csv'),
                   (datastore, 'weather/2019/*.csv')]

weather_ds = Dataset.Tabular.from_delimited_files(path=datastore_paths)
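
For the sales scenario in the question, the point of a wildcard path such as 'sales/*/sales.csv' is that one dataset definition keeps matching new monthly folders as they are added. The sketch below illustrates that idea with Python's fnmatch module as a stand-in for the datastore's path matching. The folder names are hypothetical examples of the mm-yyyy layout, and Azure ML's own wildcard handling may differ in details.

```python
from fnmatch import fnmatchcase

# Hypothetical blob paths following the sales/mm-yyyy/sales.csv layout.
blob_paths = [
    "sales/01-2019/sales.csv",
    "sales/02-2019/sales.csv",
    "sales/03-2019/sales.csv",
    "sales/readme.txt",  # not a monthly sales file
]

PATTERN = "sales/*/sales.csv"


def matching_files(paths, pattern=PATTERN):
    """Return the paths selected by the wildcard pattern."""
    return [p for p in paths if fnmatchcase(p, pattern)]


before = matching_files(blob_paths)

# A new month arrives; the same pattern picks it up without a new definition.
blob_paths.append("sales/04-2019/sales.csv")
after = matching_files(blob_paths)

print(before)
print(after)
```

With the azureml.core SDK used in the example above, the corresponding call would pass (datastore, 'sales/*/sales.csv') as the path argument of Dataset.Tabular.from_delimited_files.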

Question No: 32

Hotspot

You are a lead data scientist for a project that tracks the health and migration of birds. You create a multi-class image classification deep learning model that uses a set of labeled bird photos collected by experts. You plan to use the model to develop a cross-platform mobile app that predicts the species of bird captured by app users.

You must test and deploy the trained model as a web service. The deployed model must meet the following requirements:

* An authenticated connection must not be required for testing.

* The deployed model must perform with low latency during inferencing.

* The REST endpoints must be scalable and should have the capacity to handle a large number of requests when multiple end users are using the mobile application.

You need to verify that the web service returns predictions in the expected JSON format when a valid REST request is submitted.

Which compute resources should you use? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

Answer

Explanation

Box 1: ds-workstation notebook VM

An authenticated connection must not be required for testing.

On a Microsoft Azure virtual machine (VM), including a Data Science Virtual Machine (DSVM), you create local user accounts while provisioning the VM. Users then authenticate to the VM by using these credentials.

Box 2: gpu-compute cluster

Image classification is well suited to GPU compute clusters.

https://docs.microsoft.com/en-us/azure/machine-learning/data-science-virtual-machine/dsvm-common-identity

https://docs.microsoft.com/en-us/azure/architecture/reference-architectures/ai/training-deep-learning

Question No: 33

DragDrop

You plan to explore demographic data for home ownership in various cities. The data is in a CSV file with the following format:

age,city,income,home_owner

21,Chicago,50000,0

35,Seattle,120000,1

23,Seattle,65000,0

45,Seattle,130000,1

18,Chicago,48000,0

You need to run an experiment in your Azure Machine Learning workspace to explore the data and log the results. The experiment must log the following information:

* the number of observations in the dataset

* a box plot of income by home_owner

* a dictionary containing the city names and the average income for each city

You need to use the appropriate logging methods of the experiment's run object to log the required information.

How should you complete the code? To answer, drag the appropriate code segments to the correct locations. Each code segment may be used once, more than once, or not at all. You may need to drag the split bar between panes or scroll to view content.

NOTE: Each correct selection is worth one point.

Answer

Explanation

Box 1: log

The number of observations in the dataset.

run.log(name, value, description='')

Scalar values: Log a numerical or string value to the run with the given name. Logging a metric to a run causes that metric to be stored in the run record in the experiment. You can log the same metric multiple times within a run, the result being considered a vector of that metric.

Example: run.log('accuracy', 0.95)

Box 2: log_image

A box plot of income by home_owner.

log_image: Log an image to the run record. Use log_image to log a .PNG image file or a matplotlib plot to the run. These images will be visible and comparable in the run record.

Example: run.log_image('ROC', plot=plt)

Box 3: log_table

A dictionary containing the city names and the average income for each city.

log_table: Log a dictionary object to the run with the given name.
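
As a rough illustration of the values behind Box 1 and Box 3, the sketch below computes the observation count and the average income per city from the sample rows in the question, using only the standard library. The metric names in the closing comments are hypothetical, and the box plot for Box 2 is left out because it needs a plotting library.

```python
import csv
import io
from collections import defaultdict

# Sample rows copied from the question.
CSV_TEXT = """age,city,income,home_owner
21,Chicago,50000,0
35,Seattle,120000,1
23,Seattle,65000,0
45,Seattle,130000,1
18,Chicago,48000,0
"""

rows = list(csv.DictReader(io.StringIO(CSV_TEXT)))

# Scalar metric: the number of observations.
row_count = len(rows)

# Average income per city, keyed by city name.
incomes_by_city = defaultdict(list)
for row in rows:
    incomes_by_city[row["city"]].append(int(row["income"]))
mean_incomes = {city: sum(v) / len(v) for city, v in incomes_by_city.items()}

# Column-oriented dictionary, one list per column.
table = {
    "city": list(mean_incomes),
    "mean_income": list(mean_incomes.values()),
}

print(row_count)     # 5
print(mean_incomes)  # {'Chicago': 49000.0, 'Seattle': 105000.0}

# Inside an Azure ML run, these values would go to the methods named above
# (the metric names are hypothetical):
#   run.log('observations', row_count)
#   run.log_table('mean_income_by_city', table)
```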

Question No: 34

MultipleChoice

You create an Azure Machine Learning workspace.

You must create a custom role named DataScientist that meets the following requirements:

* Role members must not be able to delete the workspace.

* Role members must not be able to create, update, or delete compute resources in the workspace.

* Role members must not be able to add new users to the workspace.

You need to create a JSON file for the DataScientist role in the Azure Machine Learning workspace.

The custom role must enforce the restrictions specified by the IT Operations team.

Which JSON code segment should you use?

Options

A. Option A

B. Option B

C. Option C

D. Option D

Answer
A

Explanation

The following custom role can do everything in the workspace except for the following actions:

* It can't create or update a compute resource.

* It can't delete a compute resource.

* It can't add, delete, or alter role assignments.

* It can't delete the workspace.

To create a custom role, first construct a role definition JSON file that specifies the permission and scope for the role. The following example defines a custom role named 'Data Scientist Custom' scoped at a specific workspace level:

data_scientist_custom_role.json:

{
    "Name": "Data Scientist Custom",
    "IsCustom": true,
    "Description": "Can run experiment but can't create or delete compute.",
    "Actions": ["*"],
    "NotActions": [
        "Microsoft.MachineLearningServices/workspaces/*/delete",
        "Microsoft.MachineLearningServices/workspaces/write",
        "Microsoft.MachineLearningServices/workspaces/computes/*/write",
        "Microsoft.MachineLearningServices/workspaces/computes/*/delete",
        "Microsoft.Authorization/*/write"
    ],
    "AssignableScopes": [
        "/subscriptions/<subscription_id>/resourceGroups/<resource_group_name>/providers/Microsoft.MachineLearningServices/workspaces/<workspace_name>"
    ]
}

https://docs.microsoft.com/en-us/azure/machine-learning/how-to-assign-roles
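
To see how Actions and NotActions combine, the sketch below loads a trimmed copy of the role definition and checks a few operations against it. It approximates the wildcard matching with Python's fnmatch module, which is an assumption made for illustration and not Azure's actual evaluation logic; the is_allowed helper is hypothetical.

```python
import json
from fnmatch import fnmatchcase

# Trimmed copy of the role definition (AssignableScopes omitted).
ROLE_JSON = """
{
  "Name": "Data Scientist Custom",
  "Actions": ["*"],
  "NotActions": [
    "Microsoft.MachineLearningServices/workspaces/*/delete",
    "Microsoft.MachineLearningServices/workspaces/write",
    "Microsoft.MachineLearningServices/workspaces/computes/*/write",
    "Microsoft.MachineLearningServices/workspaces/computes/*/delete",
    "Microsoft.Authorization/*/write"
  ]
}
"""

role = json.loads(ROLE_JSON)


def is_allowed(action, role_def):
    """Allowed if an Actions pattern matches and no NotActions pattern does."""
    granted = any(fnmatchcase(action, p) for p in role_def["Actions"])
    excluded = any(fnmatchcase(action, p) for p in role_def["NotActions"])
    return granted and not excluded


checks = {
    op: is_allowed(op, role)
    for op in (
        "Microsoft.MachineLearningServices/workspaces/read",
        "Microsoft.MachineLearningServices/workspaces/write",
        "Microsoft.Authorization/roleAssignments/write",
    )
}

for op, allowed in checks.items():
    print(op, "->", "allowed" if allowed else "denied")
```

The design point is that NotActions subtracts from whatever Actions grants, so a broad '*' grant stays usable while the listed operations are carved out.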

Question No: 35

DragDrop

You need to visually identify whether outliers exist in the Age column and quantify the outliers before the outliers are removed.

Which three Azure Machine Learning Studio modules should you use in sequence? To answer, move the appropriate modules from the list of modules to the answer area and arrange them in the correct order.

Create Scatterplot

Summarize Data

Clip Values

You can use the Clip Values module in Azure Machine Learning Studio to identify and optionally replace data values that are above or below a specified threshold. This is useful when you want to remove outliers or replace them with a mean, a constant, or another substitute value.
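
The clipping idea described above can be sketched in plain Python. This is only an illustration of the concept, not the Studio module itself: the ages, the threshold of 100, and the clip_peaks helper are hypothetical, and taking the mean over the in-range values is an assumption of this sketch.

```python
from statistics import mean


def clip_peaks(values, upper, substitute="threshold"):
    """Replace values above `upper` and report how many were replaced.

    substitute="threshold" uses the threshold itself; substitute="mean"
    uses the mean of the values that are within range.
    """
    in_range = [v for v in values if v <= upper]
    if substitute == "threshold":
        replacement = upper
    elif substitute == "mean":
        replacement = mean(in_range)
    else:
        raise ValueError("substitute must be 'threshold' or 'mean'")
    clipped = [v if v <= upper else replacement for v in values]
    outlier_count = len(values) - len(in_range)
    return clipped, outlier_count


# Hypothetical Age column with two implausible entries.
ages = [23, 31, 27, 45, 130, 38, 29, 210]

clipped, outlier_count = clip_peaks(ages, upper=100)
print(outlier_count)  # 2
print(clipped)        # [23, 31, 27, 45, 100, 38, 29, 100]

mean_clipped, _ = clip_peaks(ages, upper=100, substitute="mean")
print(mean_clipped)
```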

Answer

Question No: 36

MultipleChoice

Note: This question is part of a series of questions that present the same scenario. Each question in the series contains a unique solution that might meet the stated goals. Some question sets might have more than one correct solution, while others might not have a correct solution.

After you answer a question in this section, you will NOT be able to return to it. As a result, these questions will not appear in the review screen.

You are a data scientist using Azure Machine Learning Studio.

You need to normalize values to produce an output column into bins to predict a target column.

Solution: Apply an Equal Width with Custom Start and Stop binning mode.

Does the solution meet the goal?

Options

A. Yes

B. No

Answer
B

Question No: 37

OrderList

You have a dataset that contains over 150 features. You use the dataset to train a Support Vector Machine (SVM) binary classifier.

You need to use the Permutation Feature Importance module in Azure Machine Learning Studio to compute a set of feature importance scores for the dataset.

In which order should you perform the actions? To answer, move all actions from the list of actions to the answer area and arrange them in the correct order.

Answer

Step 1: Add a dataset to the experiment.

Step 2: Add a Two-Class Support Vector Machine module to initialize the SVM classifier.

Step 3: Add a Permutation Feature Importance module and connect the trained model and test dataset.

Step 4: Set the Metric for measuring performance property to Classification - Accuracy and then run the experiment.
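
The idea behind permutation feature importance can be sketched in a few lines of plain Python: shuffle one feature column in the test set at a time and record how much the accuracy drops. This illustrates the general technique and is not the Studio module's implementation. The toy_model function is a hypothetical stand-in for the trained SVM, and the test data is randomly generated.

```python
import random


def accuracy(model, rows, labels):
    """Fraction of rows for which the model predicts the given label."""
    correct = sum(1 for r, y in zip(rows, labels) if model(r) == y)
    return correct / len(labels)


def permutation_importance(model, rows, labels, n_features, seed=0):
    """Accuracy drop per feature after shuffling that feature's column."""
    rng = random.Random(seed)
    baseline = accuracy(model, rows, labels)
    scores = []
    for j in range(n_features):
        column = [r[j] for r in rows]
        rng.shuffle(column)
        permuted = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
        scores.append(baseline - accuracy(model, permuted, labels))
    return scores


def toy_model(row):
    """Hypothetical stand-in for a trained classifier; uses only feature 0."""
    return 1 if row[0] > 0.5 else 0


data_rng = random.Random(42)
test_rows = [[data_rng.random(), data_rng.random()] for _ in range(200)]
test_labels = [toy_model(r) for r in test_rows]

scores = permutation_importance(toy_model, test_rows, test_labels, n_features=2)
print(scores)
```

Because toy_model ignores feature 1, shuffling that column cannot change any prediction and its score is exactly zero, while any drop in accuracy comes from shuffling feature 0.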

Question No: 38

Hotspot

You are creating a machine learning model in Python. The provided dataset contains several numerical columns and one text column.

* Bikes

* Cars

* Vans

* Boats

You are building a regression model using the scikit-learn Python package.

You need to transform the text data to be compatible with the scikit-learn Python package.

How should you complete the code segment? To answer, select the appropriate options in the answer area.

NOTE: Each correct selection is worth one point.

Answer

Question No: 39

MultipleChoice

You plan to use a Deep Learning Virtual Machine (DLVM) to train deep learning models using Compute Unified Device Architecture (CUDA) computations.

You need to configure the DLVM to support CUDA.

What should you implement?

Options

A. Intel Software Guard Extensions (Intel SGX) technology

B. Solid State Drives (SSD)

C. Graphics Processing Unit (GPU)

D. Central Processing Unit (CPU) speed increase by using overclocking

E. High Random Access Memory (RAM) configuration

Answer
C

Question No: 40

MultipleChoice

You are determining if two sets of data are significantly different from one another by using Azure Machine Learning Studio.

Estimated values in one set of data may be more than or less than reference values in the other set of data. You must produce a distribution that has a constant Type I error as a function of the correlation.

You need to produce the distribution.

Which type of distribution should you produce?

Options

A. Paired t-test with a two-tail option

B. Unpaired t-test with a two-tail option

C. Paired t-test with a one-tail option

D. Unpaired t-test with a one-tail option

Answer
A
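
To make the terms "paired" and "two-tail" in the options concrete, the sketch below computes a paired t statistic on hypothetical data and compares its absolute value with a two-tailed critical value. The measurements are invented for illustration, and the critical value 2.776 is the standard t-table entry for 4 degrees of freedom at a 0.05 significance level.

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical paired measurements: one estimate per reference value.
reference = [10.0, 12.0, 9.0, 11.0, 13.0]
estimated = [10.5, 12.4, 9.6, 11.3, 13.7]


def paired_t_statistic(a, b):
    """t statistic of the per-pair differences a[i] - b[i]."""
    diffs = [x - y for x, y in zip(a, b)]
    standard_error = stdev(diffs) / sqrt(len(diffs))
    return mean(diffs) / standard_error


t_stat = paired_t_statistic(estimated, reference)

# Two-tailed critical value for 4 degrees of freedom at alpha = 0.05.
T_CRITICAL = 2.776

# Two-tailed: the sign of the difference does not matter, only its size.
significant = abs(t_stat) > T_CRITICAL
print(round(t_stat, 3), significant)
```

Taking the absolute value is what makes the comparison two-tailed: estimates that run above or below the reference values are treated the same way.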
