You are creating a new Azure Machine Learning pipeline using the designer.
The pipeline must train a model using data in a comma-separated values (CSV) file that is published on a
website. You have not created a dataset for this file.
You need to ingest the data from the CSV file into the designer pipeline using the minimal administrative effort.
Which module should you add to the pipeline in Designer?
The preferred way to provide data to a pipeline is a Dataset object. The Dataset object points to data that lives in or is accessible from a datastore or at a Web URL. The Dataset class is abstract, so you will create an instance of either a FileDataset (referring to one or more files) or a TabularDataset that's created by from one or more files with delimited columns of data.
Example:
from azureml.core import Dataset
iris_tabular_dataset = Dataset.Tabular.from_delimited_files([(def_blob_store, 'train-dataset/iris.csv')])
https://docs.microsoft.com/en-us/azure/machine-learning/how-to-create-your-first-pipeline
Abel
3 months agoErnie
3 months agoTrevor
3 months agoLucy
4 months agoDenae
4 months agoDevora
4 months agoFranchesca
4 months agoTruman
4 months agoVanna
5 months agoAdolph
5 months agoBrendan
5 months agoLaticia
5 months ago