An online delivery company wants to choose the fastest courier for each delivery at the moment an order is placed. The company wants to implement this feature for existing users and new users of its application. Data scientists have trained separate models with XGBoost for this purpose, and the models are stored in Amazon S3. There is one model fof each city where the company operates.
The engineers are hosting these models in Amazon EC2 for responding to the web client requests, with one instance for each model, but the instances have only a 5% utilization in CPU and memory, ....operation engineers want to avoid managing unnecessary resources.
Which solution will enable the company to achieve its goal with the LEAST operational overhead?
The best solution for this scenario is to use a multi-model endpoint in Amazon SageMaker, which allows hosting multiple models on the same endpoint and invoking them dynamically at runtime. This way, the company can reduce the operational overhead of managing multiple EC2 instances and model servers, and leverage the scalability, security, and performance of SageMaker hosting services. By using a multi-model endpoint, the company can also save on hosting costs by improving endpoint utilization and paying only for the models that are loaded in memory and the API calls that are made. To use a multi-model endpoint, the company needs to prepare a Docker container based on the open-source multi-model server, which is a framework-agnostic library that supports loading and serving multiple models from Amazon S3. The company can then create a multi-model endpoint in SageMaker, pointing to the S3 bucket containing all the models, and invoke the endpoint from the web client at runtime, specifying the TargetModel parameter according to the city of each request. This solution also enables the company to add or remove models from the S3 bucket without redeploying the endpoint, and to use different versions of the same model for different cities if needed.References:
Use Docker containers to build models
Host multiple models in one container behind one endpoint
Multi-model endpoints using Scikit Learn
Multi-model endpoints using XGBoost
Macy
2 months agoTammara
2 months agoAilene
2 months agoAlesia
3 months agoTabetha
3 months agoAlita
3 months agoMalika
3 months agoErick
3 months agoShalon
4 months agoGerri
4 months agoDomitila
4 months agoMaile
4 months agoBrynn
4 months agoTiffiny
5 months agoLenna
5 months agoBrigette
5 months agoZena
5 months agoMarge
5 months agoAntonio
6 months agoTegan
11 months agoTherese
11 months agoCharlena
11 months agoLeontine
11 months agoDustin
10 months agoAshley
10 months agoArleen
10 months agoEmerson
12 months agoEvangelina
12 months agoAlpha
12 months agoMollie
12 months agoHailey
12 months agoCatalina
11 months agoTracie
11 months agoFidelia
11 months agoGraham
1 year agoAlpha
1 year ago