From the course: Cloud-Based AI Solution Design Patterns
Unlock this course with a free trial
Join today to access over 24,900 courses taught by industry experts.
Serverless AI inference
From the course: Cloud-Based AI Solution Design Patterns
Serverless AI inference
- We introduced serverless computing earlier in Chapter 1 when we covered the serverless data pipeline pattern. If you haven't watched that video, then please do so before proceeding with this lesson. With the serverless AI inference pattern, we look at a way to leverage a serverless environment provided by the cloud provider in order to deploy an AI model together with the AI inference engine so that the underlying cloud infrastructure automatically takes care of all the scaling and virtual server provisioning for us. This effectively separates all of an AI system's inference processing into an environment that can fully support just about any range of processing demands and fluctuations that may occur. To understand the scenario, we need to take a closer look under the hood of an AI system to revisit the core modules that were introduced in the AI Foundations course. So as we know, any one of these modules can be placed in a container or encapsulated by a microservice to allow us to…