Start free trial Sign in

From the course: Cloud-Based AI Solution Design Patterns

Unlock this course with a free trial

Join today to access over 24,900 courses taught by industry experts.

Serverless AI inference

Serverless AI inference

From the course: Cloud-Based AI Solution Design Patterns

Start my 1-month free trial Buy for my team

Serverless AI inference

“

- We introduced serverless computing earlier in Chapter 1 when we covered the serverless data pipeline pattern. If you haven't watched that video, then please do so before proceeding with this lesson. With the serverless AI inference pattern, we look at a way to leverage a serverless environment provided by the cloud provider in order to deploy an AI model together with the AI inference engine so that the underlying cloud infrastructure automatically takes care of all the scaling and virtual server provisioning for us. This effectively separates all of an AI system's inference processing into an environment that can fully support just about any range of processing demands and fluctuations that may occur. To understand the scenario, we need to take a closer look under the hood of an AI system to revisit the core modules that were introduced in the AI Foundations course. So as we know, any one of these modules can be placed in a container or encapsulated by a microservice to allow us to…

Contents

- (Locked)
  
  Next steps
  
  42s