From the course: MLOps Essentials: Model Deployment and Monitoring

Unlock the full course today

Join today to access over 24,600 courses taught by industry experts.

Planning for infrastructure

Planning for infrastructure

- [Instructor] Infrastructure plays a vital part in delivering a stable and resilient serving for machine learning. Let's explore various elements and best practices for infrastructure in this video. There are two types of infrastructure elements that participate in serving: compute resources, and tools and technologies. Having enough compute resources are essential for maintaining operational performance goals like latency and concurrency. First, we need CPUs and GPUs to run the models with expected latency. GPUs are important for ML and need to be managed efficiently. Then comes memory to cache models and other data. Accelerators are available today for improving the performance of the models. Storage also becomes essential with the amount of data being captured and archived. Networking should be fast and reliable. Finally, resiliency features like standbys and backups are also needed for all the infrastructure…

Contents