Deploying and Serving Challenges Flashcards
(7 cards)
What are the most common deployment and infrastructure challenges?
Model deployment complexity
Model monitoring and maintenance
Scalability and latency
What are model deployment complexities?
Converting a trained model into an API or micro service
Deployment formats can vary if models need special runtime environments
Ensuring compatibility across cloud platforms and edge devices
What are model monitoring and maintenance challenges?
Deployed models need monitoring to detect degradation overtime. Drift data can cause models to be outdated requiring retraining. Logging and tracking can help
What are scalability and latency problems?
Models need to be able to handle a large volume of requests and data, requiring scalable infrastructure and efficient serving techniques
What are infrastructure challenges
Setting up and maintaining the infrastructure for deploying and serving ML models can be complex, requiring expertise in cloud computing, networking, and DevOps
What are real time vs batch processing challenges
Choosing the right serving strategy (real-time or batch) depends on the application requirements and can impact performance and resource utilization
Why is monitoring and logging helpful?
Continuous monitoring of model performance and infrastructure health is essential for identifying and addressing issues promptly.