Skip to content

Commit d493d7b

Browse files
authored
Merge pull request #1260 from pytorch/msaroufim-patch-2
Remove title heading from torchserve blog post
2 parents 49469f4 + 04b1807 commit d493d7b

File tree

1 file changed

+0
-2
lines changed

1 file changed

+0
-2
lines changed

_posts/2022-12-28-torchserve-performance-tuning.md

Lines changed: 0 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -5,8 +5,6 @@ author: Hamid Shojanazeri, Geeta Chauhan, Mark Saroufim, Jesse Smith
55
featured-img: "assets/images/sketch_animator.png"
66
---
77

8-
## Serving models in production
9-
108
In this post we discuss performance tuning of Torchserve for serving your models in production. One of the biggest challenges in the life cycle of a ML project is deploying models in production. This requires a reliable serving solution along with solutions that address the MLOps needs. A robust serving solution needs to provide support for multi model serving, model versioning, metric logging, monitoring and scaling to serve the peak traffic. In this post, we will have an overview of Torchserve and how to tune its performance for production use-cases. We discuss the [Animated Drawings app](https://ai.facebook.com/blog/using-ai-to-bring-childrens-drawings-to-life/) from Meta that can turn your human figure sketches to animations and how it could serve the peak traffic with Torchserve. The Animated Drawing’s workflow is below.
119

1210
<p>

0 commit comments

Comments
 (0)