_posts/2021-6-14-pytorch-1.9-new-library-releases.md
Today, we are announcing updates to a number of PyTorch libraries, alongside the PyTorch 1.9 release.
Some highlights include:
* **TorchVision** - Added new SSD and SSDLite models, quantized kernels for object detection, GPU JPEG decoding, and iOS support. See the [release notes](https://github.com/pytorch/vision/releases).
* **TorchAudio** - Added a wav2vec 2.0 model deployable in non-Python environments (including C++, Android, and iOS), many performance improvements to lfilter, spectral operations, and resampling, options for quality control in resampling (i.e., Kaiser window support), the start of a migration to native complex tensor operations, and improved autograd support. See the [release notes](https://github.com/pytorch/audio/releases).
* **TorchText** - Added a new high-performance Vocab module that provides common functional APIs for NLP workflows. See the [release notes](https://github.com/pytorch/text/releases).
We’d like to thank the community for their support and work on this latest release.
Features in PyTorch releases are classified as Stable, Beta, and Prototype.
# TorchVision 0.10
### (Stable) Quantized kernels for object detection
The forward pass of the nms and roi_align operators now supports tensors with a quantized dtype, which can help lower the memory footprint of object detection models, particularly in mobile environments. For more details, refer to [the documentation](https://pytorch.org/vision/stable/ops.html#torchvision.ops.roi_align).
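As a refresher on what the operator computes, non-max suppression can be sketched in a few lines of plain Python (an illustration of the algorithm only, not the quantized torchvision kernel):

```python
def iou(a, b):
    """Intersection-over-union of two boxes in (x1, y1, x2, y2) form."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area_a + area_b - inter)

def nms(boxes, scores, iou_threshold):
    """Greedily keep the highest-scoring box, dropping any remaining box
    whose IoU with it exceeds the threshold; returns kept indices."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        order = [i for i in order if iou(boxes[best], boxes[i]) <= iou_threshold]
    return keep

boxes = [(0, 0, 10, 10), (1, 1, 10, 10), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
print(nms(boxes, scores, iou_threshold=0.5))  # [0, 2]: the two overlapping boxes collapse to one
```

Running this on quantized tensors instead of Python lists is exactly where the new kernels save memory.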
### (Stable) Speed optimizations for Tensor transforms
The resize and flip transforms have been optimized, and their runtime has improved by up to 5x on the CPU.
### (Stable) Documentation improvements
Significant improvements were made to the documentation. In particular, a new gallery of examples is available. These examples visually illustrate how each transform acts on an image, and the output of the segmentation models is now properly documented and illustrated.
The example gallery will be extended in the future to provide more comprehensive examples and serve as a reference for common torchvision tasks. For more details, refer to [the documentation](https://pytorch.org/vision/stable/auto_examples/index.html).
### (Beta) New models for detection
[SSD](https://arxiv.org/abs/1512.02325) and [SSDlite](https://arxiv.org/abs/1801.04381) are two popular object detection architectures that are efficient in terms of speed and provide good results for low-resolution pictures. In this release, we provide implementations for the original SSD model with a VGG16 backbone and for its mobile-friendly variant SSDlite with a MobileNetV3-Large backbone.
The models were pre-trained on COCO train2017 and can be used as follows:
Accuracy numbers on COCO val2017 are available in the release notes.
TorchVision 0.10 now provides pre-compiled iOS binaries for its C++ operators, which can be used in iOS applications.
# TorchAudio 0.9.0
### (Stable) Complex Tensor Migration
TorchAudio has functions that handle complex-valued tensors. These functions follow a convention of using an extra dimension to represent the real and imaginary parts. In PyTorch 1.6, the native complex type was introduced. As its API stabilizes, torchaudio has started to migrate to the native complex type.
In this release, we added support for native complex tensors, and you can opt in to use them. Using the native complex types, we have verified that affected functions continue to support autograd and TorchScript; moreover, switching to native complex types improves their performance. For more details, refer to [pytorch/audio#1337](https://github.com/pytorch/audio/issues/1337).
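To make the two representations concrete, here is a plain-Python illustration of the convention (in PyTorch itself, `torch.view_as_complex` and `torch.view_as_real` convert between the two layouts):

```python
# Pseudo-complex convention: each value is a [real, imag] pair stored in an
# extra trailing dimension; native representation: one complex number.

def to_native(pseudo):
    """Convert a list of [real, imag] pairs to Python complex numbers."""
    return [complex(re, im) for re, im in pseudo]

def to_pseudo(native):
    """Convert complex numbers back to [real, imag] pairs."""
    return [[z.real, z.imag] for z in native]

pseudo = [[1.0, 2.0], [3.0, -4.0]]   # shape (..., 2) in tensor terms
native = to_native(pseudo)
print(native)                         # [(1+2j), (3-4j)]
assert to_pseudo(native) == pseudo    # lossless round-trip
```

With native complex dtypes, spectral functions can drop the trailing dimension bookkeeping entirely.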
We have added the model architectures from [Wav2Vec2.0](https://arxiv.org/abs/2006.11477).
The following code snippet illustrates such a use case. Please check out our [c++ example directory](https://github.com/pytorch/audio/tree/master/examples/libtorchaudio) for the complete example. Currently, it is designed for running inference. If you would like more support for training, please file a feature request.
```python
# Import fine-tuned model from Hugging Face Hub
from transformers import Wav2Vec2ForCTC
from torchaudio.models.wav2vec2.utils import import_huggingface_model

original = Wav2Vec2ForCTC.from_pretrained("facebook/wav2vec2-base-960h")
imported = import_huggingface_model(original)

# Import fine-tuned model from fairseq
import fairseq
from torchaudio.models.wav2vec2.utils import import_fairseq_model
```
For more details, see [the documentation](https://pytorch.org/audio/0.9.0/models.html#wav2vec2-0).
In release 0.8, we vectorized the operation in ```torchaudio.compliance.kaldi.resample_waveform```.
We have:
* Added Kaiser window support for a wider range of resampling quality.
* Added ```rolloff``` parameter for anti-aliasing control.
* Added the mechanism to precompute the kernel and cache it in ```torchaudio.transforms.Resample``` for even faster operation.
* Moved the implementation from ```torchaudio.compliance.kaldi.resample_waveform``` to ```torchaudio.functional.resample``` and deprecated ```torchaudio.compliance.kaldi.resample_waveform```.
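For intuition about the Kaiser window option, here is a small pure-Python sketch of the window itself (an illustration, not torchaudio's implementation); the `beta` parameter trades main-lobe width against side-lobe attenuation in the interpolation filter:

```python
import math

def bessel_i0(x, terms=25):
    """Zeroth-order modified Bessel function I0(x) via its power series."""
    total, term = 1.0, 1.0
    for k in range(1, terms):
        term *= (x / (2.0 * k)) ** 2  # term_k = (x/2)^(2k) / (k!)^2
        total += term
    return total

def kaiser_window(n, beta):
    """Symmetric Kaiser window of length n."""
    denom = bessel_i0(beta)
    return [
        bessel_i0(beta * math.sqrt(1.0 - (2.0 * i / (n - 1) - 1.0) ** 2)) / denom
        for i in range(n)
    ]

w = kaiser_window(9, beta=14.0)
# beta=0 degenerates to a rectangular window; larger beta tapers harder.
assert all(abs(v - 1.0) < 1e-12 for v in kaiser_window(9, beta=0.0))
assert abs(w[4] - 1.0) < 1e-12  # peak at the center
```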
For more details, see [the documentation](https://pytorch.org/audio/0.9.0/transforms.html#resample).