Posted June 25, 20213 yr Amazon EC2 Inf1 instances and AWS Neuron now support YOLOv5 and ResNext deep learning models as well as the latest open-source Hugging Face Transformers. We have also optimized the Neuron compiler to enhance performance and you can now achieve an out-of-the box 12X higher throughput than comparable GPU-based instances for pre-trained BERT base models. These enhancements enable you to effectively meet your high-performance inference requirements and deploy state of the art deep learning models at low cost. View the full article
Join the conversation
You can post now and register later. If you have an account, sign in now to post with your account.