How Amazon Search reduced ML inference costs by 85% with AWS Inferentia

Matt Vaughn
Date : September 22, 2022
Categories : AI/ML
Tags : ai ml ,ec2 ,artificial intelligence ,inferentia ,machine learning ,compute ,hpcblog

Amazon’s product search engine indexes billions of products, serves hundreds of millions of customers worldwide, and is one of the most heavily used services in the world. The Amazon Search team develops machine learning (ML) technology that powers the Amazon.com search engine and helps customers search effortlessly. To deliver a great customer experience and operate […]

Read the Post on the AWS Blog Channel