New Amazon Bedrock service tiers help you match AI workload performance with cost

Source: AWS Blog

Amazon Bedrock now offers three distinct service tiers—Priority, Standard, and Flex—enabling users to select options that best fit their AI workload needs. The Priority tier is designed for mission-critical applications, prioritizing requests for real-time interactions, whereas the Standard tier caters to everyday tasks with consistent performance. The Flex tier, more cost-effective, is intended for workloads that can afford longer response times, such as content summarization or multistep workflows.

Each tier provides flexibility in managing both cost and performance, catering to differing requirements. For example, businesses can optimize their AI applications by selecting the appropriate tier for the task at hand, potentially gaining up to 25% better output tokens per second latency in the Priority tier compared to Standard. Using tools like the AWS Pricing Calculator and Amazon CloudWatch enhances visibility into usage and costs, enabling further fine-tuning of workload management. The adjustments will allow organizations to cater efficiently to varying urgency and performance needs, ensuring optimal resource allocation across their AI applications.

👉 Pročitaj original: AWS Blog