Reduce costs and latency with Amazon Bedrock Intelligent Prompt Routing and prompt caching (preview)
Today, Amazon Bedrock has introduced in preview two capabilities that help reduce costs and latency for generative AI applications: Amazon Bedrock Intelligent Prompt Routing – When invoking a model, you can now use a combination of foundation models (FMs) from the same model family to help optimize for quality and cost. For example, with the …