P-EAGLE accelerates generative AI via parallel speculative decoding on SageMaker. This integration enables developers to deploy optimized real-time endpoints by selecting models from SageMaker JumpStart and configuring drafting specifications.
Opening Kapyn…