Improve LLM performance with human and AI feedback on Amazon SageMaker for Amazon Engineering
AWS Machine Learning - AI
APRIL 24, 2024
To increase the number of training samples for better learning, we also used another LLM to generate feedback scores. We present the reinforcement learning process and the benchmarking results that demonstrate the improvement in LLM performance. Other users provided scores and explained in their notes how they justified their assessment of the LLM answers.