LLMs Demand Observability-Driven Development
Honeycomb
SEPTEMBER 20, 2023
Some of these things are related to cost/benefit tradeoffs, but most are about weak telemetry, instrumentation, and tooling. Instead, ML teams typically build evaluation systems to measure the effectiveness of the model or prompt. You can’t really write unit tests for this (nor practice TDD). With LLMs, you might not.