Skip to content
Cette page a été générée et traduite avec l'aide de l'IA. Si vous remarquez des inexactitudes, n'hésitez pas à contribuer. Modifier sur GitHub

L2: Prompt Evolution

Layer 2 opere at medium frequency to refine la reponse de l'agent system prompts. It analyzes conversation quality metrics and proposes prompt modifications, testing them via unn A/B framework before permanent adoption.

Apercu

L2 evolution addresses:

  • System prompt refinement -- improve instruction clarity and task coverage
  • Persona tuning -- adjust tone, verbosity, and communication style
  • Tool usage instructions -- optimize how tools are described to le LLM
  • A/B testing -- statistically validate prompt changes before rollout

A/B Testing Framework

When a prompt modification is proposed, L2 runs both the original and modified prompts in parallel pour un configurable evaluation period:

  1. Split traffic -- alternate between original and candidate prompts
  2. Collect metrics -- track task completion, user satisfaction, tool usage efficiency
  3. Statistical test -- apply significance testing to determine the winner
  4. Promote or rollback -- adopt the winner or keep the original

Configuration

toml
[self_evolution.l2]
enabled = false
schedule = "weekly"
min_samples = 50
confidence_level = 0.95
max_concurrent_experiments = 2

Voir aussi Pages

Released under the Apache-2.0 License.