karenina.integrations.gepa.data_types¶
data_types
¶
Data types for GEPA-Karenina integration.
These types map karenina's verification concepts to GEPA's adapter interface: - KareninaDataInst: Input data instance (question + ground truth) - KareninaTrajectory: Execution trace (verification result + metadata) - KareninaOutput: Optimized components output - BenchmarkSplit: Train/val/test split result
Classes¶
BenchmarkSplit
dataclass
¶
Result of splitting a benchmark for optimization.
Contains train, validation, and optionally test sets of KareninaDataInst objects.
Source code in src/karenina/integrations/gepa/data_types.py
Attributes¶
seed
class-attribute
instance-attribute
¶
Random seed used for splitting (for reproducibility).
test
class-attribute
instance-attribute
¶
test: list[KareninaDataInst] | None = None
Optional test set instances.
Functions¶
summary
¶
Get a summary string of the split.
Source code in src/karenina/integrations/gepa/data_types.py
KareninaDataInst
dataclass
¶
Single evaluation instance for GEPA.
Represents one question from a karenina benchmark with all information needed for verification.
Source code in src/karenina/integrations/gepa/data_types.py
Attributes¶
few_shot_examples
class-attribute
instance-attribute
¶
Optional few-shot examples for the question.
metadata
class-attribute
instance-attribute
¶
Additional metadata (author, tags, etc.).
question_text
instance-attribute
¶
The question text to be answered by the model.
rubric
class-attribute
instance-attribute
¶
Optional rubric configuration for quality evaluation.
template_code
instance-attribute
¶
Python code defining the Answer template class.
Functions¶
to_dict
¶
Convert to JSON-serializable dict for GEPA.
Source code in src/karenina/integrations/gepa/data_types.py
KareninaOutput
dataclass
¶
Optimized text components from a GEPA run.
Contains the best-performing text components discovered during optimization, along with metrics.
Source code in src/karenina/integrations/gepa/data_types.py
Attributes¶
answering_system_prompt
class-attribute
instance-attribute
¶
Optimized system prompt for answering model.
baseline_score
class-attribute
instance-attribute
¶
Score before optimization (seed candidate).
best_generation
class-attribute
instance-attribute
¶
Generation number where best candidate was found.
improvement
class-attribute
instance-attribute
¶
Relative improvement: (val_score - baseline_score) / baseline_score.
mcp_tool_descriptions
class-attribute
instance-attribute
¶
Optimized MCP tool descriptions (tool_name -> description).
parsing_instructions
class-attribute
instance-attribute
¶
Optimized instructions for parsing model.
test_score
class-attribute
instance-attribute
¶
Optional score on test set (if test set was used).
total_generations
class-attribute
instance-attribute
¶
Number of GEPA generations run.
total_metric_calls
class-attribute
instance-attribute
¶
Total number of evaluation calls used.
train_score
class-attribute
instance-attribute
¶
Final score on training set.
val_score
class-attribute
instance-attribute
¶
Final score on validation set.
Functions¶
get_optimized_prompts
¶
Get all optimized prompts as a dict.
Source code in src/karenina/integrations/gepa/data_types.py
KareninaTrajectory
dataclass
¶
Execution trace from one verification run.
Captures detailed information about a single verification attempt for use in GEPA's reflective feedback mechanism.
Each trajectory represents one (question, model) combination.
Source code in src/karenina/integrations/gepa/data_types.py
Attributes¶
failed_fields
class-attribute
instance-attribute
¶
List of template fields that failed verification.
model_name
instance-attribute
¶
Name/identifier of the model that produced this result.
optimized_components
instance-attribute
¶
The optimized text components that were injected.
parsing_error
class-attribute
instance-attribute
¶
Error message if parsing failed.
raw_llm_response
class-attribute
instance-attribute
¶
Raw text response from the answering model.
rubric_scores
class-attribute
instance-attribute
¶
Per-trait rubric scores.
verification_result
instance-attribute
¶
verification_result: VerificationResult
Complete verification result from karenina.
Functions¶
passed
¶
to_feedback_dict
¶
Convert to feedback dict for GEPA reflection.
Returns dict with "Inputs", "Generated Outputs", "Feedback" keys as expected by GEPA's reflective proposer.