`karenina.storage.views.template_attributes`¶

template_attributes ¶

template_attributes_view

Attribute-level comparison of ground truth vs LLM-parsed values. One row per attribute per result. Use for debugging verification failures and identifying systematic parsing errors by comparing gt_value vs llm_value.

Columns

result_id (TEXT): Unique identifier for the verification result verification_date (TEXT): Date the verification was performed (YYYY-MM-DD) run_name (TEXT): Name of the verification run benchmark_name (TEXT): Name of the benchmark question_id (TEXT): Unique identifier for the question (MD5 hash) question_text (TEXT): The question content attribute_name (TEXT): Name of the template attribute (e.g., 'gene_name', 'tissue') gt_value (TEXT): Ground truth (expected) value for this attribute llm_value (TEXT): Value extracted from the LLM response attribute_match (INTEGER): 1 if gt_value == llm_value, 0 otherwise

Keys

Primary: result_id + attribute_name Joins: result_id → template_results_view.result_id question_id → question_attributes_view.question_id

Example

SELECT attribute_name, SUM(attribute_match) as correct, COUNT(*) as total FROM template_attributes_view GROUP BY attribute_name;

Functions¶

create_template_attributes_view ¶

create_template_attributes_view(engine: Engine) -> None

Create or replace the template_attributes_view.

Source code in src/karenina/storage/views/template_attributes.py

def create_template_attributes_view(engine: Engine) -> None:
    """Create or replace the template_attributes_view."""
    create_view_safe(engine, VIEW_NAME, _SQLITE_SQL, _POSTGRES_SQL)

drop_template_attributes_view ¶

drop_template_attributes_view(engine: Engine) -> None

Drop the template_attributes_view if it exists.