Test Report Automation

State machine · Conversational pipeline

DET AI HYBRID HUMAN

Draft with AI.
Owned by people.

The pipeline narrates itself through a Teams thread. The scientists are always in the loop. Open QC to see the trust architecture.

At ~150 reports a year and ~4 hours each to draft, this tool saves >500 hours/year of scientist time, allowing the team to focus on the science. Keeping the human in the loop drives ownership and maintains ISO compliance.

Supporting view · The configuration engine

A new report type is a YAML file and a Word template.
Not a feature branch.

Each report type declares its sections, its analysis recipe, its QC rules, its template. The engine reads them at runtime - the pipeline scales by configuration, not by code.

wet_patch.yaml · 42 lines

report_type: wet_patch_chemical

sections:
  - objective           # required
  - materials_methods   # required
  - results_discussion  # required
  - conclusions         # required + pass/fail
  - photos              # optional

analysis_recipe:
  sheet: "Results"
  metrics:
    - name: rating
      stats: [mean, std]
      pass_criteria: ">= 7"
      provenance: "col Rating, rows 2-10"

qc_rules:
  ground_numbers: true
  verdict_required: true
  cite_standards: false   # phase 2

template:
  path: "Templates/wet_patch.docx"
  sha256: "..."

→

Analysis

reads recipe

→

Drafting

reads sections

→

reads qc_rules

→

Doc Gen

reads template path

5–10 test types planned · same engine

Draft with AI.
Owned by people.

Scientist kicks off interaction · bot fetches metadata · manual fallback

Reads Excel + PDFs · stats · tables · charts · images

A conditional state - only when warnings exist

Claude Sonnet · per-section · temp 0.1

Don't trust your own LLM.
Prove the work before a human reads it.

Regex grounding

Claude Haiku review

Hard gate - nothing publishes without explicit approval

SHA-256 template verification · content-control fill · upload

Terminal - audit trail closed · second human review