A legal skill gets updated. Before rollout, this meta-skill runs the updated skill through 20 test scenarios across jurisdictions, compares its output quality with the previous version's, and flags any regressions, so that an update improves the skill rather than breaking it.
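The comparison described above could be sketched roughly as follows. This is a minimal illustration, not the meta-skill's actual implementation: `Scenario`, `Regression`, `find_regressions`, and the `tolerance` parameter are hypothetical names, and the sketch assumes each skill version can be called as a function from prompt to output and that some scoring function for output quality is available.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Scenario:
    """One test case (hypothetical shape): a prompt tied to a jurisdiction."""
    name: str
    jurisdiction: str
    prompt: str


@dataclass(frozen=True)
class Regression:
    """A scenario where the new version scored lower than the old one."""
    scenario: Scenario
    old_score: float
    new_score: float


# Assumed interfaces: a skill maps a prompt to output text,
# and a scorer maps (scenario, output) to a quality score.
Skill = Callable[[str], str]
Scorer = Callable[[Scenario, str], float]


def find_regressions(
    scenarios: List[Scenario],
    old_skill: Skill,
    new_skill: Skill,
    score: Scorer,
    tolerance: float = 0.0,
) -> List[Regression]:
    """Run both versions on every scenario and collect the score drops."""
    regressions = []
    for scenario in scenarios:
        old_score = score(scenario, old_skill(scenario.prompt))
        new_score = score(scenario, new_skill(scenario.prompt))
        # Flag only drops larger than the allowed tolerance.
        if new_score < old_score - tolerance:
            regressions.append(Regression(scenario, old_score, new_score))
    return regressions


if __name__ == "__main__":
    # Toy data for illustration only; a real run would use the 20 scenarios.
    scenarios = [
        Scenario("nda-review", "US-CA", "Review NDA"),
        Scenario("lease-review", "DE", "Review lease"),
    ]
    old = lambda prompt: prompt + ": governing law checked"
    new = lambda prompt: prompt + (": governing law checked" if "NDA" in prompt else "")
    scorer = lambda scenario, output: 1.0 if "governing law" in output else 0.0

    for r in find_regressions(scenarios, old, new, scorer):
        print(r.scenario.name, r.scenario.jurisdiction, r.old_score, r.new_score)
```

An empty result would mean no scenario scored worse under the new version. How quality is actually scored, and how large a drop counts as a regression, are left open here.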