Patterns

Recurring Failure Patterns

Grouped by agent · failure mode · evidence

False Resolutioncriticalstable
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require recovery verification to pass before incident status can be set to resolved. Treat restart success as an intermediate action rather than the customer outcome.

Affected agent

Incident Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Recurring Operational Driftcriticalstable
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate the incident workflow to enforce metric recovery at the status-update boundary and convert this cluster into a regression eval.

Affected agent

Incident Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

False Resolutioncriticalstable
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require recovery verification to pass before incident status can be set to resolved. Treat restart success as an intermediate action rather than the customer outcome.

Affected agent

Incident Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

False Resolutioncriticalstable
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require metric-recovery verification artifact with status 'passed' before allowing incident status to be set to 'resolved'. Block ticket close when health check returns 'unknown'.

Affected agent

Incident Agent

9 findings

2026-06-11T05:47:08.198Z/2026-06-11T05:47:15.783Z

False Resolutioncriticalstable
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require metric-recovery verification artifact with status 'passed' before allowing incident status to be set to 'resolved'. Block ticket close when health check returns 'unknown'.

Affected agent

Incident Agent

9 findings

2026-06-11T05:47:08.198Z/2026-06-11T05:47:15.783Z

False Resolutioncriticalincreasing
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require metric-recovery verification artifact with status 'passed' before allowing incident status to be set to 'resolved'. Block ticket close when health check returns 'unknown'.

Affected agent

Incident Agent

9 findings

2026-06-11T05:47:08.198Z/2026-06-11T05:47:15.783Z

Recurring Operational Driftcriticalstable
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Review agent prompt and output pipeline to prevent customer identifiers from appearing in external reply construction. Add pre-send validation to block identifier inclusion. Escalate recurring pattern to compliance review.

Affected agent

Support Agent

6 findings

2026-06-11T05:47:02.407Z/2026-06-11T05:47:07.257Z

Recurring Operational Driftcriticalstable
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate to engineering leadership: DevOps agent has a systemic pattern of false incident resolution. Enforce mandatory metric-recovery verification at the workflow level. Promote this cluster to a regression eval.

Affected agent

Incident Agent

4 findings

2026-06-11T05:47:12.880Z/2026-06-11T05:47:15.783Z

Recurring Operational Driftcriticalstable
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate to engineering leadership: DevOps agent has a systemic pattern of false incident resolution. Enforce mandatory metric-recovery verification at the workflow level. Promote this cluster to a regression eval.

Affected agent

Incident Agent

4 findings

2026-06-11T05:47:12.880Z/2026-06-11T05:47:15.783Z

Context Ignoredcriticalstable
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customercriticalstable
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalstable
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalstable
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlingcriticalstable
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Require an explicit retention policy for recruiting context before writing to long-term or shared stores. Add customer-facing disclosure/control for retained candidate context and block eval-dataset writes when retention is not justified.

Affected agent

Recruiting Agent

1 findings

2026-06-11T05:47:01.443Z/2026-06-11T05:47:01.443Z

Trust-Damaging Handlingcriticalstable
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Require an explicit retention policy for recruiting context before writing to long-term or shared stores. Add customer-facing disclosure/control for retained candidate context and block eval-dataset writes when retention is not justified.

Affected agent

Recruiting Agent

1 findings

2026-06-11T05:47:01.443Z/2026-06-11T05:47:01.443Z

Trust-Damaging Handlingcriticalnew
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Require an explicit retention policy for recruiting context before writing to long-term or shared stores. Add customer-facing disclosure/control for retained candidate context and block eval-dataset writes when retention is not justified.

Affected agent

Recruiting Agent

1 findings

2026-06-11T05:47:01.443Z/2026-06-11T05:47:01.443Z

Recurring Operational Drifthighincreasing
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Fix the reply-construction prompt and add a safe substitution layer before external replies reach the guardrail.

Affected agent

Support Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

False Resolutionhighincreasing
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require recovery verification to pass before incident status can be set to resolved. Treat restart success as an intermediate action rather than the customer outcome.

Affected agent

Incident Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Recurring Operational Drifthighincreasing
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate the incident workflow to enforce metric recovery at the status-update boundary and convert this cluster into a regression eval.

Affected agent

Incident Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Recurring Operational Drifthighstable
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Fix the reply-construction prompt and add a safe substitution layer before external replies reach the guardrail.

Affected agent

Support Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Recurring Operational Drifthighstable
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Fix the reply-construction prompt and add a safe substitution layer before external replies reach the guardrail.

Affected agent

Support Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Recurring Operational Drifthighstable
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate the incident workflow to enforce metric recovery at the status-update boundary and convert this cluster into a regression eval.

Affected agent

Incident Agent

12 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

False Resolutionhighstable
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require metric-recovery verification artifact with status 'passed' before allowing incident status to be set to 'resolved'. Block ticket close when health check returns 'unknown'.

Affected agent

Incident Agent

9 findings

2026-06-11T05:47:08.198Z/2026-06-11T05:47:15.783Z

Recurring Operational Drifthighstable
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Review agent prompt and output pipeline to prevent customer identifiers from appearing in external reply construction. Add pre-send validation to block identifier inclusion. Escalate recurring pattern to compliance review.

Affected agent

Support Agent

6 findings

2026-06-11T05:47:02.407Z/2026-06-11T05:47:07.257Z

Recurring Operational Drifthighstable
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Review agent prompt and output pipeline to prevent customer identifiers from appearing in external reply construction. Add pre-send validation to block identifier inclusion. Escalate recurring pattern to compliance review.

Affected agent

Support Agent

6 findings

2026-06-11T05:47:02.407Z/2026-06-11T05:47:07.257Z

Recurring Operational Drifthighincreasing
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Review agent prompt and output pipeline to prevent customer identifiers from appearing in external reply construction. Add pre-send validation to block identifier inclusion. Escalate recurring pattern to compliance review.

Affected agent

Support Agent

6 findings

2026-06-11T05:47:02.407Z/2026-06-11T05:47:07.257Z

Recurring Operational Drifthighstable
Guardrail Friction in customer inquiry

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Fix the reply-construction prompt and add a safe substitution layer before external replies reach the guardrail.

Affected agent

Support Agent

4 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

False Resolutionhighstable
False Success in incident response

The agent marked the work resolved while verification evidence still showed the underlying problem was active.

Require recovery verification to pass before incident status can be set to resolved. Treat restart success as an intermediate action rather than the customer outcome.

Affected agent

Incident Agent

4 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Recurring Operational Drifthighstable
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate the incident workflow to enforce metric recovery at the status-update boundary and convert this cluster into a regression eval.

Affected agent

Incident Agent

4 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Recurring Operational Drifthighstable
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate to engineering leadership: DevOps agent has a systemic pattern of false incident resolution. Enforce mandatory metric-recovery verification at the workflow level. Promote this cluster to a regression eval.

Affected agent

Incident Agent

4 findings

2026-06-11T05:47:12.880Z/2026-06-11T05:47:15.783Z

Recurring Operational Drifthighincreasing
Latent False Success Drift in incident response

The same agent repeatedly shows service gaps, guardrail friction, or false-success behavior across completed traces.

Escalate to engineering leadership: DevOps agent has a systemic pattern of false incident resolution. Enforce mandatory metric-recovery verification at the workflow level. Promote this cluster to a regression eval.

Affected agent

Incident Agent

4 findings

2026-06-11T05:47:12.880Z/2026-06-11T05:47:15.783Z

Trust-Damaging Handlinghighstable
Trust Damaging Retention in candidate offer follow up

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Context Ignoredhighincreasing
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerhighincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlinghighincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerhighincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlinghighincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Trust-Damaging Handlinghighincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerhighincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Context Ignoredhighincreasing
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

2 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:22.905Z

Effort Shifted To Customerhighincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

2 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:22.905Z

Effort Shifted To Customerhighincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

2 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:22.905Z

Trust-Damaging Handlinghighincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

2 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:22.905Z

Trust-Damaging Handlinghighincreasing
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

2 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:22.905Z

Context Ignoredhighnew
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Context Ignoredhighstable
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Effort Shifted To Customerhighnew
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Effort Shifted To Customerhighnew
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Effort Shifted To Customerhighstable
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Trust-Damaging Handlinghighnew
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Trust-Damaging Handlinghighnew
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Trust-Damaging Handlinghighstable
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Effort Shifted To Customerhighnew
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Trust-Damaging Handlinghighnew
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Trust-Damaging Handlinghighnew
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Block long-term/shared retention of sensitive candidate context unless a retention policy, purpose, and candidate-facing control are present.

Affected agent

Recruiting Agent

1 findings

2026-06-10T14:30:32.601Z/2026-06-10T14:30:32.601Z

Trust-Damaging Handlinghighstable
Trust Damaging Retention in candidate screening

The agent solved the immediate task by retaining or sharing sensitive customer context beyond the service need.

Require an explicit retention policy for recruiting context before writing to long-term or shared stores. Add customer-facing disclosure/control for retained candidate context and block eval-dataset writes when retention is not justified.

Affected agent

Recruiting Agent

1 findings

2026-06-11T05:47:01.443Z/2026-06-11T05:47:01.443Z

Context Ignoredhighstable
Evidence Output Contradiction in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to apply retrieved policy exceptions before issuing a denial. Add a verification step that confirms all relevant policy exceptions have been checked.

Affected agent

Support Agent

1 findings

2026-06-11T05:47:00.491Z/2026-06-11T05:47:00.491Z

Context Ignoredmediumstable
Context Neglect in sla credit request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Context Ignoredmediumstable
Context Neglect in cancellation request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Context Ignoredmediumstable
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Context Ignoredmediumstable
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Context Ignoredmediumstable
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Context Ignoredmediumincreasing
Context Neglect in refund request

The agent had account, policy, or contract evidence that should have changed the answer, but closed the work against that context.

Require the agent to reconcile retrieved account and policy context before closing or denying the customer request. Route exception-qualified cases to a human approval path.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowstable
Customer Effort Inflation in escalation request

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowstable
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowstable
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowstable
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z

Effort Shifted To Customerlowincreasing
Customer Effort Inflation in support follow up

The agent made the customer carry unresolved work: repeat information, retry a failed self-service path, or come back later without an owner.

Preserve prior customer context, assign an owner or escalation path when repeat contact is detected, and block resolutions that ask the customer to repeat already-provided information.

Affected agent

Support Agent

3 findings

2026-06-10T14:30:32.601Z/2026-06-11T05:47:29.662Z