Fix tool approval denial reason not propagating to LLM or parent agent by anubra266 · Pull Request #2500 · inkeep/agents

anubra266 · 2026-03-03T21:57:47Z

Summary

When a tool call is denied with a reason, the sub-agent's LLM now receives a clean human-readable string instead of the raw internal sentinel object ({ __inkeepToolDenied: true, ... })
Denial redirect events are now tracked on the Agent instance during execution
When a delegated sub-agent's task is redirected mid-execution (e.g. user denies get_coordinates(San Francisco) with reason "I want tokyo instead"), the delegation response now includes a note so the parent agent knows the task evolved

Problem

Two bugs in the approval denial flow:

Bug 1 — Sub-agent LLM received a raw JSON sentinel instead of the reason string, so it would see {"__inkeepToolDenied":true,"toolCallId":"...","reason":"..."} and sometimes ignore the redirect.

Bug 2 — Parent agent had no visibility into user-triggered redirects during delegation. In the trace: activities-planner asked get-coordinates-agent for San Francisco, the user denied and redirected to Tokyo, get-coordinates-agent correctly fetched Tokyo — but activities-planner received the Tokyo result without context, got confused by the mismatch, and hallucinated San Francisco's coordinates from training data.

Changes

Agent.ts — Returns result.reason string (not the sentinel object) from wrapToolWithStreaming so the AI SDK passes a clean message to the LLM; tracks denial events in taskDenialRedirects[]; exposes getTaskDenialRedirects() getter
generateTaskHandler.ts — Prepends a [NOTE: The user redirected this task...] prefix to delegation responses when denials occurred, giving the parent agent full context
Updated tests to match new behavior

CleanShot.2026-03-03.at.13.49.07.mp4

- Return denial reason string from tool execute so the sub-agent LLM sees a clean human-readable message instead of the raw sentinel object - Track denial redirect events on Agent instances during execution - Prepend denial context to delegation responses so the parent agent knows the user redirected the task mid-execution (e.g. SF → Tokyo)

changeset-bot · 2026-03-03T21:57:52Z

🦋 Changeset detected

Latest commit: 532e5bc

The changes in this PR will be included in the next version bump.

This PR includes changesets to release 10 packages

Name	Type
@inkeep/agents-api	Patch
@inkeep/agents-manage-ui	Patch
@inkeep/agents-cli	Patch
@inkeep/agents-core	Patch
@inkeep/agents-email	Patch
@inkeep/agents-mcp	Patch
@inkeep/agents-sdk	Patch
@inkeep/agents-work-apps	Patch
@inkeep/ai-sdk-provider	Patch
@inkeep/create-agents	Patch

Not sure what this means? Click here to learn what changesets are.

Click here if you're a maintainer who wants to add another changeset to this PR

vercel · 2026-03-03T21:57:52Z

The latest updates on your projects. Learn more about Vercel for GitHub.

Project	Deployment	Actions	Updated (UTC)
agents-api	Ready	Preview, Comment	Mar 4, 2026 6:27pm
agents-docs	Ready	Preview, Comment	Mar 4, 2026 6:27pm
agents-manage-ui	Ready	Preview, Comment	Mar 4, 2026 6:27pm

pullfrog · 2026-03-03T21:59:13Z

Fixes two bugs in the tool approval denial flow: (1) sub-agent LLMs received the raw { __inkeepToolDenied: true, ... } sentinel object instead of a human-readable denial reason string, and (2) parent agents had no visibility into user-triggered redirects during delegation — causing them to hallucinate stale data.

agents-api/src/domains/run/agents/Agent.ts — adds taskDenialRedirects tracking array and getTaskDenialRedirects() getter; returns result.reason (or a default message) from wrapToolWithStreaming when a tool call is denied, replacing the sentinel object
agents-api/src/domains/run/agents/generateTaskHandler.ts — prepends a [NOTE: The user redirected this task...] block to delegation responses when denials with reasons occurred, so the parent agent understands why the result differs from the original request
agents-api/src/__tests__/run/agents/functionToolApprovals.test.ts — updates assertion to expect the reason string ('no') instead of the sentinel object
agents-api/src/__tests__/run/agents/generateTaskHandler.test.ts — adds getTaskDenialRedirects() stub to the Agent mock

^{｜ View workflow run ｜ Using Claude Code ｜ Triggered by Pullfrog ｜ pullfrog.com ｜ 𝕏}

pullfrog

The sentinel-to-string fix in wrapToolWithStreaming is correct — the LLM will now see a clean human-readable denial message. However, the redirect tracking has a semantic issue: approvalResult.reason is always truthy on any denial (timeouts, expiries, explicit denials), so the parent agent will receive misleading "user redirected this task" notes even for timeout/expiry denials. The redirect note path in generateTaskHandler.ts also lacks test coverage.

^{｜ Fix all ➔ ｜ Fix 👍s ➔ ｜ View workflow run ｜ pullfrog.com ｜ 𝕏}

claude

PR Review Summary

(6) Total Issues | Risk: Medium

🟠⚠️ Major (4) 🟠⚠️

Inline Comments:

🟠 Major: Agent.ts:1027-1033 Denials without explicit reasons are silently excluded from taskDenialRedirects, creating inconsistent parent agent visibility
🟠 Major: Agent.ts:1353-1359 Same issue for function tools path
🟠 Major: generateTaskHandler.ts:506-513 User-provided denial reason injected into parent agent prompt without sanitization
🟠 Major: generateTaskHandler.test.ts:202-204 Core Bug #2 fix (note injection) has zero test coverage

🟠 5) file Missing changeset for bug fix

Issue: Per AGENTS.md, any bug fix affecting runtime behavior requires a changeset. This PR fixes two bugs in agents-api that affect how tool denials propagate through the system.

Why: Changesets document changes for release notes and help consumers understand what changed between versions.

Fix: Run:

pnpm bump patch --pkg agents-api "Fix tool approval denial reason not propagating to LLM or parent agent"

Refs:

AGENTS.md — changeset requirements
changeset-bot comment already flagged this

🟡 Minor (1) 🟡

🟡 1) functionToolApprovals.test.ts No test for denial-without-reason fallback

Issue: The code at Agent.ts:753 has a fallback result.reason ?? 'Tool call was denied by the user.' for denials without explicit reasons. The existing test only covers denial WITH a reason (reason: 'no'). The fallback path is untested.

Why: If the fallback is removed or broken, LLMs would receive undefined as a tool result, potentially confusing the model.

Fix: Add a test case where waitForApproval returns { approved: false, reason: undefined } and verify the result equals the fallback string.

Refs:

Agent.ts:753 — fallback message

💭 Consider (2) 💭

💭 1) Agent.ts:297 taskDenialRedirects array never cleared

Issue: The array accumulates throughout the agent's lifetime. If an Agent instance were reused across multiple delegated tasks, denials from earlier tasks would persist in later responses.

Why: Currently agents appear to be instantiated per-task (cleanup() is called in finally block), but this is an implicit assumption. If agent reuse patterns change, this becomes a bug.

Fix: Either clear the array at the start of generate(), add a comment documenting the single-use assumption, or clear it in cleanup().

Refs:

generateTaskHandler.ts:506 — where array is consumed

💭 2) Agent.ts:297 Naming: "redirects" vs "reasons"

Issue: The variable is named taskDenialRedirects but the implementation tracks all denials with any reason, not specifically "redirect-like" denials. A user saying "no, I changed my mind" is different from "no, use Tokyo instead".

Why: The PR description focuses on "redirects" but the code captures all denial reasons. This is fine if intentional, but the naming could confuse future maintainers.

Fix: Consider renaming to taskDenialReasons for accuracy, or add a comment clarifying that all reasons are captured.

🚫 REQUEST CHANGES

Summary: The fix addresses both bugs well at a high level, but has gaps that could leave Bug #2 partially unresolved. Specifically: (1) denials without explicit reasons won't propagate to parent agents since tracking is conditional, and (2) the core fix for Bug #2 has no test coverage. The prompt injection concern with unsanitized reasons is also worth addressing. Once these are resolved, this is a solid fix.

Discarded (4)

Location	Issue	Reason Discarded
`Agent.ts:1027-1038`	Code organization — tracking in different location than transformation	Valid observation but code works correctly; purely stylistic preference
`Agent.ts:753`	Fallback lacks tool-specific context	Low confidence; generic message is acceptable for edge case
`Agent.ts:1027`	Denials should track even without reason (duplicate)	Merged into main Major finding
`generateTaskHandler.test.ts:202`	No test coverage (duplicate)	Merged into main Major finding

Reviewers (5)

Reviewer	Returned	Main Findings	Consider	Inline Comments	Discarded
`pr-review-errors`	3	0	0	2	1
`pr-review-tests`	3	1	0	1	1
`pr-review-llm`	2	0	0	1	1
`pr-review-precision`	3	0	2	0	1
`pr-review-standards`	1	1	0	0	0
Total	12	2	2	4	4

…coverage

claude

PR Review Summary

(0) Total Issues | Risk: Low

This is a delta re-review scoped to changes since commit 1f6e0f786b25. The new commit cf2bcf3a3 addresses all prior review feedback.

✅ Prior Feedback Addressed

Prior Issue	Resolution in Delta
🟠 Denials without reasons silently excluded	✅ `recordDenial()` now always records with fallback message
🟠 Same issue for function tools path	✅ Both paths now use the same `recordDenial()` helper
🟠 Unsanitized reasons in parent prompt	✅ Added `sanitize()` — removes newlines, caps at 200 chars
🟠 No test coverage for redirect note	✅ New test validates note injection with toolName, toolCallId, reason
🟠 Missing changeset	✅ Added `.changeset/impossible-sapphire-elk.md`

💭 Consider (1) 💭

Inline Comments:

💭 Consider: generateTaskHandler.test.ts:584-611 Test cleanup pattern could cause test pollution on assertion failure

🔍 Delta Changes Verified

Agent.ts (lines 554-560):

private recordDenial(toolName: string, toolCallId: string, reason: string | undefined): void {
  this.taskDenialRedirects.push({
    toolName,
    toolCallId,
    reason: reason ?? 'Tool call was denied by the user.',  // ← Always records
  });
}

Centralized denial recording with fallback ✅
Applied to both MCP (line 1037) and function tool (line 1357) paths ✅

generateTaskHandler.ts (lines 506-516):

Sanitization added: s.replace(/\n/g, ' ').slice(0, 200) ✅
toolCallId now included in note format ✅
Note wording refined to be more accurate ✅

generateTaskHandler.test.ts (lines 583-611):

New test exercises the denial redirect note injection ✅
Validates note contains toolName, toolCallId, and reason ✅

✅ APPROVE

Summary: All four Major issues from the prior review have been addressed. The fix is now complete: denials are always tracked (regardless of reason truthiness), reasons are sanitized before injection into parent agent context, and test coverage validates the behavior. The one remaining suggestion is a minor test cleanup pattern improvement that doesn't affect production behavior.

Discarded (2)

Location	Issue	Reason Discarded
`generateTaskHandler.ts:508`	Truncation should add ellipsis	Low confidence; current behavior is acceptable, purely stylistic preference
`generateTaskHandler.ts:506`	Optional chaining masks null agent	Low confidence; agent is created in same try block and would have thrown earlier

Reviewers (4)

Reviewer	Returned	Consider	Inline Comments	Discarded
`pr-review-standards`	0	0	0	0
`pr-review-tests`	2	1	1	0
`pr-review-errors`	2	0	0	2
`pr-review-precision`	1	0	0	1
Total	5	1	1	3

Note: pr-review-precision finding was duplicate of pr-review-tests finding (test cleanup pattern).

claude · 2026-03-03T22:25:31Z

+      vi.mocked(Agent).prototype.getTaskDenialRedirects = vi.fn().mockReturnValue([
+        {
+          toolName: 'get_coordinates',
+          toolCallId: 'call-abc',
+          reason: 'I want the coordinates for tokyo instead',
+        },
+      ]);
+
+      const taskHandler = createTaskHandler(mockConfig);
+      const task: A2ATask = {
+        id: 'task-123',
+        input: { parts: [{ kind: 'text', text: 'Get coordinates for San Francisco' }] },
+        context: { conversationId: 'conv-123' },
+      };
+
+      const result = await taskHandler(task);
+
+      expect(result.status.state).toBe(TaskState.Completed);
+      expect(result.artifacts?.[0].parts).toHaveLength(2);
+      const notePart = result.artifacts?.[0].parts[0];
+      expect(notePart?.kind).toBe('text');
+      expect((notePart as any)?.text).toContain('[NOTE: Some tool calls were denied');
+      expect((notePart as any)?.text).toContain(
+        'get_coordinates (call-abc): I want the coordinates for tokyo instead'
+      );
+
+      vi.mocked(Agent).prototype.getTaskDenialRedirects = vi.fn().mockReturnValue([]);
+    });


💭 Consider: Test cleanup pattern

Issue: The mock is restored at the end of the test body (line 610) rather than in an afterEach hook. If the test fails before reaching line 610, subsequent tests may inherit the mocked getTaskDenialRedirects.

Why: This is a low-risk issue since the base MockAgent class at line 203-205 already returns [] by default and most tests create fresh instances. However, the current pattern could cause confusing cascading failures if assertions fail mid-test.

Fix: Consider wrapping in try/finally or moving cleanup to afterEach:

it('prepends denial redirect note when task had denied tool calls', async () => { const originalMethod = vi.mocked(Agent).prototype.getTaskDenialRedirects; vi.mocked(Agent).prototype.getTaskDenialRedirects = vi.fn().mockReturnValue([...]); try { // ... test body ... } finally { vi.mocked(Agent).prototype.getTaskDenialRedirects = originalMethod; } });

Refs:

MockAgent default implementation

claude

PR Review Summary

(0) Total Issues | Risk: Low

This is a delta re-review scoped to changes since commit cf2bcf3a3b2c. The delta consists solely of a merge from main (commit f58484d55).

🔍 Delta Analysis

The only changes since the last approved review are from PR #2502 (Trace improvements), merged via main:

File	Delta Change	Relevance to This PR
`Agent.ts:204`	Added `agentName?: string` to `AgentConfig` type	❌ Unrelated — trace attribute fix
`Agent.ts:2432`	Changed span attribute to use `this.config.agentName`	❌ Unrelated — trace attribute fix
`generateTaskHandler.ts:197`	Added `agentName: currentAgent.name` to Agent constructor	❌ Unrelated — trace attribute fix

This PR's bug fix code is unchanged since the last approved review:

✅ recordDenial() helper with fallback message — intact
✅ MCP and function tool paths both call recordDenial() — intact
✅ Sanitization in generateTaskHandler.ts — intact
✅ Test coverage for denial redirect note — intact
✅ Changeset present — intact

🕐 Pending Recommendations (1)

💭 generateTaskHandler.test.ts:611 Test cleanup pattern could cause test pollution (Consider — from prior review)

✅ APPROVE

Summary: The merge from main introduced only trace-related changes (agent name attribute) that are orthogonal to this PR's bug fix. The denial reason propagation fix remains complete and unchanged. All prior feedback has been addressed. Ship it! 🚀

Reviewers (0)

No reviewers dispatched — delta contained no PR-specific changes requiring domain review.

amikofalvy · 2026-03-04T07:04:09Z

                      'Tool execution denied by user'
                    );

+                    this.recordDenial(toolName, toolCallId, approvalResult.reason);


I have a concern about our tool approval flow in general. I bring it up here because this is a continuation of what I think is the problematic pattern.

What we are doing is aggregating the denials in memory. I think all tool related states should be persisted beyond application memory so that we can have potentially long running operations.

Let's discuss tomorrow and either we can fix it in this PR or as a separate PR.

@tim-inkeep is refactoring Agent.ts so I think we should implement persistent storage later

Worth thinking about how this is related to durable execution. I.e., are the tool related states persisted via durable execution state primatives (using e.g. Vercel Workflows) and/or the runtime DB.

I shared context with Abraham on durable execution, that's the next project.

claude

PR Review Summary

(0) Total Issues | Risk: Low

This is a delta re-review scoped to changes since commit cf2bcf3a3b2c. The delta consists solely of a merge from main (commit 30e85841d).

🔍 Delta Analysis

The only changes since the last approved review are from merging main, which brought in:

PR #2516: Docker-compose dependency bumps
PR #2514-2515: Docs updates for scheduled triggers
PR #2513: Version packages
PR #2512: Errors-only view in traces
PR #2502: Trace improvements (agentName attribute)

Changes to PR-specific files from the merge:

File	Delta Change	Relevance to This PR
`Agent.ts:204`	Added `agentName?: string` to `AgentConfig` type	❌ Unrelated — trace attribute fix
`Agent.ts:2432`	Changed span attribute to use `this.config.agentName`	❌ Unrelated — trace attribute fix
`generateTaskHandler.ts:197`	Added `agentName: currentAgent.name` to Agent constructor	❌ Unrelated — trace attribute fix

This PR's bug fix code is unchanged since the last approved review:

✅ recordDenial() helper with fallback message — intact
✅ MCP and function tool paths both call recordDenial() — intact
✅ Sanitization in generateTaskHandler.ts — intact
✅ Test coverage for denial redirect note — intact
✅ Changeset present — intact

🕐 Pending Recommendations (1)

💭 generateTaskHandler.test.ts:611 Test cleanup pattern could cause test pollution (Consider — from prior review)

📝 Human Review Discussion (Acknowledged)

@amikofalvy and @nick-inkeep raised valid points about in-memory denial tracking vs persistent storage. This is acknowledged as a future consideration for the durable execution project — not blocking for this bug fix PR.

✅ APPROVE

Summary: The merge from main introduced only trace-related changes (agent name attribute) that are orthogonal to this PR's bug fix. The denial reason propagation fix remains complete and unchanged. All prior feedback has been addressed. Ship it! 🚀

Reviewers (0)

No reviewers dispatched — delta contained no PR-specific changes requiring domain review.

Document that parent agents now receive a note in the delegation response when a user redirects a delegated sub-agent's task mid-execution (e.g. by denying a tool call with a reason). This gives the parent full context about how the task evolved. Ref: PR #2500

* docs: clarify denial reason propagation in tool approvals runtime behavior * docs: document denial reason propagation for tool approvals * docs: add task redirect visibility to delegation relationships Document that parent agents now receive a note in the delegation response when a user redirects a delegated sub-agent's task mid-execution (e.g. by denying a tool call with a reason). This gives the parent full context about how the task evolved. Ref: PR #2500 * Apply suggestion from @pullfrog[bot] Co-authored-by: pullfrog[bot] <226033991+pullfrog[bot]@users.noreply.github.com> --------- Co-authored-by: inkeep[bot] <257615677+inkeep[bot]@users.noreply.github.com> Co-authored-by: Abraham <[email protected]> Co-authored-by: pullfrog[bot] <226033991+pullfrog[bot]@users.noreply.github.com>

In durable execution, each step creates a fresh Agent instance, so taskDenialRedirects accumulated during executeToolStep were lost before the next callLlmStep. This meant the LLM had no awareness that a tool was denied and the generateTaskHandler delegation note from #2500 never fired in durable mode. Carry denial state in the workflow loop, pass it into callLlmStep, and give the LLM an explicit post-approval message when denials occurred. Also improve conversation history format for denied tools.

vercel Bot deployed to Preview – agents-api March 3, 2026 21:58 View deployment

pullfrog Bot reviewed Mar 3, 2026

View reviewed changes

Comment thread agents-api/src/domains/run/agents/Agent.ts Outdated

Comment thread agents-api/src/domains/run/agents/Agent.ts Outdated

Comment thread agents-api/src/__tests__/run/agents/generateTaskHandler.test.ts

Comment thread agents-api/src/domains/run/agents/generateTaskHandler.ts Outdated

vercel Bot deployed to Preview – agents-manage-ui March 3, 2026 22:00 View deployment

claude Bot reviewed Mar 3, 2026

View reviewed changes

Comment thread agents-api/src/domains/run/agents/Agent.ts Outdated

Comment thread agents-api/src/domains/run/agents/Agent.ts Outdated

Comment thread agents-api/src/domains/run/agents/generateTaskHandler.ts

Comment thread agents-api/src/__tests__/run/agents/generateTaskHandler.test.ts

github-actions Bot deleted a comment from claude Bot Mar 3, 2026

Address review: fix denial tracking gaps, sanitize reasons, add test …

cf2bcf3

…coverage

vercel Bot deployed to Preview – agents-docs March 3, 2026 22:22 View deployment

vercel Bot deployed to Preview – agents-api March 3, 2026 22:23 View deployment

claude Bot reviewed Mar 3, 2026

View reviewed changes

github-actions Bot deleted a comment from claude Bot Mar 3, 2026

Merge branch 'main' into fix/tool-approval-denial-context

f58484d

anubra266 requested a review from amikofalvy March 4, 2026 01:36

claude Bot reviewed Mar 4, 2026

View reviewed changes

vercel Bot deployed to Preview – agents-manage-ui March 4, 2026 01:39 View deployment

vercel Bot deployed to Preview – agents-api March 4, 2026 01:39 View deployment

github-actions Bot deleted a comment from claude Bot Mar 4, 2026

amikofalvy reviewed Mar 4, 2026

View reviewed changes

amikofalvy approved these changes Mar 4, 2026

View reviewed changes

anubra266 enabled auto-merge (squash) March 4, 2026 18:21

Merge branch 'main' into fix/tool-approval-denial-context

532e5bc

vercel Bot deployed to Preview – agents-manage-ui March 4, 2026 18:26 View deployment

vercel Bot deployed to Preview – agents-api March 4, 2026 18:27 View deployment

vercel Bot deployed to Preview – agents-docs March 4, 2026 18:27 View deployment

anubra266 merged commit 31b5e8b into main Mar 4, 2026
9 checks passed

anubra266 deleted the fix/tool-approval-denial-context branch March 4, 2026 18:27

claude Bot reviewed Mar 4, 2026

View reviewed changes

github-actions Bot deleted a comment from claude Bot Mar 4, 2026

inkeep Bot mentioned this pull request Mar 4, 2026

docs: Document tool approval denial reason propagation #2520

Merged

This was referenced Mar 4, 2026

Trigger page UI optimizations #2253

Merged

feat: user profiles with timezone storage #2486

Merged

anubra266 mentioned this pull request Apr 3, 2026

Fix tool denial context lost across durable workflow steps #3009

Merged

Conversation

anubra266 commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Problem

Changes

Uh oh!

changeset-bot Bot commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🦋 Changeset detected

Uh oh!

vercel Bot commented Mar 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pullfrog Bot commented Mar 3, 2026

Uh oh!

pullfrog Bot left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

PR Review Summary

🟠⚠️ Major (4) 🟠⚠️

🟡 Minor (1) 🟡

💭 Consider (2) 💭

🚫 REQUEST CHANGES

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

PR Review Summary

✅ Prior Feedback Addressed

💭 Consider (1) 💭

🔍 Delta Changes Verified

✅ APPROVE

Uh oh!

claude Bot Mar 3, 2026

Choose a reason for hiding this comment

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

PR Review Summary

🔍 Delta Analysis

🕐 Pending Recommendations (1)

✅ APPROVE

Uh oh!

amikofalvy Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

nick-inkeep Mar 4, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

claude Bot left a comment

Choose a reason for hiding this comment

PR Review Summary

🔍 Delta Analysis

🕐 Pending Recommendations (1)

📝 Human Review Discussion (Acknowledged)

✅ APPROVE

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

anubra266 commented Mar 3, 2026 •

edited

Loading

changeset-bot Bot commented Mar 3, 2026 •

edited

Loading

vercel Bot commented Mar 3, 2026 •

edited

Loading

pullfrog Bot left a comment •

edited

Loading