fix(agent): follow-up review fixes for tinyagents parity (#4451–#4469) by senamakel · Pull Request #4504 · tinyhumansai/openhuman

senamakel · 2026-07-04T17:33:05Z

Follow-up to #4495 — addresses the 5 review-bot findings that were left unresolved when it merged. All seam-side fixes on top of current main.

Fixes

observability event ordering (Subagent persistence/observability: failed runs persist nothing, degraded mirror, lossy deltas, tokenjuice off, breaker halt reports Completed #4466) — under channel backpressure the old path spawned a send() task per event, so a later try_send could land a ToolCallCompleted before its ToolCallStarted and leave a tool row stuck "running". Now a single ordered overflow forwarder drains spilled events FIFO; once anything spills, everything queues until drained.
carried token telemetry (Cost & telemetry parity: charged USD lost, cap-summary/failed-run usage unrecorded, elapsed_ms=0, tool DomainEvents gone #4467) — ModelCallCompleted now emits the carried cache-creation/reasoning token breakdown (the crate usage.* is 0 for some providers) so per-call + Langfuse telemetry matches the cost tracker.
fallback-route usage carry (Cost & telemetry parity: charged USD lost, cap-summary/failed-run usage unrecorded, elapsed_ms=0, tool DomainEvents gone #4467) — build_route_models now shares the bridge's ProviderUsageCarry, so a successful fallback-route call reports backend-charged USD / context / cache / reasoning instead of a catalogue estimate.
cross-process MEMORY.md lock (Memory protocol mandates unregistered update_memory_md tool — permanent index-drift nag loop; non-atomic MEMORY.md writes #4458) — added an fs2 exclusive flock around the read-modify-write; the in-process mutex only serialized one process, so cron subprocesses could still clobber the file.
frontend subagent failure (Tool-failure classification: approval denial → 'try again', TTL expiry → auto-retry Timeout, failure missing from snapshot #4459) — plumbed the structured failure from ChatSubagentToolResultEvent into the child tool-call row so a failed sub-agent tool keeps its why/next copy live instead of losing it until a snapshot reload.

Testing

Frontend change typechecks clean.
⚠️ Rust changes are verified via CI — the local shared box OOM-kills full cargo builds. Tests for the changed lines are being added in follow-up commits on this branch.

Submission Checklist

N/A: no schema/RPC surface change — pure behavior fixes to existing seams.
Tests: frontend typecheck clean; Rust/coverage validated in CI (added on this branch).
N/A: no user-facing feature added (bug fixes; frontend change is data plumbing to existing UI).
No secrets or PII added; scrubbing paths untouched.
Follows repo conventions (seam-side, verbose logging, no vendor/tinyagents edits).

Refs #4451–#4469 (tracker #4470).

https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR

…i#4451-tinyhumansai#4469) Addresses the 5 review-bot findings left open when tinyhumansai#4495 merged: - observability: preserve progress-event ordering under backpressure via a single ordered overflow forwarder (was per-event spawned send() that could land a ToolCallCompleted before its Started, leaving a tool row stuck "running") (tinyhumansai#4466). - observability: ModelCallCompleted now emits the carried cache-creation / reasoning token breakdown instead of the crate usage (0 for some providers) so per-call/Langfuse telemetry matches the cost tracker (tinyhumansai#4467). - routes/mod: fallback route ProviderModels share the bridge's provider usage carry, so a successful fallback call reports charged-USD/context/cache/ reasoning instead of a catalogue estimate (tinyhumansai#4467). - update_memory_md: add an fs2 cross-process advisory flock around the read-modify-write so cron subprocesses can't clobber MEMORY.md (the in-process mutex only serialized one process) (tinyhumansai#4458). - frontend: plumb the structured `failure` from ChatSubagentToolResultEvent into the subagent tool-call row so a failed child keeps its why/next copy live instead of losing it until a snapshot reload (tinyhumansai#4459). Claude-Session: https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR

coderabbitai · 2026-07-04T17:33:16Z

Warning

Review limit reached

You’ve reached a temporary PR review limit under our Fair Usage Limits Policy.

Your recent review volume is higher than typical usage, so adaptive limits are currently applied.

Next review available in: 3 minutes

Enable usage-based reviews in Billing to review now. Otherwise, wait until the next included review is available.
You're only billed for reviews past your plan's rate limits ($0.25/file).

How can I continue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

To avoid repeated limits, reduce automatic review volume by pausing incremental auto-reviews earlier, using label-based review opt-in, excluding WIP or generated PR titles, or requesting reviews manually when the PR is ready. If your team needs uninterrupted high-volume reviews, an organization admin can enable usage-based reviews.

How do review limits work?

CodeRabbit enforces per-developer PR review limits for each organization. Most developers receive the normal plan review availability.

For paid Pro and Pro+ PR reviews, CodeRabbit uses adaptive limits for sustained high-volume activity. When a developer's recent PR review activity reaches the 95th percentile or higher among CodeRabbit users, additional reviews become available more gradually as earlier reviews age out of the rolling window.

Please refer docs for additional details.

Review details

⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 388709ee-cbfb-4aa6-a0f2-b671df79dfbc

📥 Commits

Reviewing files that changed from the base of the PR and between bb27fe1 and 7180329.

📒 Files selected for processing (9)

app/src/providers/ChatRuntimeProvider.tsx
app/src/providers/__tests__/ChatRuntimeProvider.test.tsx
app/src/services/chatService.ts
app/src/store/chatRuntimeSlice.ts
app/src/types/turnState.ts
src/openhuman/tinyagents/mod.rs
src/openhuman/tinyagents/observability.rs
src/openhuman/tinyagents/routes.rs
src/openhuman/tools/impl/filesystem/update_memory_md.rs

_{Comment @coderabbitai help to get the list of available commands.}

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 812a507b3c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

…4459) Asserts a failed subagent_tool_result stores the parsed structured failure (class/category/causePlain/nextAction) on the child tool-call row. Claude-Session: https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR

…tinyhumansai#4459) The live path renders subagent.transcript (not the fallback toolCalls list) once a child has emitted a tool call, so resolveSubagentTranscriptTool must also carry the parsed failure — otherwise a failed child tool lost its why/next until a snapshot reload. Adds failure to the runtime + persisted transcript tool item, the resolver payload/reducer, and hydration; extends the test to assert the transcript path. Claude-Session: https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 718032968a

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-07-04T18:17:26Z

      if (result != null) item.result = result;
+      // Carry the structured why/next onto the rendered transcript item; a
+      // successful result clears any stale failure (#4459).
+      item.failure = success ? undefined : failure;


Render subagent failures instead of only storing them

When a sub-agent child tool fails in the normal transcript-present path, this only stores failure on the transcript item, but the renderers for those items (ToolTimelineBlock inline rows and SubagentDrawer rows) still display only the tool name/status/detail/timing and never read the new field. In that scenario the UI still shows just “Failed” and the why/next copy remains invisible, so this plumbing needs to be connected to the subagent row renderers as well.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-07-04T18:17:26Z

+        let file = std::fs::OpenOptions::new()
+            .create(true)
+            .write(true)
+            .truncate(false)
+            .open(&lock_path)


Reject symlinked workspace lock files

If a workspace already contains .memory-write.lock as a symlink (including a dangling symlink), this new OpenOptions call follows it with create(true).write(true) before any containment check. That lets a project-controlled path make update_memory_md create/open/lock a file outside the workspace, bypassing the symlink hardening applied to MEMORY.md/SKILL.md; the lock file should be opened with no-follow semantics or prevalidated as a regular file inside the canonical workspace.

Useful? React with 👍 / 👎.

senamakel requested a review from a team July 4, 2026 17:33

chatgpt-codex-connector Bot reviewed Jul 4, 2026

View reviewed changes

Comment thread app/src/providers/ChatRuntimeProvider.tsx

senamakel added 3 commits July 4, 2026 17:40

style: cargo fmt + prettier on follow-up fixes

01e5cd7

coderabbitai Bot approved these changes Jul 4, 2026

View reviewed changes

senamakel merged commit d1b5eeb into tinyhumansai:main Jul 4, 2026
12 checks passed

chatgpt-codex-connector Bot reviewed Jul 4, 2026

View reviewed changes

senamakel mentioned this pull request Jul 4, 2026

fix(agent): render subagent tool failures + harden memory lock symlinks #4507

Merged

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(agent): follow-up review fixes for tinyagents parity (#4451–#4469)#4504

fix(agent): follow-up review fixes for tinyagents parity (#4451–#4469)#4504
senamakel merged 4 commits into
tinyhumansai:mainfrom
senamakel:fix/tinyagents-parity-followups

senamakel commented Jul 4, 2026

Uh oh!

coderabbitai Bot commented Jul 4, 2026 •

edited

Loading

Review limit reached

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Jul 4, 2026

Uh oh!

chatgpt-codex-connector Bot Jul 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

Conversation

senamakel commented Jul 4, 2026

Fixes

Testing

Submission Checklist

Uh oh!

coderabbitai Bot commented Jul 4, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review limit reached

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Jul 4, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Jul 4, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

coderabbitai Bot commented Jul 4, 2026 •

edited

Loading