Skip to content

fix(agent): follow-up review fixes for tinyagents parity (#4451–#4469)#4504

Merged
senamakel merged 4 commits into
tinyhumansai:mainfrom
senamakel:fix/tinyagents-parity-followups
Jul 4, 2026
Merged

fix(agent): follow-up review fixes for tinyagents parity (#4451–#4469)#4504
senamakel merged 4 commits into
tinyhumansai:mainfrom
senamakel:fix/tinyagents-parity-followups

Conversation

@senamakel

Copy link
Copy Markdown
Member

Follow-up to #4495 — addresses the 5 review-bot findings that were left unresolved when it merged. All seam-side fixes on top of current main.

Fixes

Testing

  • Frontend change typechecks clean.
  • ⚠️ Rust changes are verified via CI — the local shared box OOM-kills full cargo builds. Tests for the changed lines are being added in follow-up commits on this branch.

Submission Checklist

  • N/A: no schema/RPC surface change — pure behavior fixes to existing seams.
  • Tests: frontend typecheck clean; Rust/coverage validated in CI (added on this branch).
  • N/A: no user-facing feature added (bug fixes; frontend change is data plumbing to existing UI).
  • No secrets or PII added; scrubbing paths untouched.
  • Follows repo conventions (seam-side, verbose logging, no vendor/tinyagents edits).

Refs #4451#4469 (tracker #4470).

https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR

…i#4451-tinyhumansai#4469)

Addresses the 5 review-bot findings left open when tinyhumansai#4495 merged:

- observability: preserve progress-event ordering under backpressure via a
  single ordered overflow forwarder (was per-event spawned send() that could
  land a ToolCallCompleted before its Started, leaving a tool row stuck
  "running") (tinyhumansai#4466).
- observability: ModelCallCompleted now emits the carried cache-creation /
  reasoning token breakdown instead of the crate usage (0 for some providers)
  so per-call/Langfuse telemetry matches the cost tracker (tinyhumansai#4467).
- routes/mod: fallback route ProviderModels share the bridge's provider usage
  carry, so a successful fallback call reports charged-USD/context/cache/
  reasoning instead of a catalogue estimate (tinyhumansai#4467).
- update_memory_md: add an fs2 cross-process advisory flock around the
  read-modify-write so cron subprocesses can't clobber MEMORY.md (the
  in-process mutex only serialized one process) (tinyhumansai#4458).
- frontend: plumb the structured `failure` from ChatSubagentToolResultEvent
  into the subagent tool-call row so a failed child keeps its why/next copy
  live instead of losing it until a snapshot reload (tinyhumansai#4459).

Claude-Session: https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR
@senamakel senamakel requested a review from a team July 4, 2026 17:33
@coderabbitai

coderabbitai Bot commented Jul 4, 2026

Copy link
Copy Markdown
Contributor

Warning

Review limit reached

You’ve reached a temporary PR review limit under our Fair Usage Limits Policy.

Your recent review volume is higher than typical usage, so adaptive limits are currently applied.

Next review available in: 3 minutes

Enable usage-based reviews in Billing to review now. Otherwise, wait until the next included review is available.
You're only billed for reviews past your plan's rate limits ($0.25/file).

How can I continue?

After more reviews become available, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

To avoid repeated limits, reduce automatic review volume by pausing incremental auto-reviews earlier, using label-based review opt-in, excluding WIP or generated PR titles, or requesting reviews manually when the PR is ready. If your team needs uninterrupted high-volume reviews, an organization admin can enable usage-based reviews.

How do review limits work?

CodeRabbit enforces per-developer PR review limits for each organization. Most developers receive the normal plan review availability.

For paid Pro and Pro+ PR reviews, CodeRabbit uses adaptive limits for sustained high-volume activity. When a developer's recent PR review activity reaches the 95th percentile or higher among CodeRabbit users, additional reviews become available more gradually as earlier reviews age out of the rolling window.

Please refer docs for additional details.

Review details
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 388709ee-cbfb-4aa6-a0f2-b671df79dfbc

📥 Commits

Reviewing files that changed from the base of the PR and between bb27fe1 and 7180329.

📒 Files selected for processing (9)
  • app/src/providers/ChatRuntimeProvider.tsx
  • app/src/providers/__tests__/ChatRuntimeProvider.test.tsx
  • app/src/services/chatService.ts
  • app/src/store/chatRuntimeSlice.ts
  • app/src/types/turnState.ts
  • src/openhuman/tinyagents/mod.rs
  • src/openhuman/tinyagents/observability.rs
  • src/openhuman/tinyagents/routes.rs
  • src/openhuman/tools/impl/filesystem/update_memory_md.rs

Comment @coderabbitai help to get the list of available commands.

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 812a507b3c

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

Comment thread app/src/providers/ChatRuntimeProvider.tsx
senamakel added 3 commits July 4, 2026 17:40
…4459)

Asserts a failed subagent_tool_result stores the parsed structured failure
(class/category/causePlain/nextAction) on the child tool-call row.

Claude-Session: https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR
…tinyhumansai#4459)

The live path renders subagent.transcript (not the fallback toolCalls list)
once a child has emitted a tool call, so resolveSubagentTranscriptTool must
also carry the parsed failure — otherwise a failed child tool lost its why/next
until a snapshot reload. Adds failure to the runtime + persisted transcript
tool item, the resolver payload/reducer, and hydration; extends the test to
assert the transcript path.

Claude-Session: https://claude.ai/code/session_019j5TLsRLHsM3kqAYFyH4hR
@senamakel senamakel merged commit d1b5eeb into tinyhumansai:main Jul 4, 2026
12 checks passed

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 718032968a

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

if (result != null) item.result = result;
// Carry the structured why/next onto the rendered transcript item; a
// successful result clears any stale failure (#4459).
item.failure = success ? undefined : failure;

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Render subagent failures instead of only storing them

When a sub-agent child tool fails in the normal transcript-present path, this only stores failure on the transcript item, but the renderers for those items (ToolTimelineBlock inline rows and SubagentDrawer rows) still display only the tool name/status/detail/timing and never read the new field. In that scenario the UI still shows just “Failed” and the why/next copy remains invisible, so this plumbing needs to be connected to the subagent row renderers as well.

Useful? React with 👍 / 👎.

Comment on lines +64 to +68
let file = std::fs::OpenOptions::new()
.create(true)
.write(true)
.truncate(false)
.open(&lock_path)

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

P2 Badge Reject symlinked workspace lock files

If a workspace already contains .memory-write.lock as a symlink (including a dangling symlink), this new OpenOptions call follows it with create(true).write(true) before any containment check. That lets a project-controlled path make update_memory_md create/open/lock a file outside the workspace, bypassing the symlink hardening applied to MEMORY.md/SKILL.md; the lock file should be opened with no-follow semantics or prevalidated as a regular file inside the canonical workspace.

Useful? React with 👍 / 👎.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant