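Document remaining AX Agent parity work and settings cleanup criteria, and fix card hover flicker

- Record the remaining core engine/UI differences between AX Agent and claw-code, and the current estimated progress, in the parity plan document
- Sort runtime-affecting settings such as PlanMode, FreeTierDelaySeconds, MaxAgentIterations, and MaxRetryOnError into removal and developer-only candidates
- Remove the flicker on task-type cards caused by the custom hover label conflicting with the default ToolTip
- Reflect the reasons for the changes and the verification results in the README and DEVELOPMENT documents immediately

Verification results
- dotnet build src/AxCopilot/AxCopilot.csproj -c Release -v minimal -p:OutputPath=bin\\verify\\ -p:IntermediateOutputPath=obj\\verify\\
- 0 warnings / 0 errors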

2026-04-05 16:44:35 +09:00

Claw Code Parity Plan (Rewritten)

Scope

Align AX Copilot with claw-code quality for loop reliability, permission/hook behavior, and session durability.

Update

Updated: 2026-04-05 15:34 (KST)
Rebased the AX Agent improvement plan on actual claw-code runtime files instead of earlier AX snapshots. The reference spine is now src/bootstrap/state.ts -> src/bridge/initReplBridge.ts -> src/bridge/sessionRunner.ts -> src/screens/REPL.tsx -> src/components/Messages.tsx -> src/components/StatusLine.tsx.
AX Agent work should follow that same quality order: state first, execution second, render last. UI-only fixes that bypass state/execution should be treated as temporary.
Updated: 2026-04-05 16:55 (KST)
Current estimated parity vs claw-code: core execution engine 82%, main chat UI 68%, Cowork/Code status UX 63%, internal settings linkage 88%, overall AX Agent 74%.
Engine-affecting settings should be handled conservatively during parity work. If a setting changes the main execution route, approval flow, or recovery behavior without representing a stable real-world user choice, it should be moved to developer-only UI or removed from user-facing surfaces.

Preserved History (Summary)

Core loop guards and post-tool verification gates are already partially implemented.
Plan Mode, parallel tool execution, and unknown-tool recovery are in place.
Session restore hardening is ongoing.

Reference Map

claw-code reference	AX apply target	completion criteria	quality criteria
`src/bootstrap/state.ts`	`src/AxCopilot/Views/ChatWindow.xaml.cs`, `src/AxCopilot/Services/Agent/AxAgentExecutionEngine.cs`, `src/AxCopilot/Services/ChatStorageService.cs`	one canonical runtime/session state for current turn, queue, retry, execution events, and persisted snapshot	reopen/retry/queue flows do not create duplicate or blank assistant messages
`src/bridge/initReplBridge.ts`	`src/AxCopilot/Services/Agent/AxAgentExecutionEngine.cs`, `src/AxCopilot/Services/LlmService.cs`	send/regenerate/retry/queued follow-up/slash all enter through one prepared-execution path	same input under same settings takes same execution route regardless of entry point
`src/bridge/sessionRunner.ts`	`src/AxCopilot/Services/Agent/AgentLoopService.cs`, `src/AxCopilot/Services/Agent/AgentLoopTransitions.cs`, `src/AxCopilot/Services/Agent/AgentLoopTransitions.Execution.cs`	tool start/result/error/progress normalized once inside loop layer	Cowork/Code no longer flash repeated status strings or overshare debug payloads
`src/bridge/bridgeMessaging.ts`	`src/AxCopilot/Views/ChatWindow.xaml.cs`, `src/AxCopilot/Services/Agent/AgentLoopService.cs`	inbound execution events separated from display-only events before UI render	execution event replay does not duplicate visible timeline banners
`src/screens/REPL.tsx`	`src/AxCopilot/Views/ChatWindow.xaml`, `src/AxCopilot/Views/ChatWindow.xaml.cs`	screen state transitions, queue flow, retry flow, and composer state use shared runtime helpers	window resize, queue chaining, and retry feel stable instead of UI-patched
`src/components/Messages.tsx`	`src/AxCopilot/Views/ChatWindow.xaml.cs`	timeline derives from normalized conversation/session state only	no token-only completions, blank cards, or direct injected duplicates
`src/components/StatusLine.tsx`	`src/AxCopilot/Views/ChatWindow.xaml`, `src/AxCopilot/Views/ChatWindow.xaml.cs`	status strip computed from debounced runtime state, not multiple imperative refresh calls	metadata stays lightweight and does not overpower message timeline

AX Agent Improvement Phases

Phase A. Runtime State Canonicalization

Reference: src/bootstrap/state.ts
AX apply location: src/AxCopilot/Views/ChatWindow.xaml.cs, src/AxCopilot/Services/Agent/AxAgentExecutionEngine.cs, src/AxCopilot/Services/ChatStorageService.cs
Completion criteria:
- Chat, Cowork, Code all update one shared runtime/session state model.
- queue, retry, post-compaction, and execution-event state can be restored after reopen.
Quality criteria:
- reopening a conversation reproduces the same visible timeline without extra assistant cards.
- queue and execution badges remain in sync with the stored conversation.
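
A minimal sketch of what "one shared runtime/session state model" with a restorable snapshot could look like. The type and field names below are hypothetical and are not taken from AX Copilot or claw-code; the point is only that queue, retry, and execution-event state travel together in one snapshot.

```typescript
// Hypothetical shape: field names are illustrative, not from AX Copilot or claw-code.
type Tab = "Chat" | "Cowork" | "Code";

interface RuntimeSessionState {
  tab: Tab;
  currentTurnId: string | null;
  queue: string[];           // pending user requests, oldest first
  retryCount: number;
  executionEvents: string[]; // normalized event summaries
}

// Persist the whole state as one snapshot so reopen restores every field together.
function toSnapshot(state: RuntimeSessionState): string {
  return JSON.stringify(state);
}

function fromSnapshot(snapshot: string): RuntimeSessionState {
  const raw = JSON.parse(snapshot) as RuntimeSessionState;
  // Copy the arrays so the restored state never shares references with parsed data.
  return { ...raw, queue: [...raw.queue], executionEvents: [...raw.executionEvents] };
}

const before: RuntimeSessionState = {
  tab: "Cowork",
  currentTurnId: "turn-3",
  queue: ["summarize the report"],
  retryCount: 1,
  executionEvents: ["tool:start read_file", "tool:result read_file"],
};
const after = fromSnapshot(toSnapshot(before));
```

Under these assumptions, `after` is field-for-field equal to `before`, which is the property the reopen quality criteria above ask for.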

Phase B. Prepared Execution Unification

Reference: src/bridge/initReplBridge.ts
AX apply location: src/AxCopilot/Services/Agent/AxAgentExecutionEngine.cs, src/AxCopilot/Services/LlmService.cs
Completion criteria:
- prompt stack assembly, execution mode choice, and final assistant commit are engine-owned.
- send/regenerate/retry/queued follow-up/slash flows all call the same preparation API.
Quality criteria:
- behavior is deterministic per tab/settings combination.
- UI stops building different prompt stacks for the same conversation state.
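
The single preparation path can be sketched as one function that every entry point calls. All names here are hypothetical; the sketch only illustrates that the entry point is an input for tracing and does not affect the prepared result.

```typescript
// Hypothetical API sketch: none of these names come from AX Copilot or claw-code.
type EntryPoint = "send" | "regenerate" | "retry" | "queued" | "slash";

interface ExecutionSettings {
  tab: "Chat" | "Cowork" | "Code";
  systemPrompt: string;
}

interface PreparedExecution {
  mode: "chat" | "agent-loop";
  promptStack: string[];
}

// Single preparation path: the entry point is accepted so callers can be traced,
// but it deliberately has no influence on the prepared result.
function prepareExecution(
  entry: EntryPoint,
  conversation: readonly string[],
  settings: ExecutionSettings,
): PreparedExecution {
  void entry;
  return {
    mode: settings.tab === "Chat" ? "chat" : "agent-loop",
    promptStack: [settings.systemPrompt, ...conversation],
  };
}
```

With this shape, calling `prepareExecution` with `"send"` or `"retry"` on the same conversation and settings yields the same prompt stack and mode, which matches the determinism criterion.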

Phase C. AgentLoop Event Normalization

Reference: src/bridge/sessionRunner.ts, src/bridge/bridgeMessaging.ts
AX apply location: src/AxCopilot/Services/Agent/AgentLoopService.cs, src/AxCopilot/Services/Agent/AgentLoopTransitions.cs, src/AxCopilot/Services/Agent/AgentLoopTransitions.Execution.cs
Completion criteria:
- loop events are normalized into bounded activity/event records before UI consumption.
- permission requests, failure states, retries, and completion states use a stable event shape.
Quality criteria:
- Cowork/Code no longer flash rapidly during long-running tool sequences.
- file path/debug detail remains collapsed by default.
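
One way to read "bounded activity/event records" is a capped log that merges consecutive identical events. The record shape, the helper name, and the bound of 50 are all assumptions made for this sketch; the plan does not specify them.

```typescript
// Hypothetical event shape and helper name, for illustration only.
type ActivityKind = "permission" | "tool" | "error" | "retry" | "complete";

interface ActivityRecord {
  kind: ActivityKind;
  summary: string;       // short text shown by default
  detail: string | null; // file paths / debug payload, collapsed by default
  repeat: number;        // how many consecutive identical events were merged
}

const MAX_RECORDS = 50; // assumed bound; the plan does not give a value

function appendActivity(
  log: readonly ActivityRecord[],
  kind: ActivityKind,
  summary: string,
  detail: string | null = null,
): ActivityRecord[] {
  const last = log[log.length - 1];
  // Merge consecutive duplicates so a long tool run does not re-flash the same status.
  if (last !== undefined && last.kind === kind && last.summary === summary) {
    return [...log.slice(0, -1), { ...last, repeat: last.repeat + 1 }];
  }
  const next: ActivityRecord[] = [...log, { kind, summary, detail, repeat: 1 }];
  // Keep only the newest MAX_RECORDS entries.
  return next.slice(-MAX_RECORDS);
}
```

The UI would then render `summary` and a repeat count, and show `detail` only when the user expands the record.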

Phase D. Timeline Render Parity

Reference: src/screens/REPL.tsx, src/components/Messages.tsx
AX apply location: src/AxCopilot/Views/ChatWindow.xaml, src/AxCopilot/Views/ChatWindow.xaml.cs
Completion criteria:
- assistant/user messages, execution logs, compact boundaries, and queue summaries are rendered from one derived timeline model.
- direct imperative bubble injection is removed from normal send/regenerate/retry flows.
Quality criteria:
- no blank assistant cards.
- no token-only completion without visible content.
- no duplicate event banners after re-render.
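
The three quality criteria above can be expressed as a pure derivation over stored items. This is a sketch under assumed names (`StoredItem`, `deriveTimeline`); it treats a token-only completion as an assistant item with no visible text.

```typescript
// Hypothetical model: names are illustrative and not from the AX Copilot codebase.
interface StoredItem {
  id: string;
  role: "user" | "assistant" | "event";
  text: string;
}

// Pure derivation: the same stored items always yield the same visible timeline.
function deriveTimeline(items: readonly StoredItem[]): StoredItem[] {
  const seen = new Set<string>();
  const timeline: StoredItem[] = [];
  for (const item of items) {
    // Drop assistant items that carry no visible text (blank cards).
    if (item.role === "assistant" && item.text.trim() === "") continue;
    // Drop replayed items so re-rendering cannot duplicate banners.
    if (seen.has(item.id)) continue;
    seen.add(item.id);
    timeline.push(item);
  }
  return timeline;
}
```

Because the function has no side effects, re-rendering after a replay produces the same list instead of appending to it.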

Phase E. Composer and Status Strip Simplification

Reference: src/screens/REPL.tsx, src/components/StatusLine.tsx
AX apply location: src/AxCopilot/Views/ChatWindow.xaml, src/AxCopilot/Views/ChatWindow.xaml.cs
Completion criteria:
- composer height grows only on explicit line breaks.
- status strip, queue summary, and runtime activity all use debounced runtime updates.
- Chat/Cowork/Code share one responsive width calculation policy.
Quality criteria:
- resizing feels natural.
- composer does not keep growing after send.
- metadata remains subordinate to the message timeline.
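
A small sketch of two of the completion criteria. `composerHeight` computes height from explicit line breaks only, and `CoalescedStatus` is a timer-free stand-in for a debounced status strip: it keeps only the latest update until something calls `flush()`. The constants and names are hypothetical.

```typescript
// Hypothetical constants and names: the plan does not give pixel values or a line cap.
const LINE_HEIGHT_PX = 20;
const MAX_VISIBLE_LINES = 6;

// Height depends only on explicit "\n" breaks, never on soft wrapping of long lines.
function composerHeight(text: string): number {
  const lines = text.split("\n").length;
  return Math.min(lines, MAX_VISIBLE_LINES) * LINE_HEIGHT_PX;
}

// Coalesces bursts of status updates: only the latest value is rendered per flush.
// A real UI would call flush() from a timer; the timer is omitted to keep this testable.
class CoalescedStatus {
  private pending: string | null = null;
  readonly rendered: string[] = [];

  update(text: string): void {
    this.pending = text;
  }

  flush(): void {
    if (this.pending === null) return;
    this.rendered.push(this.pending);
    this.pending = null;
  }
}
```

After send, the composer text is empty, so `composerHeight("")` returns the single-line height and the composer cannot keep growing.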

Phase F. Recovery, Resume, and Verification

Reference: src/bootstrap/state.ts, src/bridge/sessionRunner.ts, src/screens/REPL.tsx
AX apply location: src/AxCopilot/Views/ChatWindow.xaml.cs, src/AxCopilot/Services/Agent/AxAgentExecutionEngine.cs, src/AxCopilot/Services/ChatStorageService.cs
Completion criteria:
- reopen after interruption keeps queue, runtime summary, and latest visible assistant state consistent.
- retry-last and regenerate do not depend on mutating InputBox.Text.
- all three tabs pass reopen/retry/manual compact/manual stop/manual resume scenarios.
Quality criteria:
- stored conversation and rendered conversation stay identical after restore.
- final reopened state matches the last completed runtime state.
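
To illustrate retry-last without touching `InputBox.Text`, the retry input can be read from the stored conversation. The type and function names are hypothetical.

```typescript
// Hypothetical types: illustrative only.
interface ChatMessage {
  role: "user" | "assistant";
  text: string;
}

// Retry reads the last user message from stored state instead of a UI text box.
function lastUserPrompt(conversation: readonly ChatMessage[]): string | null {
  for (let i = conversation.length - 1; i >= 0; i--) {
    const message = conversation[i];
    if (message !== undefined && message.role === "user") return message.text;
  }
  return null; // nothing to retry
}
```

Since the source of truth is the stored conversation, the same call works after a reopen, when the input box is empty.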

Execution Tracks

Hook contract parity

Structured hook output support (updatedInput, updatedPermissions, additionalContext).
Runtime gating through settings toggles.
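
A sketch of how structured hook output might be applied under settings toggles. Only the field names `updatedInput`, `updatedPermissions`, and `additionalContext` come from this plan; the value types, the toggle names, and `applyHookOutput` are assumptions.

```typescript
// updatedInput / updatedPermissions / additionalContext are named in the plan.
// Every other name and all value types here are hypothetical placeholders.
type PermissionDecision = "allow" | "deny" | "ask";

interface HookOutput {
  updatedInput?: string;
  updatedPermissions?: Record<string, PermissionDecision>;
  additionalContext?: string;
}

interface ToolCall {
  input: string;
  permissions: Record<string, PermissionDecision>;
  context: string[];
}

interface HookToggles {
  allowInputUpdate: boolean;
  allowPermissionUpdate: boolean;
  allowContext: boolean;
}

// Each hook field is applied only when present and when its toggle is on.
function applyHookOutput(call: ToolCall, hook: HookOutput, toggles: HookToggles): ToolCall {
  return {
    input:
      toggles.allowInputUpdate && hook.updatedInput !== undefined
        ? hook.updatedInput
        : call.input,
    permissions:
      toggles.allowPermissionUpdate && hook.updatedPermissions !== undefined
        ? { ...call.permissions, ...hook.updatedPermissions }
        : call.permissions,
    context:
      toggles.allowContext && hook.additionalContext !== undefined
        ? [...call.context, hook.additionalContext]
        : call.context,
  };
}
```

With every toggle off, the function returns the original input, permissions, and context unchanged, so a hook cannot alter execution unless the settings allow it.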

Session/state parity

Deterministic run resume rules.
Stable jsonl event schema + replay compatibility.
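
A possible shape for a versioned jsonl event log with replay. The schema fields (`v`, `seq`, `type`, `payload`) and the choice to skip unknown versions are assumptions for this sketch, not the project's actual schema.

```typescript
// Hypothetical schema: the plan only states that the jsonl schema should be stable.
const SCHEMA_VERSION = 1;

interface SessionEvent {
  v: number;       // schema version
  seq: number;     // monotonically increasing sequence number
  type: string;
  payload: string;
}

// One JSON object per line. JSON.stringify escapes newlines inside payloads.
function toJsonl(events: readonly SessionEvent[]): string {
  return events.map((e) => JSON.stringify(e)).join("\n");
}

// Replay skips lines from other schema versions instead of failing the whole session,
// then orders events by seq so the result does not depend on file order.
function replay(jsonl: string): SessionEvent[] {
  const events: SessionEvent[] = [];
  for (const line of jsonl.split("\n")) {
    if (line.trim() === "") continue;
    const parsed = JSON.parse(line) as SessionEvent;
    if (parsed.v !== SCHEMA_VERSION) continue;
    events.push(parsed);
  }
  return events.sort((a, b) => a.seq - b.seq);
}
```

Ordering by `seq` is one way to make resume deterministic: replaying the same file always yields the same event order.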

Recovery parity

Failure-type classification and standardized retry guidance.
Reduced repeated wrong-tool loops.
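
A rough sketch of failure classification plus a repeated-failure check. The categories, the string matching, the guidance texts, and the limit of three are all assumptions; a real classifier would more likely use structured error codes.

```typescript
// Hypothetical categories, guidance texts, and threshold.
type FailureType = "unknown-tool" | "invalid-arguments" | "timeout" | "other";

interface FailureRecord {
  tool: string;
  type: FailureType;
}

function classifyFailure(message: string): FailureType {
  const m = message.toLowerCase();
  if (m.includes("unknown tool")) return "unknown-tool";
  if (m.includes("invalid argument")) return "invalid-arguments";
  if (m.includes("timed out") || m.includes("timeout")) return "timeout";
  return "other";
}

// One standardized guidance string per failure type.
const RETRY_GUIDANCE: Record<FailureType, string> = {
  "unknown-tool": "Pick a tool from the registered tool list.",
  "invalid-arguments": "Re-read the tool schema and fix the arguments.",
  "timeout": "Retry once with a smaller scope.",
  "other": "Report the error and stop retrying.",
};

// True when the same tool failed with the same failure type `limit` times in a row.
function isStuckLoop(failures: readonly FailureRecord[], limit = 3): boolean {
  if (failures.length < limit) return false;
  const tail = failures.slice(-limit);
  const first = tail[0];
  return (
    first !== undefined &&
    tail.every((f) => f.tool === first.tool && f.type === first.type)
  );
}
```

When `isStuckLoop` returns true, the loop could stop retrying the same call and surface `RETRY_GUIDANCE` for that failure type instead.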

Completion parity

Evidence-based finalization criteria for code/document tasks.

Done Criteria

Internal parity scenarios pass target threshold.
Resume/replay failures: zero.
dotnet build warnings/errors: zero.

Validation Matrix

Build: dotnet build src/AxCopilot/AxCopilot.csproj -c Release -v minimal -p:OutputPath=bin\\verify\\ -p:IntermediateOutputPath=obj\\verify\\
Manual scenario 1: Chat send -> answer visible -> retry -> regenerate -> reopen conversation
Manual scenario 2: Cowork tool run -> progress summary -> completion -> queue next request -> reopen
Manual scenario 3: Code task with execution log noise -> completion -> compact -> next turn -> reopen
Manual scenario 4: AX Agent internal settings change -> immediate runtime reflection without layout regression

Current Snapshot

Updated: 2026-04-05 19:42 (KST)
Estimated parity:
- Core engine: 89%
- Main transcript UI: 96%
- Cowork/Code runtime UX: 92%
- Internal settings linkage: 88%
- Overall AX Agent parity: 93%

Remaining Gaps

Prompt lifecycle parity

claw-code reference: src/utils/handlePromptSubmit.ts, src/utils/processUserInput/processTextPrompt.ts
AX gap:
- send / retry / regenerate are mostly unified, but slash, the next turn after compact, and some post-queue handling still touch UI state first in ChatWindow.xaml.cs.
- The goal is for every input entry point to go through the same prepare/execute/finalize path in AxAgentExecutionEngine and nothing else.

Plan / approval rendering parity

claw-code reference: src/components/messages/PlanApprovalMessage.tsx
AX gap:
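- The default transcript has been reduced to mostly compact pills, but approval/plan results are still presented through a mix of Popup/Window and WPF cards.
- The goal is to converge on a more unified timeline language under a "body first, open when needed" rule.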

Status line / composer parity

claw-code reference: src/components/StatusLine.tsx, src/components/PromptInput/PromptInput.tsx
AX gap:
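- The bottom status bar and composer options have been trimmed considerably, but status metadata is still scattered and some toggles/quick settings remain in a separate row.
- The goal is to compress these further into a single action bar below the transcript.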

Runtime event density parity

claw-code reference: src/bridge/sessionRunner.ts, src/components/StatusNotices.tsx
AX gap:
- Default non-debug logging has been reduced, but some Cowork/Code events still disturb the timeline frequently.
- The goal is to normalize permission / tool / error / complete / paused / resumed into a more stable event shape.

Settings Review

Remove candidate:
- PlanMode
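  - current state: effectively removed from the UI, but remnants remain in AppSettings, SettingsViewModel, AppStateService, and AgentLoopService
  - rationale: the current policy pins it to off, so the user-selected value does not contribute meaningfully to the engine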
Move to developer-only candidate:
- FreeTierDelaySeconds
  - rationale: ordinary users have little reason to adjust it, and it directly affects the engine's delay policy
- MaxAgentIterations
- MaxRetryOnError
  - rationale: runtime tuning values that directly affect the quality of the core execution loop
Keep as runtime-critical:
- OperationMode
- MaxContextTokens
- ContextCompactTriggerPercent
- EnableProactiveContextCompact
- EnableCoworkVerification
- EnableCodeVerification
- Code.EnablePlanModeTools / EnableWorktreeTools / EnableTeamTools / EnableCronTools
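
The review above can be captured as a lookup table. The classification is copied from the list; the type name, the helper, and the reading that "keep" means the setting stays on user-facing surfaces are assumptions. The `Code.` prefix is also assumed to apply to all four tool toggles on the last line.

```typescript
// Classification copied from the settings review; type and helper names are hypothetical.
type Disposition = "remove" | "developer-only" | "runtime-critical";

const SETTINGS_REVIEW: Record<string, Disposition> = {
  PlanMode: "remove",
  FreeTierDelaySeconds: "developer-only",
  MaxAgentIterations: "developer-only",
  MaxRetryOnError: "developer-only",
  OperationMode: "runtime-critical",
  MaxContextTokens: "runtime-critical",
  ContextCompactTriggerPercent: "runtime-critical",
  EnableProactiveContextCompact: "runtime-critical",
  EnableCoworkVerification: "runtime-critical",
  EnableCodeVerification: "runtime-critical",
  // Assumption: the "Code." prefix applies to all four tool toggles.
  "Code.EnablePlanModeTools": "runtime-critical",
  "Code.EnableWorktreeTools": "runtime-critical",
  "Code.EnableTeamTools": "runtime-critical",
  "Code.EnableCronTools": "runtime-critical",
};

// Assumption: only settings kept as runtime-critical stay on user-facing surfaces.
// Unreviewed settings return false so they are not exposed by accident.
function isUserFacing(settingName: string): boolean {
  return SETTINGS_REVIEW[settingName] === "runtime-critical";
}
```

A settings screen could filter its rows with `isUserFacing`, leaving developer-only values to a separate developer surface.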

Known UX / Performance Risks

Topic preset hover flicker was caused by duplicate hover systems:
- custom hover label
- default WPF ToolTip
AX fix:
- remove default ToolTip from topic cards and keep a single hover label path
Remaining runtime performance review targets:
- RefreshContextUsageVisual() frequency
- BuildTopicButtons() rebuild frequency
- OnAgentEvent timeline churn during long Cowork/Code runs
