Fix supervisor: report vault execution so stuck-scan order isn't fixed #187

Open
holyfuchs wants to merge 13 commits into main from holyfuchs/supervisor-fix
Conversation


@holyfuchs (Member) commented Feb 25, 2026

Closes: #177

Description

The supervisor's “check the first N vaults” logic previously scanned the same fixed set of vaults on every run. Vault executions are now reported to the registry, which maintains a list ordered by “least recently executed”. The supervisor then scans only the first N vaults in that order (e.g. 5) and recovers the ones that are actually stuck, instead of always checking the same fixed set.

What was implemented

  • Execution callback
    Each AutoBalancer now has an execution callback that runs after a scheduled rebalance. The callback calls the registry with that vault’s id so the registry can update its internal order (remove id from the list, append to the end).

  • Shared callback resource
    In FlowYieldVaultsAutoBalancers, a single RegistryReportCallback resource per account implements DeFiActions.AutoBalancerExecutionCallback. Its onExecuted(balancerUUID) calls the registry so the vault that just ran is reported by id. Every new AutoBalancer gets a capability to this shared callback and passes it to setExecutionCallback(cap).
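The callback wiring described above can be sketched roughly as follows. This is a hypothetical sketch with simplified signatures: `RegistryReportCallback`, `onExecuted(balancerUUID)`, and `setExecutionCallback(cap)` are named in this PR, but the precise `DeFiActions` interface and the balancer-UUID-to-vault-id mapping are assumptions here.

```cadence
// Hypothetical sketch, not the PR's exact code.
access(all) resource RegistryReportCallback: DeFiActions.AutoBalancerExecutionCallback {

    /// Runs after a scheduled rebalance so the registry can move the
    /// vault to the "most recently executed" end of its scan order.
    access(all) fun onExecuted(balancerUUID: UInt64) {
        // Assumes the balancer UUID is the id the registry tracks.
        FlowYieldVaultsSchedulerRegistry.reportExecution(yieldVaultID: balancerUUID)
    }
}

// One shared instance lives in the FlowYieldVaultsAutoBalancers account;
// every new AutoBalancer is handed a capability to it:
//   balancer.setExecutionCallback(cap)
```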

Context (from discussion)

The supervisor was limited to processing a small batch (e.g. the first 5 vaults) per run. The agreed short-term approach was to order the vault list by “last executed” so the supervisor always checks the oldest / least recently executed vaults first, since those are the most likely to be stuck.

@holyfuchs force-pushed the holyfuchs/supervisor-fix branch from a6cce3f to b2af175 February 25, 2026 20:16
@holyfuchs holyfuchs changed the title feat(scheduler): FlowAutoBalancer + stuck-scan order for Supervisor Fix supervisor: report vault execution so stuck-scan order isn't fixed Feb 25, 2026
@holyfuchs holyfuchs self-assigned this Mar 2, 2026
@holyfuchs holyfuchs marked this pull request as ready for review March 3, 2026 19:44
@holyfuchs holyfuchs requested a review from a team as a code owner March 3, 2026 19:44

@holyfuchs (Member Author) commented Mar 4, 2026

Failing tests are due to rounding issues and should be fixed by onflow/FlowALP#188.

Just for info:
This PR required changes in FlowActions.
FlowYieldVaults includes FlowALP as a submodule, which in turn includes FlowActions as a submodule, hence the need to update FlowALP.
But since FlowALP's main branch contains changes that are currently incompatible with FlowYieldVaults, we are using a different branch.

@holyfuchs (Member Author) commented

Failing tests need: #182

@Kay-Zee (Member) left a comment:
LGTM, but you mentioned it still requires FlowActions changes? Or are those resolved?

if !(self.yieldVaultRegistry[yieldVaultID] ?? false) {
return
}
if let index = self.stuckScanOrder.firstIndex(of: yieldVaultID) {
A Member commented on the snippet above:
Not that there's a solution, but I wonder what the size limit of this array will be before we see problems, since we're just doing array ops.

@holyfuchs (Member Author) replied Mar 10, 2026:
There is a "solution".
In the most recent commit I used a linked list.
I'm not really happy about implementing a linked list in Cadence, but it gets execution time down to O(1).
If you prefer the old version without the added complexity, we can always revert that commit.
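For context, the O(1) reorder works roughly like this. The block below is a hypothetical sketch, not the PR's exact code; it follows the `ListNode` convention quoted in the review thread, where `prev` points toward the head (most recently executed) and `next` toward the tail (least recently executed). The contract name and all identifiers are illustrative.

```cadence
// Hypothetical sketch: a dictionary-backed doubly-linked list that
// reorders a vault in O(1) when its execution is reported.
access(all) contract StuckScanOrderSketch {

    access(all) struct ListNode {
        access(all) var prev: UInt64?  // toward head (most recently executed)
        access(all) var next: UInt64?  // toward tail (least recently executed)

        init(prev: UInt64?, next: UInt64?) {
            self.prev = prev
            self.next = next
        }

        // Cadence struct fields are only writable from within the
        // struct, so neighbors are patched through setters.
        access(all) fun setPrev(_ id: UInt64?) { self.prev = id }
        access(all) fun setNext(_ id: UInt64?) { self.next = id }
    }

    access(self) var nodes: {UInt64: ListNode}
    access(self) var head: UInt64?
    access(self) var tail: UInt64?

    /// O(1): unlink the vault's node, then re-insert it at the head.
    access(account) fun reportExecution(yieldVaultID: UInt64) {
        if let node = self.nodes[yieldVaultID] {
            // Structs are value types, so read-modify-write neighbors.
            if let prevID = node.prev {
                var p = self.nodes[prevID]!
                p.setNext(node.next)
                self.nodes[prevID] = p
            } else {
                self.head = node.next
            }
            if let nextID = node.next {
                var n = self.nodes[nextID]!
                n.setPrev(node.prev)
                self.nodes[nextID] = n
            } else {
                self.tail = node.prev
            }
        }
        // Re-insert as most recently executed.
        self.nodes[yieldVaultID] = ListNode(prev: nil, next: self.head)
        if let oldHead = self.head {
            var h = self.nodes[oldHead]!
            h.setPrev(yieldVaultID)
            self.nodes[oldHead] = h
        }
        self.head = yieldVaultID
        if self.tail == nil {
            self.tail = yieldVaultID
        }
    }

    init() {
        self.nodes = {}
        self.head = nil
        self.tail = nil
    }
}
```

The supervisor then walks at most N nodes from `tail`, so the least recently executed (most likely stuck) vaults are always scanned first.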

@holyfuchs force-pushed the holyfuchs/supervisor-fix branch from 293dd3a to c7fa0e6 ("added autobalancer callback to find potentially stuck vaults") March 10, 2026 05:03
@liobrasil (Contributor) commented:

I do have this PR against @holyfuchs's actual branch: #207


/// Node in the simulated doubly-linked list used for O(1) stuck-scan ordering.
/// `prev` points toward the head (most recently executed); `next` points toward the tail (oldest/least recently executed).
access(all) struct ListNode {
A Member commented on the `ListNode` snippet above:
Nice work with the doubly linked list!
Minor comment: I think we can extract it from this contract into a utility contract for cleaner code. I asked Claude; the decision is up to you:

The core list logic is already self-contained. The only coupling is in getStuckScanCandidates, which touches scheduleCaps and calls dequeuePending. The node operations themselves touch nothing external.

Here's the clean extraction boundary:

┌──────────────────────────────────────────────────────────────┐
│  NEW: UInt64LinkedList contract  (or resource)               │
│                                                              │
│  struct ListNode { prev: UInt64?, next: UInt64? }            │
│  var nodes: {UInt64: ListNode}                               │
│  var head: UInt64?                                           │
│  var tail: UInt64?                                           │
│                                                              │
│  access(account) fun insertAtHead(id: UInt64)                │
│  access(account) fun remove(id: UInt64): Bool                │
│  access(all) view fun contains(id: UInt64): Bool             │
│  access(all) view fun tailWalk(limit: UInt): [UInt64]        │
└──────────────────────────────────────────────────────────────┘
          ▲ called by
┌──────────────────────────────────────────────────────────────┐
│  FlowYieldVaultsSchedulerRegistry  (unchanged business API)  │
│                                                              │
│  register() → list.insertAtHead(id) if participates          │
│  unregister() → list.remove(id)                              │
│  reportExecution() → list.remove(id); list.insertAtHead(id)  │
│  getStuckScanCandidates() → list.tailWalk() + prune logic    │
│    (coupling to scheduleCaps + dequeuePending stays here)    │
└──────────────────────────────────────────────────────────────┘

As a resource (cleanest Cadence idiom):

access(all) contract UInt64LinkedList {
    access(all) struct ListNode {
        access(all) var prev: UInt64?
        access(all) var next: UInt64?
        // ... setters ...
    }

    access(all) resource List {
        access(all) var nodes: {UInt64: ListNode}
        access(all) var head: UInt64?
        access(all) var tail: UInt64?

        access(all) fun insertAtHead(id: UInt64) { ... }  // O(1)
        access(all) fun remove(id: UInt64): Bool { ... }  // O(1)
        access(all) view fun contains(id: UInt64): Bool { ... }
    }

    access(all) fun create(): @List {
        return <- create List(...)
    }
}

SchedulerRegistry stores one @UInt64LinkedList.List resource and delegates node operations to it. Changes required in SchedulerRegistry: replace the three fields and two private functions with a single @UInt64LinkedList.List field — roughly a 40-line reduction. The getStuckScanCandidates signature and behavior are unchanged.

The only decision is whether to use a resource (idiomatic, but requires the registry to store it) or a separate contract with its own module-level state (simpler to call, but less composable). The resource approach is preferred since it makes the list lifetime explicit and lets you instantiate multiple lists per contract if needed later.


Development

Successfully merging this pull request may close these issues.

Supervisor is only supervising 5 vaults
