Add cancel() and isBusy to InferenceEngine protocol by stikves · Pull Request #32 · apple/coreai-models

stikves · 2026-06-12T19:55:44Z

Adds lifecycle management APIs to the InferenceEngine protocol so callers can gracefully cancel in-flight generation and query busy state.

All three engine implementations (Sequential, Pipelined, StaticShape) and MockEngine are updated with concrete implementations.

CoreAILanguageModel now calls cancel() before reset() to ensure clean state transitions.
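The ordering described above can be sketched with a small test double. This is only an illustration: `EngineLifecycle`, `RecordingEngine`, and `ModelSession` are hypothetical names, not types from the repository, and the real `InferenceEngine` protocol has more requirements than the two lifecycle members shown here.

```swift
// Hypothetical, reduced stand-in for the lifecycle part of InferenceEngine.
protocol EngineLifecycle: AnyObject {
    var isBusy: Bool { get }
    func cancel() async throws
    func reset() async throws
}

// Test double that records the order of lifecycle calls.
final class RecordingEngine: EngineLifecycle {
    private(set) var calls: [String] = []
    private(set) var isBusy = true  // pretend a generation is in flight

    func cancel() async throws {
        calls.append("cancel")
        isBusy = false
    }

    func reset() async throws {
        // Resetting while busy would race with the in-flight generation.
        precondition(!isBusy, "reset() called while a generation is in flight")
        calls.append("reset")
    }
}

// Hypothetical wrapper that mirrors the described ordering: cancel(), then reset().
struct ModelSession {
    let engine: any EngineLifecycle

    func reset() async throws {
        try await engine.cancel()
        try await engine.reset()
    }
}

let recordingEngine = RecordingEngine()
try await ModelSession(engine: recordingEngine).reset()
```

Calling `cancel()` first means `reset()` never observes a busy engine, which is the clean state transition the description refers to.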

carinapeng · 2026-06-12T21:58:51Z

-    private func drain() {
+    public var isBusy: Bool { generating.withLock { $0 } }
+
+    public func cancel() async throws {


My understanding is that cancel() in the static engine has a slightly different meaning than in the pipelined engine, and the sequential engine shares the static engine's definition.

The pipelined engine cancels a real background Task, so cancel() actually stops the work. The static and sequential engines have no background Task: generation runs inside the consumer's next() calls, so generating only becomes false when next() runs and sees that cancel was requested.

So cancel() here effectively means: we ask the consumer to stop if it keeps pulling. Should we restructure the static/sequential path so cancellation doesn't depend on the consumer making progress?

The main problem I see is that cancelRequested is only checked inside next(), so cancel() means something different in the pipelined engine than in the other engines.

To make this more concrete: if a consumer is paused and not calling next(), the check never runs, so cancel() in that case ends up throwing a timeout.
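The concern can be reproduced with a minimal pull-driven sketch. Everything here is an assumption made for illustration: `PullDrivenEngine`, `PullIterator`, and `requestCancel()` are hypothetical, the flag is guarded by an `NSLock` rather than whatever lock type the PR uses, and the real `cancel()` is `async throws` while this version is synchronous to keep the example small.

```swift
import Foundation

// Hypothetical pull-driven engine: work happens inside next(),
// so there is no background Task that cancel could stop.
final class PullDrivenEngine {
    private let lock = NSLock()
    private var generating = false
    private var cancelRequested = false

    var isBusy: Bool {
        lock.lock(); defer { lock.unlock() }
        return generating
    }

    func start(count: Int) -> PullIterator {
        lock.lock(); defer { lock.unlock() }
        generating = true
        cancelRequested = false
        return PullIterator(engine: self, remaining: count)
    }

    // Only records the request; nothing changes until the consumer pulls again.
    func requestCancel() {
        lock.lock(); defer { lock.unlock() }
        cancelRequested = true
    }

    // Called from next(): the only place the cancel flag is observed.
    fileprivate func finishIfNeeded(exhausted: Bool) -> Bool {
        lock.lock(); defer { lock.unlock() }
        if cancelRequested || exhausted {
            generating = false
            return true
        }
        return false
    }
}

struct PullIterator: IteratorProtocol {
    fileprivate let engine: PullDrivenEngine
    fileprivate var remaining: Int

    mutating func next() -> Int? {
        if engine.finishIfNeeded(exhausted: remaining == 0) { return nil }
        remaining -= 1
        return remaining
    }
}

let pullEngine = PullDrivenEngine()
var pullIterator = pullEngine.start(count: 3)
_ = pullIterator.next()
pullEngine.requestCancel()
let busyBeforePull = pullEngine.isBusy  // consumer has not pulled since the request
_ = pullIterator.next()
let busyAfterPull = pullEngine.isBusy   // the pull observed the flag
```

In this sketch the engine stays busy after the cancel request until the consumer calls `next()` again, so a caller that waits for the engine to go idle would wait indefinitely if the consumer is paused.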

Yes, that is a valid concern. The iterator and the engine are somewhat decoupled, so it might make sense to cancel on the engine side and have the iterator become invalid.

Added a Token concept: only the iterator that holds the token talks to the engine.
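One way such a token could work is sketched below. This is not the PR's implementation: the shape of `GenerationToken`, the `TokenGatedEngine` and `GatedIterator` types, and the synchronous `cancel()` are all assumptions chosen to show the idea that the engine invalidates the token itself and a stale iterator simply stops producing values.

```swift
import Foundation

// Hypothetical token identifying one generation run.
struct GenerationToken: Equatable {
    fileprivate let id: UInt64
}

final class TokenGatedEngine {
    private let lock = NSLock()
    private var nextID: UInt64 = 0
    private var active: GenerationToken?  // nil when idle

    var isBusy: Bool {
        lock.lock(); defer { lock.unlock() }
        return active != nil
    }

    func start(count: Int) -> GatedIterator {
        lock.lock(); defer { lock.unlock() }
        nextID += 1
        let token = GenerationToken(id: nextID)
        active = token  // a new run supersedes any older token
        return GatedIterator(engine: self, token: token, remaining: count)
    }

    // Engine-side cancel: takes effect immediately, with no consumer progress needed.
    func cancel() {
        lock.lock(); defer { lock.unlock() }
        active = nil
    }

    // Allows a step only for the iterator that holds the active token.
    fileprivate func step(for token: GenerationToken, remaining: Int) -> Bool {
        lock.lock(); defer { lock.unlock() }
        guard active == token else { return false }  // stale or cancelled
        if remaining == 0 {
            active = nil  // the run finished normally
            return false
        }
        return true
    }
}

struct GatedIterator: IteratorProtocol {
    fileprivate let engine: TokenGatedEngine
    fileprivate let token: GenerationToken
    fileprivate var remaining: Int

    mutating func next() -> Int? {
        guard engine.step(for: token, remaining: remaining) else { return nil }
        remaining -= 1
        return remaining
    }
}

let gatedEngine = TokenGatedEngine()
var staleIterator = gatedEngine.start(count: 3)
gatedEngine.cancel()
let idleAfterCancel = !gatedEngine.isBusy  // no next() call was needed
let staleResult = staleIterator.next()     // the old iterator is now invalid
```

With this shape, `isBusy` flips on the engine side as soon as `cancel()` returns, and an iterator from a cancelled run cannot disturb a later run because its token no longer matches.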

stikves force-pushed the sukru/engine-cancel-api branch from 4e5d422 to f1a8bf3 Compare June 12, 2026 20:11

stikves requested review from alejandro-isaza, carinapeng, kevchengcodes and tjia1818 June 12, 2026 20:16

stikves self-assigned this Jun 12, 2026

kevchengcodes reviewed Jun 12, 2026

Comment thread swift/Sources/CoreAILanguageModels/InferenceEngines/CoreAIPipelinedEngine.swift Outdated

Comment thread swift/Sources/CoreAILanguageModels/InferenceEngines/CoreAISequentialEngine.swift Outdated

stikves force-pushed the sukru/engine-cancel-api branch from 1fd5faa to 50386a3 Compare June 12, 2026 21:23

carinapeng reviewed Jun 12, 2026

stikves force-pushed the sukru/engine-cancel-api branch 3 times, most recently from 342fd81 to f666779 Compare June 12, 2026 22:40

Add cancel() and isBusy with GenerationToken pattern

1a754e1

stikves force-pushed the sukru/engine-cancel-api branch from f666779 to 1a754e1 Compare June 12, 2026 22:50

stikves and others added 2 commits June 12, 2026 15:53

Format changes

0e78f84

Merge branch 'main' into sukru/engine-cancel-api

3267828

stikves wants to merge 3 commits into apple:main from stikves:sukru/engine-cancel-api
