Recording stuck 'active' (never finalized) when egress ends out-of-band (e.g. room access change) → room permanently 409-locked

## Summary

A recording's `Recording` row stays `status=active` indefinitely (never transitions to `stopped`/`saved`) when the egress is terminated **out-of-band** — i.e. by anything other than the `stop-recording` flow. The clearest trigger: **changing the room access while recording**, which stops the LiveKit egress directly. After this, the room is effectively locked — `POST /api/v1.0/rooms/{id}/start-recording/` returns **409 Conflict** ("already recording") and never recovers without a manual DB edit.

## Steps to reproduce

1. Start a recording in a room (observed with `mode=transcript`, but the lifecycle is mode-independent).
2. **Change the room's access** while it is recording (this stops the egress).
3. LiveKit fires `egress_ended` → meet receives it (`POST /api/v1.0/rooms/webhooks-livekit/` → 200), **but the `Recording` row stays `active`.**
4. Try to record in that room again → **409 Conflict**, permanently.

### Contrast — the normal paths work

| End method | Recording row |
|---|---|
| Stop button / leave room (→ `stop-recording`) | `stopped` ✅ |
| Change room access (egress stops out-of-band) | stays `active` ❌ → 409 lock |

## Expected

Finalize the recording (`active` → `stopped`/`saved`) on the `egress_ended` webhook **regardless of how the egress ended**, not only via the `stop-recording` request — or add a reconciliation/janitor for `active` recordings whose egress has already ended. As-is, a single access change permanently 409-locks the room and leaks `active` rows.

## Notes

- meet **does** receive the egress webhook at the moment it ends (200 response), so the signal is available — it just isn't used to finalize the recording.
- Minor, likely related: when the egress ends this way, stopping the metadata-collector agent dispatch logs `TwirpError(code=unavailable, "no response from servers", 503)` because the agent already self-exited on participant disconnect — a no-op stop on a finished job. Worth guarding so it doesn't surface as an error.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Recording stuck 'active' (never finalized) when egress ends out-of-band (e.g. room access change) → room permanently 409-locked #1448

Summary

Steps to reproduce

Contrast — the normal paths work

Expected

Notes

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

End method	Recording row
Stop button / leave room (→ `stop-recording`)	`stopped` ✅
Change room access (egress stops out-of-band)	stays `active` ❌ → 409 lock

Uh oh!

Recording stuck 'active' (never finalized) when egress ends out-of-band (e.g. room access change) → room permanently 409-locked #1448

Description

Summary

Steps to reproduce

Contrast — the normal paths work

Expected

Notes

Metadata

Metadata

Assignees

Labels

Type

Fields

Projects

Milestone

Relationships

Development

Issue actions