Skip to content

[Tracker] 2026-06-10 issue triage — 26 open #263

@jakebromberg

Description

@jakebromberg

Cross-repo rollup: items from this tracker that block other repos or carry external deadlines are mirrored in Cross-repo critical path (org Project #33).

End-of-day triage of all 26 open issues in WXYC/discogs-etl. Intended as the entry point for any contributor or agent picking up after 2026-06-10. Supersedes #238 (2026-05-29). Updated 2026-06-10 — original 2026-06-03 snapshot has been refreshed in place; what changed since is summarised in the section below.

What's changed since 2026-06-03

P0 sequencing — recommended order

  1. Close-out [Epic I] Artwork-NULL ambiguity #247 (Epic I — Artwork-NULL ambiguity). Every Scope row is ✅ as of 2026-06-07. Either back-publish the post-drain artwork_null_share measurement and close, or close now and file a thin follow-up specifically for the published-baseline record. Cheapest piece of outstanding work in the queue and removes the highest-noise stale tracker.
  2. Repair or close + re-file the four stub-body issues (check_cache_drift.py: assert artist_name_variation populated + release_track_artist.extra column present #226 / rebuild-cache.sh hygiene: non-zero exit on wait_for_postgres timeout + mask DSN in import_csv.py log output #227 / rebuild-cache.sh: fetch + process all four Discogs dump files (artists, labels, masters in addition to releases) #228 / [Epic] Expand monthly cache rebuild to ingest the full Discogs dump (releases + artists + labels + masters) #229). Now 21 days post-filing without repair. [Epic] Expand monthly cache rebuild to ingest the full Discogs dump (releases + artists + labels + masters) #229 in particular is materially smaller post-jumpstart (labels+masters only); a clean re-file with the post-Manual jumpstart rebuild to land #497 fields ahead of 2026-07-04 cron #267 framing is probably less work than reconstructing the original intent. Per-issue status comments dated 2026-06-10 spell out the changes.
  3. rebuild-cache-bootstrap.sh: skip the discogs-xml-converter clone+build when prebuilt download will succeed #275 — move the discogs-xml-converter clone+build out of rebuild-cache-bootstrap.sh to lean on the prebuilt-download path. Low-priority but mechanically straightforward. ~2-5 min off every cold start, no correctness impact.
  4. Resolve release_artist.role schema drift: drop the column or wire the converter #210 — execute Option A (drop release_artist.role). One alembic migration plus mirror in schema/create_database.sql:77-83. Decision input was complete in [Tracker] 2026-06-10 issue triage — 26 open #263's 2026-06-03 cut; remaining work is mechanical.
  5. Pick DuckDB + extension versions; create plans/duckdb-version-pin.md in WXYC/wiki #189 — pick DuckDB + extension versions and create WXYC/wiki/plans/duckdb-version-pin.md. First-mover prerequisite for the entire Epic: rewrite verify_cache.py against DuckDB ATTACH (Project 2 candidate 1) #186 epic chain (Rewrite scripts/verify_cache.py against DuckDB ATTACH #190Parity test: rewritten verify_cache.py vs original on fixture corpus #191Update README.md and CLAUDE.md for the new DuckDB-based pattern #192Production smoke-test against full prod-shaped corpus #193). Cheap, unblocks four downstream issues.
  6. Add PG integration test for populate_cache_metadata's ON CONFLICT race-tolerance #207 + Add integration test for release_artist import with the converter's actual CSV shape #208 + Add PG integration test for the NOT VALID FK race-tolerance pattern #212 — three follow-up PG integration tests for the recent race-tolerance fixes. (Note Add PG integration test for populate_cache_metadata's ON CONFLICT race-tolerance #207 closed 2026-06-07; Add integration test for release_artist import with the converter's actual CSV shape #208 + Add PG integration test for the NOT VALID FK race-tolerance pattern #212 remain.)
  7. PR test: bump charset-torture corpus pin to v0.12.0 #150 — rebase against current main and merge; CI is all green and the original gating dep (wxyc-shared v0.12.0) is long-resolved.

Critical / high severity

Cross-cluster conflicts (decide once before starting)

DuckDB rewrite vs. verify_cache hardening (carried)

#186's epic rewrites the entire scripts/verify_cache.py (75 KB / 1854 LOC currently). The recent column-loss fixes (#232/#234, closed) were targeted patches on that same script. If the rewrite lands, those patches don't migrate automatically. Decision still needed before #190 starts: does the parity test in #191 assert the same column-preservation invariants those patches were defending? If yes, fold into parity. If no, file a follow-up to re-derive them in the DuckDB shape.

Full-dump expansion (#229) vs. ANV ask — RESOLVED

The LML#356 ANV ask that was previously called out as natural-fit-inside-#229 closed 2026-06-07 as dup of LML#497. The underlying gap was already shipped by the jumpstart rebuild's converter+loader work. #229's residual scope is labels+masters only; the ANV thread is closed end-to-end.

release_artist.role decision (#210) — RESOLVED at decision-input level (carried)

Cross-repo grep + converter inspection done 2026-06-03; no consumer references the column. Mechanical work remaining: ship Option A. Not really a conflict anymore.

Obsolete / needs revision (recommend close)

Stub-body issues — bodies failed to interpolate at filing (carried, 21 days uninterpolated)

Cross-repo dependencies (encoded both natively and in body prose)

Issue Blocked by Method State
#190 #189 native blockedBy OPEN
#191 #190 native blockedBy OPEN
#192 #190 native blockedBy OPEN
#193 #191 native blockedBy OPEN
#228 #226 (#222 closed; stale link) native blockedBy OPEN
#186 (epic) sub-issues #189-#193 sub-issue relationships OPEN
#229 (epic) sub-issues #226/#227/#228 sub-issue relationships OPEN

Cross-repo runtime risks

Cross-repo runtime risks — RESOLVED since 2026-06-03

Open PR in flight

  • PR test: bump charset-torture corpus pin to v0.12.0 #150 — bump charset-torture corpus pin to v0.12.0. Author: jakebromberg; created 2026-05-05. CI all green (lint, drift-check, test, pg, marker-sync). mergeStateStatus: DIRTY — needs rebase against current main. Gating dep (wxyc-shared v0.12.0) is long-resolved. Rebase + merge.

Cluster map (26 open)

Active cross-cutting projects this repo participates in

How to find work

  • All blocked / sequenced issues carry a "## Blocked by" body section and a native blockedBy relationship (rendered under the "Relationships" pill in the GitHub UI).
  • Per-issue triage commentary is on each issue's page (look for "Triage finding" / "Status update" / "Sequencing concern" / "Recommend closing" / "Stub-body finding" comments dated 2026-06-03 and 2026-06-10).
  • Open PRs in flight: test: bump charset-torture corpus pin to v0.12.0 #150 (charset-torture corpus pin; CI green, needs rebase).
  • Supersedes tracker [Tracker] 2026-05-29 issue triage — 24 open #238 (2026-05-29).

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions