Skip to content

ci: fail release if any expected artifacts are missing#16

Open
kenvandine wants to merge 1 commit into
lemonadefrom
fix/validate-release-artifacts
Open

ci: fail release if any expected artifacts are missing#16
kenvandine wants to merge 1 commit into
lemonadefrom
fix/validate-release-artifacts

Conversation

@kenvandine

Copy link
Copy Markdown
Member

Summary

Part of lemonade-sdk/lemonade#2163

The previous "Check release artifacts" step only verified that at least one artifact existed. This meant a release could be published with missing CUDA builds — as happened with b9549, which was missing all 7 Windows CUDA artifacts and ubuntu-cuda-sm_90-x64.

Changes

The check now explicitly enumerates every required artifact and fails with actionable error messages before a release is created:

  • All 7 CUDA SM variants (sm_75, sm_80, sm_86, sm_89, sm_90, sm_100, sm_120) for each platform:
    • Ubuntu x64 (.tar.xz)
    • Ubuntu arm64 (.tar.xz)
    • Windows x64 (.7z)
  • Ubuntu ROCm (.tar.gz)
  • Ubuntu OpenVINO (.tar.gz)
  • Windows ROCm (.zip)
  • Windows CPU (.zip)

If any artifact is absent, the step fails and lists exactly which files are missing — so a partially-populated release can never be tagged.

The previous check only verified that at least one artifact existed.
This meant a release could be published with missing CUDA builds
(as happened with b9549, which was missing all Windows CUDA artifacts
and ubuntu-cuda-sm_90-x64).

The updated check explicitly enumerates every expected artifact:
- All 7 CUDA SM variants (sm_75/80/86/89/90/100/120) for each of:
  ubuntu x64, ubuntu arm64, windows x64
- ubuntu ROCm, ubuntu OpenVINO, Windows ROCm, Windows CPU

If any are absent the step fails before the release is created,
so a partially-populated release can never be tagged.

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

@fl0rianr fl0rianr left a comment

Copy link
Copy Markdown

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't think this should be the final fix. The artifact manifest check is useful, but this makes releases all-or-nothing: a transient Windows CUDA failure now blocks the entire release.

I'd prefer this as a first step only if we also add a recovery path: publish available artifacts with a clear missing-artifacts list, then allow a manual backfill workflow to rebuild/upload a missing sm_* target for the same tag.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants