GitHub Action — Provenance on Every Release
Generate a DBOM for any data file with a single command — then automate it in CI.
The Problem
Your team releases data files every sprint — updated models, refreshed training sets, new configuration bundles. Each release should include provenance metadata: where the data came from, who signed off, what hash it had at release time. But provenance is a manual step, and manual steps get skipped. By the third sprint, nobody remembers to generate the attestation file. By the sixth sprint, the compliance team notices.
The real cost isn't the audit finding — it's the scramble to retroactively reconstruct provenance for files that have already shipped. You're reverse-engineering hash values from old CI logs and guessing at signer identities from git blame. A five-second automation problem has become a five-day archaeology project.
What You Will See
Run It
What Else This Handles
- Automated DBOM generation on every git push or release tag
- Integration with existing CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins)
- Batch DBOM generation for entire directories of data files
- Custom signer identity from CI environment variables or OIDC tokens