Self-hosted clone of https://github.com/openova-io/openova (post-cutover, standalone)

Go to file

hatiyildiz b0c1c07271 fix(bp-flux): align upstream flux2 version with cloud-init's flux install (no double-install destruction) Live verified on omantel.omani.works (2026-04-29). bp-flux:1.1.1 shipped the fluxcd-community `flux2` subchart at 2.13.0 (= upstream Flux appVersion 2.3.0). Cloud-init pre-installed Flux core at v2.4.0 via `https://github.com/fluxcd/flux2/releases/download/v2.4.0/install.yaml`. helm-controller's reconcile of bp-flux ran `helm install` on top of the running v2.4.0 Flux; the chart's v2.3.0 CRD update failed apiserver admission with `status.storedVersions[0]: Invalid value: "v1": must appear in spec.versions`; Helm rolled back; the rollback DELETED every running Flux controller Deployment (helm-controller, source-controller, kustomize-controller, image-automation-controller, image-reflector-controller, notification-controller). The cluster lost its GitOps engine — no further HelmRelease could progress, and the only recovery was full `tofu destroy` + reprovision. This is OPTION C of the architectural fix proposed in the incident memo: version-align cloud-init's flux2 install with the bp-flux umbrella chart's `flux2` subchart so a single upstream Flux release is installed and helm-controller adopts it on first reconcile rather than reinstalls on top with a different version. Changes: * `infra/hetzner/cloudinit-control-plane.tftpl` — kept the install.yaml URL pinned at v2.4.0 (deliberate; this is the source of truth) and added the CRITICAL VERSION-PIN INVARIANT comment block documenting the failure mode. * `platform/flux/chart/Chart.yaml` — bumped `flux2` subchart dep from 2.13.0 to 2.14.1. The community chart 2.14.1 carries appVersion 2.4.0, matching cloud-init exactly. Bumped chart version 1.1.1 -> 1.1.2. * `platform/flux/chart/values.yaml` — `catalystBlueprint.upstream .version` mirror of the dep pin moved from 2.13.0 to 2.14.1. * `clusters/_template/bootstrap-kit/03-flux.yaml` and `clusters/omantel.omani.works/bootstrap-kit/03-flux.yaml` — bumped bp-flux HelmRelease to 1.1.2 + added explicit `install.disableTakeOwnership: false`, `upgrade.disableTakeOwnership: false`, and `upgrade.preserveValues: true` so helm-controller adopts the cloud-init-installed Flux objects rather than rolling back on ownership conflict. * `products/catalyst/chart/Chart.yaml` — bumped bp-catalyst-platform umbrella 1.1.1 -> 1.1.2, with bp-flux dep bumped to 1.1.2. * `clusters/_template/bootstrap-kit/13-bp-catalyst-platform.yaml` and `clusters/omantel.omani.works/bootstrap-kit/13-bp-catalyst-platform.yaml` — bumped HelmRelease to 1.1.2. * `platform/flux/chart/tests/version-pin-replay.sh` — NEW. Six-case catastrophic-failure replay test: Case 1: Chart.yaml declares the flux2 subchart with explicit version. Case 2: cloud-init pins flux2 install.yaml to an explicit v-tag. Case 3: chart's flux2 subchart appVersion equals cloud-init's pinned upstream version (the load-bearing invariant). Case 4: values.yaml metadata mirrors the Chart.yaml dep pin. Case 5: helm template renders cleanly + contains the four core Flux controllers. Case 6: replay test rejects a planted mismatched fake Chart.yaml (the gate's own self-test — proves the gate works). All six cases green locally; the new test joins the existing observability-toggle test in tests/. * `docs/RUNBOOK-PROVISIONING.md` — new section "bp-flux double-install — version-pin invariant" documenting the failure mode, the four pin-sites, the safe bump procedure, and the existing-Sovereign recovery path (full reprovision). Existing Sovereigns running 1.1.1: no in-place recovery is possible once the rollback has fired. Reprovision required against 1.1.2. Per docs/INVIOLABLE-PRINCIPLES.md #3 (architecture as documented) + #4 (never hardcode) — the version pins remain operator-bumpable via PR, but BOTH cloud-init's URL AND the chart's subchart MUST move together in the same PR; CI gate tests/version-pin-replay.sh enforces this. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>		2026-04-29 19:38:17 +02:00
.claude	docs(iter-1): add IMPLEMENTATION-STATUS, fix wrong-org refs, reconcile monorepo	2026-04-27 20:43:31 +02:00
.github	fix(bp-*): observability toggles default false — break circular CRD dependency	2026-04-29 18:08:09 +02:00
.playwright-mcp/admin-evidence	docs(wizard): Playwright evidence for sovereign admin landing + per-app tabs	2026-04-29 17:31:02 +02:00
clusters	fix(bp-flux): align upstream flux2 version with cloud-init's flux install (no double-install destruction)	2026-04-29 19:38:17 +02:00
core	feat(wizard): #169 — StepDomain three-mode (pool / byo-manual / byo-api)	2026-04-29 09:01:07 +02:00
docs	fix(bp-flux): align upstream flux2 version with cloud-init's flux install (no double-install destruction)	2026-04-29 19:38:17 +02:00
infra/hetzner	fix(bp-flux): align upstream flux2 version with cloud-init's flux install (no double-install destruction)	2026-04-29 19:38:17 +02:00
platform	fix(bp-flux): align upstream flux2 version with cloud-init's flux install (no double-install destruction)	2026-04-29 19:38:17 +02:00
products	fix(bp-flux): align upstream flux2 version with cloud-init's flux install (no double-install destruction)	2026-04-29 19:38:17 +02:00
scripts	docs(ops): comprehensive operator runbook + remediation playbook + idempotent recovery script	2026-04-29 19:26:29 +02:00
tests	merge: cloud-init creates ghcr-pull secret durable + GHCR token pipeline	2026-04-29 18:08:32 +02:00
.gitignore	feat(wizard): admin landing = app card grid; per-app page with logs/prereqs/status/overview tabs (replaces DAG)	2026-04-29 17:23:59 +02:00
CLAUDE.md	docs(component-count): update 53 → 56 anchors after Pass 105 (spire + nats-jetstream + sealed-secrets)	2026-04-28 13:48:24 +02:00
README.md	docs(reconcile-pass-2): align docs with ground truth at `6afdb303`	2026-04-29 11:48:57 +02:00

README.md

OpenOva Catalyst

A self-sufficient Kubernetes-native platform. Published as signed OCI Blueprints. Deployable as your own Sovereign.

Catalyst is the open-source platform built by OpenOva. It turns any Kubernetes cluster into a Sovereign: a self-contained control plane that hosts Organizations, Environments, and Applications via GitOps + Crossplane, with a unified UI/Git/API for users.

Documentation

Document	What it covers
`docs/GLOSSARY.md`	Canonical terminology — read first
`docs/ARCHITECTURE.md`	Catalyst architecture overview
`docs/IMPLEMENTATION-STATUS.md`	What's built today vs what's design-only — read second
`docs/NAMING-CONVENTION.md`	Naming patterns for every resource type
`docs/PERSONAS-AND-JOURNEYS.md`	Personas × journeys matrix; surfaces
`docs/SECURITY.md`	Identity (SPIFFE + Keycloak), secrets (OpenBao + ESO), rotation, multi-region semantics
`docs/SOVEREIGN-PROVISIONING.md`	How to bring a Sovereign online
`docs/BLUEPRINT-AUTHORING.md`	Writing Blueprints (incl. Crossplane Compositions)
`docs/PLATFORM-TECH-STACK.md`	Every component's role in Catalyst
`docs/SRE.md`	Operating a Sovereign
`docs/BUSINESS-STRATEGY.md`	Product strategy and GTM
`docs/TECHNOLOGY-FORECAST-2027-2030.md`	Component forecast 2027–2030
`docs/VALIDATION-LOG.md`	Trail of doc-integrity validation passes (audit log)

Heads-up before reading further: the architecture docs in this repo describe Catalyst's target state. Significant portions are not yet implemented — see docs/IMPLEMENTATION-STATUS.md for what exists today vs what is design.

The model in 60 seconds

OpenOva (the company) publishes Catalyst (the platform).
A deployed Catalyst is called a Sovereign.

A Sovereign has:
  - Organizations (multi-tenancy unit)
  - Environments (org-scoped, env-typed: prod/stg/uat/dev/poc)
  - Applications (installed Blueprints)
  - Blueprints (the App Store catalog — public + Org-private)

Users install Applications from Blueprints into Environments.
Blueprints can depend on Blueprints (arbitrary depth).
Each Environment is one Gitea repo + one or more vclusters.
Every state change is a Git commit.
Every UI surface reads from a single CQRS projection.

Same code runs in every Sovereign:
  - openova         (run by us; SaaS Organizations)
  - omantel         (run by Omantel; SME Organizations across Oman)
  - bankdhofar      (run by the bank; internal Organizations)
  - your-company    (run by you, on infrastructure you choose)

See docs/GLOSSARY.md for every term, docs/ARCHITECTURE.md for the full picture.

What's in this repo

openova/
├── core/              # Catalyst control-plane application (Go) — design-stage; mostly placeholders today
├── platform/          # Component Blueprint folders (one folder per upstream OSS project)
├── products/          # Composite Blueprint folders OpenOva publishes
│   ├── catalyst/      # The Catalyst control plane itself, target umbrella Blueprint
│   ├── cortex/        # AI Hub (LLM serving, RAG, AI safety)
│   ├── axon/          # SaaS LLM Gateway (default upstream for Cortex)
│   ├── fingate/       # Open Banking (PSD2/FAPI sandbox)
│   ├── fabric/        # Data & Integration (event-driven + lakehouse)
│   └── relay/         # Communication (email, video, chat, WebRTC)
│                      # (specter and exodus are deliverable services, not Blueprints in this layout)
└── docs/              # Platform documentation

Each folder under platform/ and products/ is the source of one Blueprint, published from CI as a signed OCI artifact at ghcr.io/openova-io/bp-<name>:<semver> (the bp- prefix is added to the OCI artifact name; folder names stay short). Per-folder isolation is provided at the OCI artifact layer, not the Git repo layer — this is a monorepo with per-Blueprint fan-out, not a meta-repo of separate Git repositories. See docs/BLUEPRINT-AUTHORING.md §2 for the folder layout contract.

Today, the 12-component bootstrap kit (cilium, cert-manager, flux, crossplane, sealed-secrets, spire, nats-jetstream, openbao, keycloak, gitea, powerdns + the bp-catalyst-platform umbrella under products/catalyst/) ships with full chart/ + blueprint.yaml per docs/IMPLEMENTATION-STATUS.md §7, plus products/axon/ and the external-dns leaf chart. The remaining 45 platform components and the cortex / fabric / fingate / relay product folders are design-stage — README only — until each lands its Blueprint manifest, chart, Compositions, and CI fan-out.

Stack at a glance

Layer	Technology
Container runtime	k3s (k8s-conformant), containerd
CNI / Service Mesh	Cilium (eBPF mTLS, L7 policies, Gateway API)
GitOps	Flux (per-vcluster, lightweight)
Git	Gitea (per-Sovereign, hosts Blueprint mirror + per-Environment repos)
IaC for non-K8s	Crossplane (the only IaC; not user-facing)
Bootstrap IaC	OpenTofu (one-shot, archived after Phase 0)
Multi-tenancy	vcluster (one per Organization per host cluster)
Identity (workloads)	SPIFFE/SPIRE (5-min rotating SVIDs, mTLS everywhere)
Identity (users)	Keycloak (per-Org for SME, per-Sovereign for corporate)
Secrets	OpenBao (Apache 2.0; independent Raft per region, no stretched cluster) + External Secrets Operator
Event spine	NATS JetStream (Apache 2.0; pub/sub + KV; per-Org accounts)
TLS	cert-manager + Let's Encrypt or corporate CA
Policy	Kyverno
Supply chain	cosign (Sigstore), Syft + Grype SBOM, Trivy scans
Runtime security	Falco (eBPF)
Observability	OpenTelemetry → Grafana stack (Alloy + Loki + Mimir + Tempo)
WAF	Coraza (OWASP CRS)
DNS	PowerDNS authoritative per Sovereign zone + DNSSEC + lua-records (`ifurlup`, `pickclosest`); pool-domain-manager allocates pool subdomains and flips parent-zone NS via registrar adapters (Cloudflare / Namecheap / GoDaddy / OVH / Dynadot) — see `docs/MULTI-REGION-DNS.md`, `docs/PLATFORM-POWERDNS.md`
Backup	Velero (to SeaweedFS, which routes the cold tier to cloud archival S3)
Container registry	Harbor

For the full component list and trends see docs/PLATFORM-TECH-STACK.md and docs/TECHNOLOGY-FORECAST-2027-2030.md.

Cloud providers

Provider	Status
Hetzner Cloud	Available (most-tested path)
AWS / GCP / Azure	Crossplane providers available; full path coming
Oracle Cloud (OCI)	Crossplane provider available; full path coming
Huawei Cloud	Crossplane provider available; full path coming

All providers reach Catalyst via the same Crossplane abstraction; Sovereign provisioning details per provider are in docs/SOVEREIGN-PROVISIONING.md.

Getting started

Try it (managed)

Visit marketplace.openova.io to install Applications on the openova Sovereign without any infrastructure setup. SaaS journey for SMEs and evaluations.

Run your own Sovereign

1. Provision via catalyst-provisioner.openova.io (managed bootstrap), OR
2. Self-host bp-catalyst-provisioner in your own infrastructure (air-gap path).

Then follow the procedure in docs/SOVEREIGN-PROVISIONING.md.

Build a Blueprint

See docs/BLUEPRINT-AUTHORING.md. A Blueprint is a folder under platform/<name>/ (or products/<name>/) in this monorepo containing blueprint.yaml + manifests (Helm chart or Kustomize base) + (optional) Crossplane Compositions. CI signs each folder's contents and publishes to OCI as ghcr.io/openova-io/bp-<name>:<semver>. Catalyst's blueprint-controller picks it up automatically. Org-private Blueprints follow the same shape inside per-Sovereign Gitea repos.

License

All Blueprints and the Catalyst control plane are open source. Each component carries its own upstream license (typically Apache 2.0, MPL 2.0, or BSD-3); see each component's LICENSE file.

OpenOva charges for support, managed operations, and expert services — never for access to code. See docs/BUSINESS-STRATEGY.md §10.

Contributing

PRs welcome. The contribution path for Blueprints (including Crossplane Compositions) is documented in docs/BLUEPRINT-AUTHORING.md §13. Issues and discussions on GitHub.

Cloud-native is the foundation. Catalyst is how you operate it.

README.md Unescape Escape