AI Governance · Cornerstone
What the Voluntary AI Safety Standard's 10 guardrails actually mean in practice — and how to implement each one without paralysing the AI work that's already moving.
Quantum Associates — Quantum Associates
· 13 min read
In September 2024 the Department of Industry, Science and Resources published the Voluntary AI Safety Standard — ten guardrails Australian organisations are expected to apply when developing or deploying AI systems. Eighteen months on, the framing of “voluntary” is doing increasingly thin work. Federal agencies are referencing it in procurement. APRA-regulated entities are mapping internal AI controls to it. Boards are asking management for an alignment statement. Several industry bodies have built it into their member guidance. Treating it as a real obligation is now the prudent move.
This guide is what we actually do with the Standard inside engagements. It maps each of the ten guardrails to the practical implementation work, names the most common failure modes, and points at the documentation patterns that survive an internal audit. It is opinionated where the Standard is silent and quiet where the Standard is specific.
The Voluntary AI Safety Standard is process-focused, not technical. It doesn’t tell you which model to use, what accuracy thresholds to hit, or which architectures pass muster. It tells you how to manage the lifecycle of AI inside your organisation — governance, accountability, transparency, testing, monitoring, communication, contestability, recordkeeping, supplier engagement, stakeholder engagement.
That’s actually useful. The technical realities of AI move too fast for a regulator to keep up; the lifecycle-management practices around them move much more slowly. Most of the Standard’s ten guardrails would have been recognisable to a 1990s IT-governance professional with the terminology swapped.
What this means in practice: implementing the Standard is mostly a documentation, communication, and process exercise. You don’t need to rebuild your AI architecture to comply. You need to ensure the lifecycle around it is governed.
The single most useful thing you can do for AI governance is name the person accountable for AI inside the organisation. Not a committee. Not a “responsible AI function.” A person whose name appears in the engagement letter when AI work is commissioned, who signs off on the AI risk register, who fronts the board on AI questions.
In practice this is usually the CIO, CTO, or in larger organisations a dedicated Chief AI Officer reporting to one of those. Smaller organisations consolidate it with the CRO or COO. The role matters less than the named accountability.
The “publish” element of Guardrail 1 is the part most organisations underweight. The accountability process needs to be discoverable internally — staff should know who to contact about an AI question — and the organisation’s external AI posture (if it has one) should be visible. A two-page AI accountability statement on the intranet is the minimum viable baseline.
This is the AI risk register and the controls around it. The risk register lives next to your existing risk register (in the GRC platform of your choice, or in a spreadsheet for smaller organisations). It identifies AI-specific risks — bias, drift, hallucination, prompt injection, data leakage, model availability, regulator exposure, reputational risk — and maps each to:
The Standard doesn’t prescribe the format. We typically use a five-by-five risk matrix consistent with the organisation’s existing risk methodology, because adoption is faster when the AI register looks like the rest of the registers.
This is the security and data-protection guardrail. For organisations already running mature information security and data governance functions, the AI-specific additions are usually:
For most organisations this isn’t a separate program of work — it’s an extension of existing IT security policy. The Standard’s contribution here is to force the explicit conversation; the controls themselves are mostly already familiar.
The evaluations and observability guardrail. In most organisations this is genuinely new work — AI systems require evaluation patterns that don’t map cleanly to traditional software testing.
Three layers, in order of priority:
The Standard names this guardrail well; we’d argue it should be the highest-priority of all ten because failures here cascade into every other guardrail. Without evaluations, “risk management” is theatre.
The human-in-the-loop guardrail. For low-stakes AI (internal-productivity tools, document drafting) the human is the user reviewing the output, and Guardrail 5 is satisfied trivially. For high-stakes AI (decisions affecting customers, regulated outcomes) the design needs to make the human’s intervention point explicit and meaningful.
Common patterns:
Designing this guardrail in late is much harder than designing it in early. We typically have the human-control conversation in week one of any engagement, because the architecture decisions cascade from there.
The transparency guardrail. The simplest, most-skipped guardrail. If an AI is making or materially influencing a decision that affects a person, the person should know.
In practice:
We don’t see organisations get penalised for using AI; we see them get penalised for not disclosing they’re using AI. The disclosure cost is near-zero and the trust dividend is real.
The contestability guardrail. The natural extension of Guardrail 6: if you’ve told the person AI is making the decision, they need a way to challenge it. Most organisations already have complaints, dispute, or appeals processes for non-AI decisions; these processes need to explicitly accommodate AI-influenced ones.
What this looks like in practice:
For regulated industries (financial services, insurance, anything touching credit decisioning) the contestability requirements are also driven by sector regulation — APRA, ASIC, AFCA. The Standard’s Guardrail 7 aligns with these obligations; it doesn’t replace them.
The supply-chain guardrail. AI systems are almost always built on third-party components — foundation models from OpenAI / Anthropic / Google / Meta, vector databases, observability tools, evaluation platforms. Each supplier needs to be assessable.
The practical implementation is a supplier-AI questionnaire that goes into procurement. For each AI-related supplier, you need to know:
For mature procurement functions this is just an extension of existing vendor risk assessment. For less-mature functions it’s a new capability — typically the easiest place to start is a one-page supplier-AI assessment template.
The recordkeeping guardrail. The least exciting and most consequential of the ten. Internal audit, external audit, the regulator (when one eventually shows up) will want documentation.
The records the Standard envisions:
We typically structure this as a single AI Governance Pack — a folder of documents per AI system, plus the organisation-level governance docs. The pack travels with the system: an auditor or regulator can pick it up and assess.
The stakeholder-engagement guardrail. For internal AI (productivity tools, internal decisioning) this means consulting the people who will use and be affected by the system before deployment, not after. For external AI (customer-facing) it means consulting representative customers, including from groups likely to be disadvantaged by the system.
In practice this is usually:
This is the guardrail that catches the most rework. Engagement happens late, the feedback contradicts the architecture, the system gets rebuilt. We almost always recommend the engagement happens in the first week of an AI design phase, not the last.
If your organisation has no existing AI governance posture and you’re starting from zero, the fastest defensible path is roughly six weeks of focused work:
That’s roughly the scope of our productised AI Governance Review. The output is an alignment statement against each of the ten guardrails, a risk register, an AI policy, and the board-ready briefing pack.
The Voluntary AI Safety Standard is voluntary today. The federal government has signalled intent to move to a mandatory regime for high-risk AI applications, with consultation underway through 2025–2026. Organisations that align to the voluntary standard now will absorb a future mandatory regime as an evidence-update exercise; organisations that don’t will face a much larger retrofit.
Industry regulators are moving on their own timelines. APRA’s expectations for AI within regulated entities are already substantively aligned with the voluntary standard via CPS 234 (information security) and CPS 230 (operational risk). The OAIC has issued AI guidance grounded in the Privacy Act. ASIC has signalled scrutiny of AI in credit and investment decisioning.
The pattern is consistent: every Australian regulator is treating the voluntary standard as the baseline of professional practice, and adding sector-specific requirements on top. Aligning to the voluntary standard well aligns to most of the sector-specific requirements as a side-effect.
It’s worth being explicit about what the Standard does not require, because over-reading it is the second most common implementation failure (after under-reading).
The Standard does not require:
If your AI program is being slowed by a policy that cites “alignment with the Voluntary AI Safety Standard” as the reason, there’s a good chance the policy is over-reading the Standard. The Standard is permissive about implementation; it’s specific about lifecycle.
After running this work across several engagements, three patterns recur:
Pattern 1: Governance built for AI that doesn’t exist yet. A governance framework drafted in anticipation of future AI use, untested against any real AI system, gets quietly ignored when actual AI work starts. The fix is to draft governance against the actual AI systems already in use (often more than the organisation thinks) — even if those systems are small.
Pattern 2: Disclosure as legal language only. Privacy policies get updated with paragraphs about AI; product UIs stay silent. The disclosure is technically present but not where it would actually inform an affected person. The fix is to design disclosure into the product, not the legal docs.
Pattern 3: Evaluations as one-time. A model gets evaluated at deployment, and never again. Six months later behaviour has drifted and no one knows. The fix is to make evaluations continuous from day one — same evals run on every change, alerts on drift beyond tolerance.
All three are fixable inexpensively if caught early. All three are expensive to fix retrospectively.
If you’re looking to align your organisation to the Voluntary AI Safety Standard, the productised AI Governance Review is the four-week engagement designed for exactly this. It produces the alignment statement, the risk register, the policy, and the board-ready briefing pack.
If you want to talk through your situation first, the discovery call is 30 minutes, no pitch.
Related insights
AI Governance
How AI engagements map to APRA's existing prudential standards — what CPS 234 (information security), CPS 230 (operational risk) and CPG 235 (data risk) require of any AI system you put in production.
AI Governance
The questions a board should ask management about AI. Aligned to the Voluntary AI Safety Standard and the audit-committee-friendly framing AU boards actually need.
Next step
30 minutes, no pitch, no deck — just a working conversation about how this applies to your situation.