Question 1

Which foundation model do you typically recommend?

Accepted Answer

It depends on the use case and the deployment constraints. We use Anthropic Claude as our default starting point because it tends to produce more reliable agentic behaviour and has strong Australian data-residency options via AWS Bedrock. OpenAI GPT and Google Gemini are common alternatives where the use case fits their strengths. We have no commission-based vendor relationships, so every recommendation is based on your constraints.

Question 2

Can you work with on-premise or private-cloud LLM deployments?

Accepted Answer

Yes. For governance-sensitive sectors (defence, parts of healthcare, certain APRA-regulated work), on-prem or VPC-deployed open-weights models are appropriate. We help you select between Llama, Mistral and similar models, and we run the deployment work in partnership with your infrastructure team.

Question 3

How do you measure whether a GenAI system is "working"?

Accepted Answer

Three layers: (1) automated evaluations against a labelled dataset, run on every change; (2) production observability — cost per request, latency at p95, quality drift indicators; (3) the operator metric the system was meant to move (time saved, cases auto-triaged, etc.). All three live on a dashboard you own from day one.
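As a rough illustration of the first two layers, here is a minimal Python sketch, not our production harness. It assumes a classification-style task where exact label match is a fair score; the names `evaluate`, `p95` and `passes_gate`, the nearest-rank percentile method, and the 0.02 tolerance are all hypothetical choices made for the example.

```python
from typing import Callable


def evaluate(predict: Callable[[str], str], dataset: list[tuple[str, str]]) -> float:
    """Return the fraction of labelled examples the system gets right.

    `predict` stands in for the real model call (an assumption for this sketch).
    Labels are compared case-insensitively after trimming whitespace.
    """
    if not dataset:
        raise ValueError("dataset must not be empty")
    correct = sum(
        1
        for text, expected in dataset
        if predict(text).strip().lower() == expected.strip().lower()
    )
    return correct / len(dataset)


def p95(values: list[float]) -> float:
    """Nearest-rank 95th percentile, e.g. of per-request latency or cost."""
    if not values:
        raise ValueError("values must not be empty")
    ordered = sorted(values)
    # ceil(0.95 * n) computed with integers to avoid floating-point rounding.
    rank = (95 * len(ordered) + 99) // 100
    return ordered[rank - 1]


def passes_gate(accuracy: float, baseline: float, tolerance: float = 0.02) -> bool:
    """True if accuracy has not dropped more than `tolerance` below the baseline."""
    return accuracy >= baseline - tolerance
```

Run on every change, `evaluate` gives the score to compare against the last accepted baseline, and `passes_gate` turns that comparison into a pass or fail for the build. Open-ended generation usually needs a richer scorer than exact match, but the shape of the loop stays the same.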

Question 4

What about hallucinations / accuracy?

Accepted Answer

Hallucinations are a system-design problem more than a model problem. The patterns that work are RAG with proper retrieval, prompt-level constraints, citations and audit trails, and an evaluation harness that detects regressions. We don't promise zero hallucinations; we promise the architecture and observability to detect and prevent the kinds that matter for your use case.
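One of these patterns, citation checking, can be sketched in a few lines of Python. This is an illustrative example under stated assumptions: answers cite sources inline as `[doc-id]`, and the function name `check_citations` and the ID format are hypothetical. It only confirms that each citation points at a document that was actually retrieved; it does not verify that the cited passage supports the claim.

```python
import re

# Assumed citation format for this sketch: an ID in square brackets, e.g. [policy-7].
CITATION_PATTERN = re.compile(r"\[([A-Za-z0-9_-]+)\]")


def check_citations(answer: str, retrieved_ids: set[str]) -> dict:
    """Compare the IDs cited in an answer with the IDs the retriever returned.

    An answer counts as grounded only if it cites at least one source and
    every cited ID is among the retrieved documents.
    """
    cited = set(CITATION_PATTERN.findall(answer))
    unknown = cited - retrieved_ids
    return {
        "cited": sorted(cited),
        "unknown": sorted(unknown),
        "grounded": bool(cited) and not unknown,
    }
```

Logging the result alongside each response gives the audit trail something concrete to record, and the share of ungrounded answers is a simple drift indicator to watch over time.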

Generative AI implementation — production systems, not demos.

Offerings inside Generative AI.

GenAI pilot to production

RAG implementation

LLM evaluations + observability

Prompt engineering + fine-tuning advisory

We’re typically the right partner when…

Generative AI Pilot

Tech we work with day-to-day.

Talk to a senior partner about your Generative AI engagement.