Google’s Andi Gutmans on the shift to agent-scale data management

June 23, 2026

At last week’s Google Summit in London, the company’s agentic data cloud vice-president and co-creator of web server scripting language PHP, Andi Gutmans, said enterprises are set to transition from “systems of intelligence” to “systems of action”.

In that world, as data volumes reach zettabyte scale, work will shift from human-scale data stewardship to agent-scale automation, with artificial intelligence (AI) agents organising data, managing metadata and building ontologies. For enterprises, that means a fundamental change in how data is managed and activated.

Gutmans helps lead Google’s strategy for enabling businesses to use their data estates to power autonomous agentic workflows.

We asked him about the technical hurdles to achieving reliable and safe autonomous workflows; the use of multi-agent architectures for quality verification; the engineering behind Google’s “borderless lakehouse”; the tension between the open source world he came from and the huge potential for lock-in to proprietary AI models; security boundaries in multicloud environments; and the ethical responsibilities of model providers.

As you guide enterprises from systems of intelligence to systems of action, what do you believe is the single greatest technical hurdle to achieving autonomous, reliable agentic workflows?

If a human-in-the-loop is one way to help guarantee quality, what if you want to build the human out of the loop – or at least have the human ‘over the loop’? What kind of mechanisms can be built into agentic workflows to help guarantee quality?

Do you build in system-prompt type guardrails, as we see in some AI architectures, to help with quality?

How do you see the tension between open source innovation and the proprietary nature of modern foundation models, given that the models providing the ‘intelligence’ are increasingly opaque and controlled by a few massive corporations?

You’ve championed the idea of the ‘borderless’ lakehouse. How do you reconcile that with the reality of increasingly fragmented, multicloud enterprise environments?

If an AI agent can reach across every cloud and database in the company, how do you prevent it from accessing data it shouldn’t? How does the ‘borderless’ dream balance freedom of access with the strict security and compliance required by a modern enterprise?

Whose responsibility is that scoping? Are you just providing the tool and leaving the responsibility to the customer? How far does Google’s responsibility go?

Having managed database services at AWS and Google, what is the most significant misconception organisations have about the ‘data gravity’ required to power generative AI effectively?

You’ve talked about the zettabytes of data under your control. If an individual struggles to manage a single 1TB drive on their home PC, how can enterprises manage data at this scale?

You spoke during the keynote about the massively beneficial results for humanity that can come from these technologies. How do we ensure that it’s not used for harmful purposes, such as military operational uses, or by regimes accused of war crimes?
