Seven Principles of Ambient Agents
Snowplow Analytics
There has been a robust debate between the OpenAI and LangChain teams, thoughtful contributions from Anthropic and LlamaIndex, and a standout“back to basics” manifesto from Dexter Horthy (“12 factor agents”).
Highlights & Annotations
In the first post in this series, I described a set of “ambient agents” working together to improve an online shopping experience. Ambient agents are found in the arena: both reactive to changes in their environment and proactive about solving problems without being instructed.
Ref. 2B75-A
Your agent should be capable of proactive problem-solving. The irritating package delivery that needs rescheduling? You merely express the intent: “This is a pain point for me, deal with it.” … Your agent takes over, interfacing directly with the vendor’s AI agent. It analyzes your calendar for true availability (understanding your preference for uninterrupted deep work blocks or the variability of picking your kids up from dance class), negotiates a new slot, and updates your schedule, all without further intervention. It simply informs you of the resolution.
Ref. D8C9-B
Our agents need to be autonomous operators, so that they are not limited in their effectiveness by the availability – and capability – of their human operators. If we are to realize the full potential of agents, we need to give them the right span of control to be as effective as possible.
Ref. 38BE-C
Ambient agents aren’t ‘coin-operated’, stuck waiting for external input from humans. They know what their goal is, they know what their action space is, and they independently make decisions and take action in pursuit of those goals.
Ref. 4135-D
Note that autonomous operation does not mean that there aren’t other agents or humans in the loop for authorization purposes. Like a military officer, the agent has a clear span of control, but decisions that fall outside of that span have to be sent up the “chain of command” for approval.
Ref. E5B7-E
For any data architects or engineers reading this – think of this as the ambient agent consuming the gold-layer signal, not the bronze-layer data (per the Medallion Architecture). Stream processing is essential to turn the firehost of raw events into clear signals that can be incorporated into the agent’s context window (pull), or delivered in real-time to the agent via Kafka events or Akka messages (push).
Ref. 6F63-F
Today, most agentic applications rely on the world-ontology that is baked into the underlying LLM through its own training data. But given how critical semantic reasoning is, it’s not surprising that more AI engineers are working to encode this understanding formally, and build agents that can develop that semantic representation over time. In pursuit of this, engineers are embracing semantic layers, knowledge graphs and ontologies; VCs are also joining the dots.
Ref. D76D-G
The end game for semantic reasoning includes the entire loop of perception, semantic interpretation, action prediction, decision-making, and learning from outcomes – it’s about integrating all environmental information into a coherent semantic framework to guide decision-making and continuously improve. A good place to learn more is the 2024 paper ‘Unifying Large Language Models and Knowledge Graphs: A Roadmap’.
Ref. 29A7-H
LLMs are a fundamentally stateless technology – but an ambient agent needs to have a memory like an elephant, not like a goldfish. Being set a long-term goal, an ambient agent has to be able to remember its observations, its actions over time, and its own impact on the environment and other agents; in short, it needs to be able to track its progress against its goal.
Ref. E4EC-I
Long-term persistence is the central design concept of a set of “agentic memory” startups, including Cognee, Mem0 and Zep. There is also some interesting work coming out of ServiceNow Research, with their idea of “tape agents”, a skeuomorphic concept of a central and shared cassette tape that stores all of the history of these agents.
Ref. C0FA-J
Ambient agents are not a new type of agent. Instead, they represent the coming wave of architectural patterns for building sophisticated, goal-oriented, agents. These principles put the high agency – the autonomy – into AI agents.
Ref. 9057-K
The seven principles are as follows:
Ref. E26E-M
The autonomous principle is so clear and compelling that it is starting to appear in vendor marketing for AI agents, for example from Bloomreach, emphasis mine:
Ref. 7A5C-O
Ambient agents continuously monitor the world as things happen, responding to their real-time observations rather than waiting for explicit instructions.
Ref. 0661-P
For this principle, I am deliberately using the term ‘perception’ rather than the more common language of ‘event-driven’. Why? Ambient agents don’t consume the raw firehose of every granular environmental event, any more than my prefrontal cortex directly consumes my eyeballs’ rod and cone data.
Ref. F17B-Q
These agents will be assembled inside each enterprise, inside each SaaS vendor, and across organizational boundaries; this is what Google’s Agent2Agent Protocol exists to facilitate.
Ref. DF8C-S
Asynchronous communication via event streams
Ref. ABFE-T
seven principles: continuous perception for ambient agents.
Ref. EFD4-U