4 Patterns of Agentic Reasoning Design
Today, the way most of us interact with Language Models (LMs) is akin to a non-agentic workflow. We prompt them, they generate an answer, and that's it. For the use case of asking ChatGPT to write an essay for you, it's like asking someone to write it without ever revising it.
Surprisingly, existing LMs excel in this mode, but when you contrast it with an agentic workflow, and you'll see a world of difference.
In an agentic workflow, the process is more iterative. Imagine having an LM write an essay outline, conduct web research, draft the first version, evaluate its own work, and revise accordingly. This approach yields remarkably better results, a fact often overlooked.
To illustrate, let's consider a coding benchmark called the Human Eval Benchmark. When tasked with coding problems using zero-shot prompting, LLMs perform reasonably well. However, when wrapped in an agentic workflow, their performance surpasses even the latest models. This observation holds significant implications for how we approach AI applications.
The Emergence of Agentic Reasoning
Amidst the buzz around AI's future, let's look at the concrete trends and design patterns shaping agentic reasoning. While the field is vast and chaotic, there are four key patterns for agentic design:
Reflection: This involves prompting an LM to not only generate code but also evaluate its own output. By engaging in self-reflection, LMs can spot errors and iteratively improve their solutions. One example is in our video about Self-RAG:
Tool Utilization: LMs are increasingly being employed as tools across various domains, from code execution to analyzing data and doing research. This expansion of LM capabilities boosts the potential of what LMs can achieve.
Planning: Although still emerging, this pattern involves interweaving multiple LMs to accomplish a task. By determining each step, what model to use, what choices to make, it enables autonomous driving of a complex multi-step agentic workflow.
Keep reading with a 7-day free trial
Subscribe to The MLnotes Newsletter to keep reading this post and get 7 days of free access to the full post archives.