Jokes aside, I think RAG is becoming less relevant in some ways partly because of Agents but not only. When ChatGPT first launched, it had one big limitation: it didn’t know anything beyond its training data. That meant it lacked the context to be useful for most business-specific tasks. On top of that, you couldn’t feed it much custom data in a single prompt. This is where RAG (Retrieval-Augmented Generation) became useful—it allowed companies to pull in information from internal documents or systems and pass it to the model in real time, so it could give more relevant answers.
Fast forward to today, and things look very different:
At the same time, RAG systems have evolved—they’ve become more complex, often combining multiple steps like retrieval, summarization, filtering, and formatting. And that’s where AI agents come in.
Agents simplify these architectures. Instead of building rigid, multi-step pipelines, we can now give the AI access to tools (like a document database or a search engine), define its goals, and let it figure out how to solve the problem. Think of this like a researcher who knows how to use a library and search tools to complete a task—you don’t need to tell them every single step.
However, agents also come with a tradeoff: they can be unpredictable. For high-stakes business scenarios, we still need modular, testable workflows—and in some cases, RAG is still the best way to structure those.
The bottom line? RAG isn’t going away, but it’s becoming just one tool in a broader toolbox. The real shift is toward agentic workflows—systems where AI can reason, plan, and use the right tools to get the job done. That’s what’s most exciting.