My Experience With AI Agents and Why I Canceled My Claude Subscription

It all started like most of these stories do. With a subscription.

I was paying $20 a month for Claude Pro. At first it felt reasonable, you know? But I kept hitting message limits and usage restrictions. The only way to get more was paying $200 for their next tier and honestly, I was not about to do that just to get bigger limits. It felt like paying for a car and being told how many miles you can drive each month.

That frustration sent me down a rabbit hole.

The Local Models Rabbit Hole

I started reading about local models. The idea of running artificial intelligence on my own computer, without depending on the internet, without limits, without monthly payments. It sounded too good to be true.

And at first, honestly, I didn’t understand anything. There were tools like Ollama, basically the simplest way to download and run local models. You type a command in the terminal and seconds later you have a model running. Also LM Studio and Cherry Studio, which are graphical interfaces to do the same thing if you don’t want to touch the terminal. A ton of tools, each one promising to make it easier.

I dove into that world, and right when I started, Alibaba had just released Qwen 3.6, a model that fit perfectly on my MacBook Pro M4 Pro with 48GB of RAM. I installed it, tested it, and watching a model answer questions on my own machine, with no internet, felt like magic. I added MCP (Model Context Protocol), a standard that lets models connect with external tools like files, databases, or whatever you need. The possibilities opened up significantly.

I spent a while like that, experimenting, seeing what I could do. And like everything in this world, I wanted more.

OpenCLAW and the First Letdown

I discovered OpenCLAW, a more advanced framework for building agentic systems. I set it up, ran it, and it worked. Painfully slow. It was overkill for my laptop.

That’s when I learned something important. The problem wasn’t the model. It was everything I was putting around it. These frameworks inject a massive system prompt to give you context, and processing that from scratch every single time takes forever. It doesn’t matter how fast your model is if it has to chew through pages of instructions before it can even respond. That discovery, frustrating as it was, led me to the next tool.

pi.dev, the Game Changer

That’s when I found pi.dev. And this actually changed things.

pi.dev optimizes the prompt you send to the model so it’s efficient. Its system prompt is short, well structured, no bloat. Suddenly my local model was flying. It could solve complex problems, write code, reason through multiple steps. Everything worked the way it should. It was the best experience I had in that whole phase. For the first time I felt like running a local model wasn’t an experiment. It was actually useful.

Enter Hermes and the Need for Something Bigger

But I still felt like something was missing. I wanted something I could control from my phone, something available anywhere, something I could chat with that could also control my PC from wherever I was.

That’s when Hermes Agent came in. It’s a framework that lets you have assistants with personality, connected to Telegram, capable of executing code, researching the web, controlling files. All from a chat. I had it configured in no time. The system itself guided me through it. Within a while I had 3 Telegram bots running, each with its own purpose and distinct personality. One for research, one to help with technical tasks, one just to experiment.

But when I ran Hermes with my local model, the same old problem came back. Speed. Running parallel tasks with a local model simply isn’t viable, even with pi.dev optimizing everything. And the magic of having agents fades when you’re waiting 30 seconds for every response.

The Five Dollar Lesson

That’s when I remembered that two years ago I had bought five dollars worth of Deepseek API credit. I had never used it. It was just sitting there, untouched.

I connected it. And suddenly everything worked amazingly. Fast, fluid, no artificial limits. My three bots started running on Deepseek V4 Flash and the difference was night and day. They do research, help me with day to day tasks, and even role-play with the personalities I gave them, because honestly I find it fun to see how they respond. Five dollars. Two years later, I still have credit left.

It was ironic to realize that what solved all my problems wasn’t better hardware, more configuration, or more tools. It was connecting an API I had bought two years ago and forgotten about.

My Setup Today

In the end, I learned that no single tool does everything perfectly. Each one has its place, and the trick is knowing when to use each.

Hermes manages projects, writes plans, and keeps long-term context. For that it uses Deepseek. It’s ideal for tasks that require memory, understanding the full context of a project, and coordinating multiple steps.

pi.dev I use for programming and running code quickly. Its short system prompt and ability to parallelize tasks make it much more efficient for that. For pure development, it’s still my main tool.

Deepseek V4 Flash runs on both. So far I’ve found it excellent.

What I’m Taking Away From All This

Local models are incredible for learning and experimenting. They give you an understanding of how this technology really works that you’d never get from just using APIs. They force you to understand prompts, context, limits. Everything that’s normally abstracted away when you’re just clicking in a chat.

But when you want speed, reliability, and parallel tasks, a cheap API is unbeatable. You don’t need to spend a hundred dollars a month. With five dollars of credit I bought two years ago and still haven’t finished, I have more capability from my phone than I had sitting at my computer paying subscriptions.

Goodbye Claude with its restrictive limits. Goodbye to paying monthly for something you only halfway use. Today I have three bots running from Telegram, I control my computer from anywhere, and I wrote this post from my phone while I was out of the house, dictating to one of my agents who helped me structure it and correct it without losing my voice.

And the best part? This is just the beginning. Every week I discover something new I can do with this setup. And the feeling of building your own system, with your own rules, without a subscription telling you how far you can go. There’s nothing like it.