← Back to Home

The (Perfect) General Agent is Claude Code

2025-10-25

Back in 2023 we imagined the increased productivity through AI and since then not that much has happened. Where are all the productivity gains that McKinsey consultants promised us?

It is clear that we must rearchitect vs retrofit and for the first time i think Anthropic has launched something that has a real shot for prosumers and companies alike to make a dent: Skills.

Quick recap on what waves i observed in terms of productivity gains:

Wave I: Enterprise RAG → Using new technology to solve a fundamental search problem. Companies like Gleam and Sana AI threw new technology at an existing problem without building new interfaces. RAG is dead and they will get either bundled into Google Drive or acquired by enterprises.

Wave II: Better Interfaces → The hypothesis was that ChatGPT and Claude would never build something truly for B2B. Companies like Langdock built better interfaces with white glove onboarding for SMBs, going through agencies, doing what ChatGPT or Gemini don't want to do because they're too consumer-focused1.

Wave III: Coding Interfaces → Now we're in the third wave. Cursor and Claude Code didn't just target developers; they built for the laziest users on earth. The irony is that this developer-first approach might be what finally makes AI useful for everyone else. Every breakthrough in Claude Code is now making its way into the Claude desktop app - and that's where the real story starts.

A Terminal is All You Need

Claude Code is perfect, because of its simplicity. The only thing you need is a file system where you can save all of your projects and chats as markdown files forever. You can execute code and little scripts and make them reusable. Websearch is also integrated.

The real killer are the tools. Agent skills are a way to revolutionize MCP tooling, and this has also now found its way into the Claude consumer app. It's amazing because you can essentially use an API, save your credentials super conveniently and safely in your local file system in an environment file, and just build anything on top.

What are Skills? They're similar to tools or MCPs, but you can tell the agent what to do in a specific way. It's like giving the agent its own context window - essentially its own claude.md file where you can specialize it. It's a standard operating procedure, and this is perfect. This is exactly what we need in companies. This is the best iteration of the productivity and tooling problem we've had so far.

The obvious take for skills is:

  • Ease of creation
  • Versatility
  • Context efficiency

The non-obvious:

  • Iteration that is quick → even for non-technical people - how can a customer support agent change a rule of an agent?
  • Build and teaching skills is democratized → Building an MCP server is actually hard
  • How are we sharing/ collaborating with these skills/ gems/ gpts → actually an open research question → should we still use git or should people like Samuel from Lix or GitButler find a better way for version control

In theory, I can connect my Gmail mailbox, my project management suite, my GitHub account, my journaling service, maybe even unofficial APIs like the iCloud API, and my calendar API. But I can also add stuff that doesn't have official support—like web scrapers—and build micro apps real quick because I can just code these and then make them reusable. The nice thing is that making them reusable was really hard in the past. You needed to host an MCP server, but now you can do it really quickly.

For example, I used to go to a gym where I could book classes. I reverse-engineered the API and got all the gym classes sent to my email. Now I can just use Claude Code on my iOS app and do it on the fly. I can say, "Hey, this is the website, please spin up a Selenium session, log in, and tell me all about the classes available next week, then auto-book them as soon as they're available" (because they were booked out fairly quickly within a couple of minutes). I could have a truly background agent that checks whenever a new class becomes available, all completely on mobile.

What's also super nice is that there's a plethora of GitHub apps and repositories that I can just remix. I used to live in Berlin, and booking an appointment with government officials (the Amt) was really hard. There was a GitHub repository that sent you an email notification fairly quickly when appointments opened up. You can just clone this, have it on your personal server all the time, and essentially vibe-code your life.

That's another good example: I really like this Obsidian continuous research type of thing, having it as a second brain. I think Obsidian or Roam Research back in the days was really flawed because you had to write everything down - you couldn't dictate it - and structuring the nodes was super tedious. Now I built a Claude skill, that goes through all my old notes that I've done over the years, extracts all the people's names I mentioned, builds an ontology, and links them together. So I can just ask, "What was the most fun trip I did with a friend of mine?" and there's a really easy way of looking it up. It's like a very intelligent contact system, and it all runs with the power of markdown files.

Even this website is another example. I just have markdown files in a folder on my computer. I have a bunch of thoughts and transcripts from old founder calls. I use Claude to explore my past knowledge or an idea, and then I put them in a project or research folder. When I feel good enough about it - even if it only reads like 70% polished - I can just push it to my own website, which also consists of markdown files, and then it's online. I just really like the interoperability of markdown files and LLMs on top of it, voice dictation, and truly being in charge of your data.

Another thing I use is the Happy Engineering iOS app. When I'm driving for four hours, the Happy Engineering app uses an ElevenLabs voice agent on top of Claude Code, so I can talk with it. I can say, "Hey, please go into my vault and tell me everything about notes on a certain topic," or "Let's start a new research project." This is all available in ChatGPT voice mode, but ChatGPT voice mode is real-time and stupid - it cannot think for a long time and doesn't have access to all of my files.

More info/demo about this setup.

Footnotes

  1. The real money lies in ads. Or they are swinging for the 10m+ enterprise and gov contracts like Palantir.