1000+ Hours in Claude Code in 60 Mins
Chapters11
The episode centers on becoming the director of your coding agents and treating Cloud Code as a second brain to make your business AI-native, with a focus on planning, verification, and evolving your system over time.
Be the director of your coding agents with a strong planning, verification, and harnessed workflow using Claude Code and Cloud Code—and treat your AI as a trusted, learning partner.
Summary
Nate Herk sits down with Cole Medine to dissect turning Claude Code into a reliable “second brain” for your business. Cole argues that true progress comes from being the director of your coding agents, not just prompting and praying, and he centers Cloud Code as a practical platform for building evolving systems. They cover how context windows and the so‑called dumb zone affect reliability, stressing up‑front planning, ongoing validation, and continuous system evolution. The chat covers concrete techniques like using hooks for security, building a harness to orchestrate multiple coding sessions, and leveraging skills for reusable prompts. They also dive into the trade‑offs between plan mode, sub‑agents, and agent teams, and how to design deterministicish workflows with assembly‑line thinking (the Ralph loop) for larger tasks. Throughout, Cole emphasizes treating the AI as a mentor you can query, learn from, and improve with—turning automation into a repeatable, auditable process. Nate and Cole share personal experiences, including a real‑world incident where proactive AI actions caused an email mishap, underscoring the need for robust guardrails. The conversation closes with practical tips, from security checks via hooks to planning that centers on intent engineering, plus how to find Cole online for deeper dives into AI coding assistance.
Key Takeaways
- Plan with context, then validate and iterate: success hinges on a strong initial plan and a reliable verification harness for each coding pass.
- Use Cloud Code as a system, not just a tool: think in terms of skills, hooks, and an orchestrated workflow (the Ralph loop) to avoid the dumb zone.
- Security and permissions matter: hooks are a primary way to enforce safety, and never assume “never delete” will be enough.
- Treat the AI as your second brain and product manager: define intent, write clear specs, and let the agent evolve the system with every failure.
- Harness engineering beats single‑session chasing: split large tasks into phased agent work to stay out of context collapse and improve determinism.
- Skills, status/visibility (status lines), and routines are your day‑to‑day affordances for reliable automation—build around these pillars.
- Edge cases are best found proactively: deliberately test inputs and scenarios to harden the automation before production use.
Who Is This For?
Essential viewing for developers and operators adopting Claude Code and Cloud Code who want reliable, auditable AI automation. Great for non‑engineers who collaborate with technical teams to build AI-native business processes.
Notable Quotes
"What would you say by the end of this podcast that everyone will have learned from you?"
—Cole frames the episode around directing coding agents and evolving cloud code as a system.
"You have your way of working with cloud code that evolves itself over time. And we’re going to talk about more than just using it to code."
—Cole explains the core premise: cloud code as an evolving system, not just a coding tool.
"The main thing is planning with context, building that thing, and then having an approach for verifying… anything you build with cloud code."
—Key four‑step pattern: context, plan, verify, evolve.
"Attention is scarce. Don’t dump everything up front; control what the model sees and allow it to discover what it needs."
—Addresses context management and the dumb zone concept.
"If you tell it never to wipe a database, it’s still going to do that."
—Illustrates the reality of model behavior and the need for robust guardrails.
Questions This Video Answers
- How do I implement a Ralph loop with Claude Code for large automations?
- What are the best security practices (hooks) for AI-powered workflows in Cloud Code?
- How can I design a deterministic-ish AI workflow using multiple coding agents?
- What is intent engineering and how does it apply to building AI-native business processes?
- Which are the top three Claude Code features for day‑to‑day automation, and why?
Claude CodeCloud CodeHarness engineeringRalph loopAI coding assistantHooksSecurity in AIAgent teamsSubagentsSkills
Full Transcript
What would you say by the end of this podcast that everyone will have learned from you? The main thing I want to talk about today is how we can be the director of our coding agents. Everyone is hearing nowadays how large language models can support up to 1 million tokens in their context. That's like the Harry Potter book five times over. Large language models have what's called the dumb zone. With Opus right now, it's usually around 250,000 tokens and I feel like it gets into the dumb zone. It definitely comes with a false sense of security with people now thinking that they have the million.
With coding agents, you spend more time planning than you actually do building. Without the verification checks, maybe it's 65 or 70, but now you can get something that is 92 on the first pass. If you tell it never to to wipe a database, it's still going to do that. If you don't allow it to delete a folder, it can still write a script to do that. Recently, something did happen to us. The agent was trying to be proactive and it actually saw something on its task list, but it misinterpreted it and it ended up sending an email to our entire list with a discount code and it was not supposed to go out.
If you have the mindset that anything that the agent can read or can touch, you have to assume that it will, even if you never ask it to, that assumption is what's going to save you from having your database deleted. All right, Cole, thank you so much for being here today. I'm so excited to dig in. I'm excited to be here. Yeah, thanks for bringing me on to your podcast, Nate. I'm looking forward to this. Absolutely. Yeah, it's been a long time since we've talked, so I'm excited to hear what you've been up to and to hear kind of like the sauce that you're going to drop on everyone today.
So real quick, what would you say by the end of this podcast that everyone will have learned from you? Yeah. So the main thing I want to talk about today is how we can really be the director of our coding agents and specifically cloud code because that's what most people use right now. That's what I use. But really, it's creating that system where you have your your way of working with cloud code that evolves itself over time. And we're going to talk about more than just using it to code. Really, I use my cloud code as my second brain.
I like to call it. I know Nate kind of calls it as AIOS. Everyone has their term for it, but really like using cloud code as the tool to make your business AI native. We're going to get into all of that and just some highle strategies that honestly you can start applying today. I love that. Yeah, I'm I'm super excited to dig in because, you know, I don't come from a formal software engineering background and I think that I would I would guess that the majority of my audience doesn't either, but obviously with the the products being called Cloud Code, I think a lot of people that I bring that up to who aren't super deep in the AI space, they obviously think that it's a tool that is for coders and you need to understand code in order to use it.
So, um I love that framing. And real quick before we jump in, you know, me and you have we've known each other for quite a bit. I feel like, you know, right when I kind of quit my job and started on the space, you were one of the main channels that I followed and I still follow to stay up to date and to to learn about how to work with AI in the right way. And um we've kind of just been able to see each other grow and and you know, check in. So, I'm really excited to dive in, but I wanted to make sure you got a chance to real quick give everyone a quick intro if they haven't seen your channel before on what you do and um yeah, what you're up to.
Yeah, sounds good. You know, before I give an intro though, I kind of want to share something a little bit about what you're talking about. Like when we first met, it's funny because I I actually remember I had um about 50,000 subscribers when Nate first reached out to me and he had like 10,000 and now it's a little bit different. I have like 200,000. You're you're almost 800,000 now, right? Like it's pretty crazy. Um it's been really cool to see you grow, how fast you've grown. But yeah, we were both like smaller channels at the time.
Um so yeah, it's it's been a long time. Wild journey. Uh yeah. Anyway, as far as what I actually do, so like Nate said, I come from a software engineering background. So, I've been an engineer my entire life. Ever since I was eight years old, actually, I I started with this language called Scratch. It's developed by MIT. So, I was just like building video games as a kid, like Super Mario Bros. and Pokemon, like really cliche stuff. Um, but that that's what got me into the world of coding. And so I took that through high school, college, got my bachelor's in computer science and um then I had just like a software engineering job in a Fortune 500 company and it was great but I always wanted to be an entrepreneur.
And so when generative AI started to really become a big thing at the end of 2022 with the release of chat GPT you know and it took the world by storm that's when I knew like okay this is where I want to go all in cuz there's like a really big opportunity for software engineers specifically to build agentic applications and so I started doing a lot of that like for my company and for friends with their startups pretty much dedicating all day and all night to it for a very long time like over a year and so it got to the point well I know a year might not feel like a long time but in the AI space a year is a long time.
So it it got to the point where like okay I got some things to teach people. So that's why I started my YouTube channel. So originally it was like really really technical like I was there like writing line by line. I wasn't even using AI coding assistants back then just showing how to build AI agents with like you know lang chain and langraph at the time. And um now that's evolved to a lot of different things like I do a lot of like focusing on AI coding assistance which is why we're talking about that today.
Um, and yeah, I quit my my full-time job like three months after starting my YouTube channel, which I think is about the same for you, Nate. Yeah. U because it's crazy like how fast when you when you do it right and and you're teaching people valuable things like how fast a channel can explode. And so now now what I'm up to is I have my AI community um similar to Nate where I've got course content, weekly workshops that I do. I've also been doing some more enterprise level training. So coming into a team and doing like a 4-hour session, helping them adopt a full system for using AI coding assistance so they can really have as like a standard for the team, you know, get away from Vive coding to really have a structured approach and helping them actually bring that into their existing processes and tech stack and things like that.
So that's been pretty awesome. And so like really like that and everything I teach in the community, I'm bringing a lot of that here to what we're going to be chatting about today. 100%. Real quick, guys, quick break to tell you about today's sponsor, ClickUp. ClickUp is the software to replace all software, which I think is pretty funny, but very true. If you guys have been following me for a while, you know that I've been using ClickUp for a long, long time. Everything that I do with my team lives in ClickUp. All of our communication, all of our project management, all of our chats, and everything I was doing with my clients back when I was running the agency day-to-day, we were also inviting them to a ClickUp.
So, it had replaced Slack for us, and it had also replaced our project management tools. So, if you're already using ClickUp, you have to try this new feature called Brain 2. But if you don't use ClickUp already, then Brain 2 is an amazing reason to try out ClickUp. It's kind of like a supercomputer that can do a ton of cool stuff. And I'll talk about in a sec. They have super agents in here. But you can switch between the different chat models that you probably already use and love. Right here, you can see that I've used Brain myself to look through everything that's going on in our projects and then create me a monthly presentation for the team.
So, what that could look like is me asking Brain to create an investor presentation pitch deck for our texttospech startup called Glido. And I told it to just use mock data, but make sure that it's professional and engaging. And just like that, we have the deck, which I can open up full screen right here. We've got the voice AI platform that makes every brand sound human. And as I start to navigate through here, you can see that we also have animations in here. So, it's not just a static, you know, slide deck. We get to actually go through and we feel the animations.
And think about the fact that this was just a one sentence prompt. If we really started to put more and more data into this thing, it would be really, really solid. And this right here is just one of the many use cases of Brain 2. So, it's not just a chatbot. Like I said, it can do things and you can build your own super agents in here. And what I think is really cool about the super agents is they're 24/7 agents. You can tag them in ClickUp. You know, you can at message them and they'll wake up and respond to you and they can search through everything.
Which is why, in my opinion, it's a lot cooler that ClickUp is doing this compared to something like chucking an OpenClaw or Hermes agent into ClickUp because these agents already have full context and can search through everything. So, right now, because you're watching this video, you can claim this super awesome offer that is on screen right now by using the link in the description. Now, let's get back to the video. Yeah. Well, I am just I'm so glad that that we both took the leap because it's, you know, it's not an easy decision, but um your brain just gets it.
And so, it's been great to see, you know, the consistency and what you've been up to. But I think that if you think back to, I don't know, 5 10 years ago when people were going out to get their, you know, CS degrees and stuff, it's like that was such a safe bet at the time, you know, and I don't think a lot of people were predicting how much how quick that was going to flip as far as like, you know, that graphic of what is AI being applied to and right now it's just majority is coding and software engineering and obviously everything's going to catch up.
But um it's just great that you were able to, you know, make that pivot and be ahead of the curve and now now here we are. So um being able to us have this conversation, one of us coming from like a non-technical background completely and one of us coming from a technical background is going to be really cool. So yeah, let's just jump right in. Yeah, sounds good. Cool. So for for what I have prepared for today, um you'll see like you'll see it shine through that I come from a technical background, but but really what it comes down to is like I'm going to bring these concepts into using cloud code for far more than just coding like I alluded to at the start.
And so I think um you know for me like I I really enjoy leaning on my technical expertise because a lot of the ways that you'll use an AI coding assistant for your ops your AIOS your second brain whatever you want to call it um you are going to be borrowing from software engineering principles whether you realize it or not. So a lot of times just as you learn how to use these tools effectively and you're just learning best practices from Nate's YouTube or Anthropics blog or Boris Journey or whoever like they're bringing software engineering principles and a lot of like product management manager principles as well.
And so yeah, like some of the examples that I have here um that will cover like they're a little technical. Um but that's really just like to illustrate how how I started using this tool and then of course I'll like generalize things a lot as well and um give some specific examples too. Um so if you if you want Nate, I can just like dive right into the first part that we have here. Okay, cool. Yeah. So, I got like just quick over I mean we'll go pretty quick through this because I I want to keep this pretty casual and I know you you do as well, Nate, but just like a few different pillars here of how we can go from simply using cloud code to what a lot of people call vibe coding, you know, prompting and praying where you're you're pulling that lever like a slot machine, getting to the point where we're really directing it and having that system for reliable and repeatable results.
And um it really can be simpler than you would think, right? Like most of what people do that you really shouldn't do is you throw in a request and you don't do much of the planning up front or the validation after. Like those are the two things that I really want to talk about here. And that applies to uh writing any code or any kind of application. It applies to evolving your system like as you're creating skills and integrations for cloud code or even just using it to automate things in your business. Um, and so yeah, the approach is you always want to plan with context, build out that thing that you're looking to do, and then have an approach for verifying like as high level as I can possibly keep it.
And then the other like kind of golden nugget here is every time you go through this loop with clawed code, any kind of agentic workflow or thing that you're building, there's always going to be an opportunity at the end to evolve your system. And we'll talk about what that means in a little bit here, but like really that comes down to there's going to be something in the way you work with cloud code that you can improve so that next time it's going to be better. And I'm being high level here on purpose cuz I'll get into some more examples.
But a lot of people don't think about doing this, right? They kind of like get to the point where it's like, okay, my application works. Like this website looks good or it's now able to automate creating invoices, like whatever it is. And they're like, all right, we're done. like let's next time I want to create an invoice, I'm just going to go through the same process again. But like really there are going to be those problems that come up over time where you can engineer so that they happen less often, right? That that system evolution is kind of what I like to call it.
So you're having you're having it learn just like you would an employee, right? Absolutely. Yeah. Like my my second brain, I literally call it my co-founder, right? So I want it to like learn me better over time and how I like to work, how I want it to work as well. Mh. Yeah. And I think this four-step kind of framework or whatever you want to call it, it yes, when you kind of maybe look at it like this, it might feel like it's a technical software engineering thing, but if you just relate that back to the same way you would maybe like let's just say build a treehouse, like you would plan that thing out first.
You would draw it out. You would understand how much wood you need and where, you would get the right gear, and then once you've built it, you're not just going to put your kids on it. You're going to like test it. You're going to make sure that thing's not going to fall. So, um it's just a great way to think about it. And especially if you think about some of the the pitfalls that these models have with like the sick of fancy essentially just being a yes man. If you say, "Hey, you know, I want to do this.
Does that look good?" And they're just going to say, "Yeah, it does." without really looking over the plan. And then on the verification side, you know, sometimes they do tell you something's done, but it's not. So having your own method of doing that as well, really important. By the way, guys, I know we are diving into a ton of information in this episode. So, what I did is I broke all of this down into a free resource guide that you can access for completely free by joining the free school community. The link for that is down in the description.
Also, if you want to check out some of the key moments from this episode and all future podcasts on my channel, then go ahead and check out the AI Automation Society YouTube channel where we're going to be posting some of the best moments from the podcast over there. I'll link that YouTube channel in the description of this video as well. Anyways, thanks guys. Let's get back to the podcast. Yeah, verification really comes down to prove to me it's actually done and working, right? Right. And so like for any kind of coding task that's things like unit tests and linting and like that's where it gets a little bit more technical, but like really you can apply that to anything.
Um like I this is an example that I'm going to spoil right now. Um I use claw code to generate this entire diagram. Like I have I had a feeling you did. Yeah. Yeah. Yeah. Yeah. So I have I have a skill. It's my scaladraw diagram skill. I've covered it on my YouTube channel actually. So I use it to build this whole thing. And um I was going to talk about this example a bit more right here when we really get into like verifying the work. Um but I think it's just such a good like non there's nothing to do with coding here.
It's just creating a diagram. But as far as far as verification goes, I actually have it take the Excal diagram and render a PNG. So there's like an integration that I built into the skill for Cloud Code. So it can render it as an image. And as a lot of you know, like Cloud Code is able to understand images incredibly well now. for like the last year, it's been so good at um even viewing like a like if I zoom out here, like there's quite a bit of context, but like it can pick out the tiniest piece of text in a larger image like this.
And so I have it look at that and then figure out like if there's any kind of like padding or spacing issues, like if there's any sort of overlap and and trust me, there was like it had to iterate a couple times to build something this big. Uh but then the the point is like it is able to iterate by itself. So, we don't really care about the initial mess ups that it has. As long as it like does that by itself, we just care about that that last thing it hands back to us when it says it's done.
So, if we have this if we have this step when it says it's done, then like it actually is or at least it's closer. I mean, it's still probably not going to be perfect, but you get the idea. Yeah. Yeah. 100%. I've done something pretty similar with my video editing pipeline with the motion graphics it adds and sometimes things would be out of bounds. But like you said, the whole idea is it's almost never going to be 100% on that first pass, but without the verification checks, maybe it's 65 or 70, but now you can get something that is 92 on the first pass.
Right. Exactly. Yeah. Yeah. It's it's good. So I mean verification, validation, whatever you want to call it, like that is one of the biggest things that I'm focusing on right now for any kind of application or automation that I'm creating. I want some kind of harness for the coding agent to be able to validate its own work for code to validate its own work. And for some things like um website design, it's actually pretty easy. There's a lot of tools out there uh maybe you've heard of Playright or Verscell's agent browser um for it to really just spin up the site, right?
It can run the command to start the website and then it can visit it just as a user would take screenshots along the way to prove things to you or even just view the the UI itself. It's pretty easy for other kinds of things that you'll build. U it can be kind of hard to have the agent really verify its own work effectively. one like really simple example kind of silly example. Um I in my spare time like I' I've always loved like video games as a kid. I mean like I talked about with Scratch.
I mean I was building like Pokemon and and uh Mario Bros and stuff. And so like I've actually like been doing a little bit of just trying to I mean I hate to admit it but Vibe Code video games, right? It's just a hobby. I'm not trying to like do something too crazy and it's more just like having it run in the background for fun. But like one of the things I had to think about is like how do I build a harness for the coding agent to be able to actually play the video game.
It's a bit trickier because they can't like coding agents they need time to think, right? So if you have a game that's running at 60 frames per second, it's not really going to be able to react to things the way that a human would. So thinking about a system where it can basically like slow down the frame rate. I know it's kind of like a silly example, but it's just like that's one of the biggest things you have to engineer for for anything is like how would the agent actually verify that as a user would because just like looking at the code it creates or the skill it builds for you like that's not enough for it to just do that sort of like review highle review which is good but like you need to wait for it to really like use the application or whatever you're making as you would.
Yeah, absolutely. And real quick, for anyone that might not have heard the term harness before, what is your kind of quick definition of that? Yeah. No, that's good. I know it gets it gets technical, right? Yeah. So, um, usually when people talk about harnesses, they're talking about something more like what I was going to talk about a bit here at the end. Um, so what I'm talking about as far as like validation is more like I mean it's it's kind of I I have to think about like how to actually explain what a harness is really.
It's it's the wrapper around the large language model, the tools and context that it has access to. So it knows what it's working on and how to work on it effectively. So if we think of like a harness for AI coding, cloud code is actually a harness, right? like it when you download Claude code and you run it, it loads a system prompt on top of Claude as a large language model. It gives it the tools so it can run commands and create files on your computer. Um, that's what really makes it a harness. And and then when I was giving the example of like a harness for testing, it's more like u giving it a system where it's like, okay, these are the commands I can run to start the game and then like slow down the frame rate so that I can interact with it frame by frame and like really stop and analyze and think before I take another action.
So it you can think of it kind of like a so I mean maybe I will just jump ahead here. You can think of the harness as the thing that just wraps the model. And then there's also that that component of the harness that you get to build yourself. I call it the AI layer. And so for cloud code, that's like your claw.mmd and your skills and your hooks and any kind of MCP servers that you're bringing in to connect it to your other platforms like your CRM or your task management software, right? That's that's building on top of the harness.
So it's kind of like the large language model is the reasoning. It's it's the brain at the center and then you pick the tool like cloud code or codeex or whatever and then you can sort of like build the context and integrations on top. Absolutely. I love it. Yeah, well said. I think something something fun anyone listening should try real quick is if you go to an AI model and ask it to explain an AI harness or an agent harness. I would be willing to bet it does the whole car analogy where the engine is the AI model and the car is the harness.
So let me know if you guys run that and and see if that's what you get. Sounds good. I mean we could we could test it right now. No, we won't we don't need to do it right now. But yeah, that's that's your homework for today. Yeah. Yeah. Yeah. Cool. Um Yeah. Yeah. So, I mean, we've talked about like validation a lot. Um, planning is the other thing that I really want to hit on cuz most people don't do enough of it. And it takes it takes patience. And this is like one of those um software engineering disciplines that I like to bring into um even when I'm talking to someone who's not writing code or who isn't technical is you have to spend I mean with coding agents you spend more time planning than you actually do building because you you really put a lot of your effort up front into the plan and then you use that to delegate as much of the coding as you possibly can or for a lot of us all of the coding to the AI coding assistant.
And so its success is really just dependent on how good is your plan. Usually you have some kind of like a lot of people like using markdown, right? I use markdown a lot. So I'll have like a single markdown document that outlines um you know like the goal. What are we building here? What is success actually look like? And like of course with that comes the validation strategy um that we've already talked about. So how does it know that uh the work is done and working well? And then um not to get like too technical here, but especially more for any kind of like coding task, you're going to have like the integration points, right?
Like if you're building on top of an existing automation or application or website, whatever, like what are the parts of the codebase that we actually have to touch? And so if you are more technical, you can sort of evaluate like make sure it's understanding it's correct of like, okay, what files are we really going to create and edit here? Not that you need that. Um, and then once you have that plan, then this is kind of what my workflow looks like. And then this is for anything. So you do some kind of like context loading up front, any sorts of like documents that your agent needs related to the task at hand.
And then I'll typically have it do some kind of research, usually using sub agents for that. So if I'm building a new application, maybe I'll have one sub agent research what's a good tech stack for this. What's a good like approach if there are people that have built similar applications, right? So like especially if you're not as technical, that can be really useful for it to just gather a lot of information and then propose a plan to you. And so that's when you you create the plan with the coding agent. This is also where usually you want to have the coding agent ask you a lot of questions.
Like I know Nate, you just put out a video today on uh Matt Poc's grill me skill, which is really good. Like you need to make sure that you that the coding agent is not assuming a ton of things about what you want it to do, like the workflow you want to build, the skill you want to build, whatever. And so having it ask you a lot of questions, clarify those things is good. So that way you can be confident that once you have that final plan like this is about this is what we're going to go and do now that both you and the coding agent are aligned on what's actually going to be done and and how you're going to validate it.
Absolutely. Yeah. I love it. When you do that, are you typically using in cloud code plan mode or are you kind of planning but not in plan mode? Yeah, usually I don't use plan mode. Okay. It's It's good, but plan mode like puts Claude code into a bit of a different behavior that I'd rather be able to control my control more myself. So, my skill for planning is like instructions for how I want it to ask me questions and then just like generally how I want to go about researching and organizing things into a plan.
And so, like I want to define the sections. If you don't, then you're just using Cloud Codes plan mode. Like it'll build something actually pretty much like this. But I just like having that more um that that higher level of control. I think that's a theme that you get a lot through my content in general is that I I like to have control and customizability cuz in the end that's how you get the best results. It's just it's kind of like that learning curve to get to the point. Um like for example, I I don't use OpenClaw or Hermes.
I have my own second brain that literally is just built directly on top of Clawed Code. And I'm a big proponent of that even though those other open source tools are very powerful because you're running something that you don't understand and it's harder for you to like really take as your own and it's not like a foundational component that you can create your own system on top of. So you're more like adopting someone else's system. And these tools have done a really really good job making it easy to extend and and really make your own.
But like in the end building something from the ground up is always going to give you the most control even though that can be pretty daunting. Yeah, I hear you. Yeah, that's interesting. I mean, it it really does make sense. I always love, you know, that's something I just say a lot, which is a very simple theory is just to be genuinely curious to understand what's going on, especially when I don't understand what these lines of Python code that it that just got written mean, you know, and the whole idea of dark code. And I guess what do you think about that whole idea because I know you talk a lot about vibe coding and and preaching understanding things at their core.
So when someone is generating automations or code that they don't understand how to read, yeah, how do they actually feel secure and safe about that? Yeah, that's a really good question. So pretty loaded, too. No, I'm No, that's that's good. I I welcome it. So I I'll answer in two ways. I'll answer first by saying that like maybe not everyone loves to hear this but like if you are using an AI coding assistant to write code cuz you're building your second brain you're creating automations whatever it is I would recommend at least trying to get to the point where you can understand the code and really at first that can be as simple as just asking cloud code or whatever coding agent to explain what it just wrote because code can look pretty intimidating but when you get over that like initial hump like it kind of reads like English and maybe that's just me being extremely ignorant because I've lived and breathed it since I was eight years old but it starts like as long as you understand the core primitives of like this is a class this is a while loop this is a if statement like it starts to read like English you're like okay I understand when this part of the code is going to execute now just asking your coding agent constantly and so um I mean like in cloud code there's the slashby the way feature so like you can always just kind of a sidecar conversation where it's like, "Hey, help me understand like what the heck is going on right here." And then it doesn't have to to dilute your main context and just kind of like keep throwing context at at Claw Co.
Like you can have that separate conversation for your own understanding and then go back to the main task at hand without it being affected. So I would recommend that. And then you know if someone is really not inclined to learn how to code like that's just not your goal. You want to use cloud code to automate things and not have to like engineer applications. I totally get that as well. Really comes down to your validation strategy is what's going to dictate how confident you can really be and what is created. So if you're spending a lot of time in this is why I say like whenever you're building something with cloud code, the way that you don't vibe code is that you sandwich the delegation of the coding between the planning and the validation process that you're heavily involved with.
Right? Like the only reason I'm ever going to say, "All right, Claude, go rip through this." is because I made sure I created a really detailed spec and I've defined like this is how you're going to tell me that you're done and how you can be confident that you actually are. I love it. Very well said. Nothing to nothing to add there. Cool. All right. Sounds good. Yeah. Um Yeah. And as far as like creating that plan with the coding agent, the most important thing is to manage the context like what your coding agent is going to really be paying attention to at the start of any kind of planning session.
So the the thing here is that attention is scarce. And so there's a big misconception right now for a lot of people where they think that like it doesn't really matter how much you throw at a coding agent because everyone is hearing nowadays how like large language models can support up to 1 million tokens in their context when they're like oh that that's like the Harry Potter book five times over I forget the exact but people like always throw like some some analogy where it just like makes it pretty obvious where it's like 1 million tokens is an insane amount of information and it actually is but there's two massive caveats here.
The first one is that that context will go way faster than you think because if it's reading through um a bunch of skills that you set up for it or a bunch of code that can be tens of hundreds of thousands of tokens very quickly and then the other thing is uh large language models have what's called the dumb zone. And so you have the the little bit of context up front. Maybe I can just draw like a quick little analogy here. So if like this is Oh, that is a fat marker. Um, hold on.
Okay, I I give up already. I'm not going to try that. Okay, so you you have to imagine this with me here, but imagine you have a box that represents the the LLM's context window. You have that initial part at the start of the conversation up to the first, you know, 100 or 200,000 tokens where the large language model feels very sharp or at least it feels like it's at its best. Once the conversation surpasses that first 100 200,000 tokens, obviously it uh depends on the model when you reach the dumb zone, you get to the point where it just feels like it's overloaded with information and it starts missing things and making mistakes that seem so obvious to you or like the kind of thing where you're like, if I had had a fresh context here, like there's no way it would have made that mistake.
Like it writes a really bad line of code or it uh doesn't use a skill that you thought it should have known to use. right? Like that kind of thing if it's in the middle of a larger workflow. And so that that's why I say attention is scarce. Like don't don't get under that false notion that you don't really have to care about how much you give it. Like if you're trying to have it handle a larger workflow, you still have to you have to be very careful like what you give it up front versus what you allow it to discover when it actually needs.
And like that's one of the most important things with skills with Claude is you're giving it procedures and best practices, but it gets to decide like, okay, now I need to rely on this process or this information. So you're not just dumping a bunch of things up front. A lot of people do that. Like even with MCP servers back in the day, they would they would connect their like 20 MCP servers to cloud code and each one of them was was uh filling the context with like 20,000 tokens up front of information because it has like all the tool calls or the tools that come with the MCP server.
And so their large language model would always act super dumb. And so they're like, I'm using the latest opus. Like why am I getting terrible results? And it's really it comes down to just how much of the context is filled right away. Yeah. Oh my gosh. It drives me nuts. It It truly drives me crazy when you hear people blaming the model when it really is kind of a skills problem. And we see this at, you know, when you look at these studies and surveys too about business adoption where it really is these people either have not yet felt the ROI because they can't they don't know enough about how to use it truly, right?
And also people claiming that they have the skills to, but they're just not doing it. And like the adoption is then another problem. But I mean, obviously I'm not doing heavy heavy coding, building software and and apps. But, you know, we're doing some pretty cool things and I've seen some people do some really awesome things and it's just there's a lot of things like you know, if you kind of think about your your diagram that you had, you got the model in the middle, you got the agent hardness around that and then obviously a huge layer is what you put in there as well and the way that you manage your stuff.
And I think that the 1 million context window specifically for you know like let's just say Opus 4.8 at the moment. Obviously, it's great, but it definitely comes with a false sense of security with people now thinking that they have the million, but when, and I know this might be outdated by next month or two months away, but let's say right now when you're in cloud code, when do you typically do your compact or a session handoff and clear and when do you get out of there? Yeah. So, with Opus right now, it's usually around 250,000 tokens, and I feel like it gets into the dump.
That's my exact number, too. Oh, really? Okay. Yeah. Yeah. Good. Cool. So, and that, by the way, is like really subjective. Like, I'm not going to um bet million dollars on on like the on Boris Churnney or someone saying like, "Yeah, it's also 250,000." Like, quarter million is just clean, right? Yeah. It just it sounds good and it is like pretty accurate. I would say like Opus 4.7 was around like 200,000 and then like Sonnet 4.6 is like honestly probably only like 100 125,000. Um like it as you go to these smaller models like the dumb zone becomes a pretty small amount of context relative to like what it theoretically can handle.
You just never want to get to that point. So then with the dumb zone thing, I've also heard stuff about the model being really good at remembering things that are at the front and the very end and the middle is where it loses. So where does that play into the whole dumb zone conversation? Yeah. So basically that issue is just amplified the more you get into the dumb zone. Yeah. And um yeah, as far as like I mean we don't have to get into like the super technical details for how the attention mechanism works for LMS, but yeah, you can think of I mean like the analogy I always like to use is the needle in the hay stack problem.
Yeah, like if you have that like little piece of information that you want the agent to remember in the middle of a massive conversation, it's like trying to find a needle in the hay stack. Like you can't expect the model to just because of the way that large language models are engineered. Um you can't expect it to like always be able to pick out that little piece of information. 100%. Yeah. Yeah. I wish you could. That would be nice if there wasn't a such thing as a dumb zone. It would make it much more convenient for us to hand it massive tasks and let it just rip through things.
But a lot of the reason we have to create a harness and like a lot of the things I'm focusing on right now on my channel and just like generally what I'm building is creating harnesses that build a workflow that can bind multiple coding agent sessions together. And so basically it's like one model does the planning and then my orchestrator will like automatically take that handoff document like the plan and then feed it into another agent for implementation. And then when the implementation is done, it'll create like an execution report and then it'll hand that off to the next agent to validate things and do a code review.
And it might sound like like that's a lot of engineering and it is, but it's very necessary right now because if you're trying to do any kind of like real work for like production grade software or building an automation that's like critical for your business, you can't just throw the whole thing at a single cloud code session unless you can like confidently build it in that um that zone that you have before you get to the dumb zone. And most of the time you just can't do that or at least you can't really trust that's going to be the case because you never know how much it's going to have to iterate on something.
And so that's why I'm really like I guess you could say bullish right now on um harness engineering which is like building a the workflow that uh orchestrates many coding agent sessions to handle a larger task. And like a really basic example of that kind of harness is the Ralph loop. It went like super viral at the start of this year. Um so I feel I feel like even if you haven't heard too much about harness engineering you probably have at least heard of the Ralph loop. And that's like really like the foundation of that kind of harness, right?
Like the Ralph loop is stringing together multiple coding agent sessions. I I wish I had uh one of my diagrams up for this right now. I'll just have to explain it verbally but like you know basically you have the first cloud code session read in your larger spec for like a bigger automation you want to build and then um it'll define like the the task list like first phase is this second phase is this and then it'll have many coding agents handle one phase at a time but it'll like do it all automatically in a loop.
That's why it's called a Ralph loop cuz like agent one will do phase one and then it'll write up its little report like its handoff to the second agent that'll continue the work. And like the main reason the Ralph loop matters is because it you can't have one agent handle that larger task without it getting into the dumb zone and like you know halfway through phase two, right? Like you have to break things up. Yeah. So it sounds like from like a a high level view the idea or kind of the mindset that you've got like this assembly line and you have an agent doing something.
Each agent kind of does one thing really well and hands over their input to the next agent in a way where the agent has enough context to understand what has been done and what is left to do and what its current job is. Yeah, exactly. Yeah, assembly line is a a really good analogy and um I mean that that applies to a lot more than than just writing code. Um like like one example that comes to mind when I think about like cuz I I know that I've been talking about like coding as an example for a lot of things, but um I I work with a lot of companies that are in sort of like the like B2B side of things.
And when you're B2B, like you do a lot of um creating quotes, like estimates, right? like you have um construction company or uh like I've worked with companies in the print industry where like they'll have like a request for like all right make me like 100,000 flyers or whatever and like for those companies one of the biggest opportunities for them to use AI is to use something like Claude to help them take in a request and automatically create an estimate like a quote for how much that uh job's going to cost cuz that's like a really really laborious job like more than you would think Like when I when I've talked to these companies, like it's crazy how much work goes into that because they have to like take the request and they have to understand like how much labor goes into you know parts obviously like depending on the industry and then they have to do research on like the latest prices for things and making sure they're getting it from the right vendor.
Like there is so much that goes into that and so like that kind of thing u is it's like a really good example like nothing to do with creating code still using something like cloud code. You can use coding agents for this to go through that larger workflow of like looking at their inventory, looking at prices, comparing vendors, um, all based on what's going to be needed to accomplish that task like that remodel that the 100,000 flyers for whatever that request is from the other company and then creating that estimate and then understanding how the company works like what kind of padding they want on top of u based on the the labor and the cost for the parts or whatever.
Like there's a lot that goes into that. And so like that's the kind of thing where like you'd build a workflow where you have one agent that's going to research inventory, one agent that's going to look at prices and and compare prices for parts, and then one agent that's going to draft the PDF, and then maybe another one that's going to make it look good. I mean, I'm kind of stretching the example here, but you get the idea of like you you actually don't have just one agent handle the entire thing for something that big.
And you are going to be doing a lot of planning, right? like you're going to plan, you're going to have a validation at the end like what kind of calculations can I do at the end to make sure that like this job uh has the the margin that we want on it for example. Yeah. Yeah. And I think I think back to one of our biggest failures back when I was still kind of in the day-to-day of running an agency was that exact use case was having to look through tons and tons of examples, past quotes, past client work, past proposals, and and needing to generate these quotes with so many different factors that go into it.
And that was one of our biggest failures because me personally, I underscoped that build. And we went into it not realizing how much actually is necessary to get to an accurate quote. So that was a great lesson for me to learn not only about the importance of asking enough questions and scoping, but just in the way that you split up the work. And I think, you know, obviously Cole mentioned he's he's talked a lot of these examples have been kind of around coding, but I don't really do much coding. I mean, at the end of the day, these automations are code.
So yes, it's coding. Yeah. But I'm not doing like software. I'm not building products, but every one of these theories that we talked about in these mindsets and frameworks has, you know, directly applies to the knowledge work is kind of what I like to call it of of what I do on the day-to-day and what probably most of you guys need to do. That gives you an insane amount of leverage right away in cloud code. And I think that when you think about your job or you think about some of your responsibilities, it's not just one responsibility.
it is. You can drill that down into so many little subtasks like Cole just said like one agent does the research, one agent does the PDF generation, all these little strings of subtasks that flow up together to actually make the overall responsibility which might be 10 little tasks that get strung together. So when you can actually break down a process by just writing it down or or you know flowing it out on on a piece of paper, it makes things a lot more clear, right? Yeah. Yeah. And one thing I want to say here is that a lot of people they want to simplify it down to just using sub aents.
So like for this this larger workflow, what if I just have my main cloud code dish out a bunch of tasks to sub aents? Like that can work for some things. I do love using sub aents especially when I'm initially planning any kind of automation or or uh application, but it's hard to really make those communicate well with each other. Like we've talked a lot about handoffs here. A lot of times one agent when it's taking that next step in a workflow it has to understand the work that was done with the by the previous one whether that's work you know actually writing code or if it's just doing research or if it's pulling information from your CRM for example like it has to have that kind of handoff document and it's really difficult to um do that well with sub agents claude has tried their hand at doing something with agent teams so they they that's kind of like the step above sub aents where they can really communicate with each other but uh that is like really unrefined.
It's a really good idea but it's really unrefined and it's very expensive like tokenheavy. And so yeah like and that's actually what I'm working on. So there's a open- source project that I'm working on called archon and that's really the problem it's solving is how can we more like the word I use is deterministic like how can we build the AI model like build cloud code into a system instead of having cloud code trying to orchestrate everything because that's when it becomes difficult for communication and everything becomes very tokenheavy right so like the way that I like to put it is we want to um pick when the AI model works in a workflow instead of having it drive the whole thing.
Mhm. Yeah. Yeah. How do you make such an autonomous non-deterministic system as deterministic as possible? Pretty much. Yep. Yep. As deterministic as possible. I wish I could say make it deterministic, but that is never going to happen. Unfortunately, that is fundamentally impossible. I love it. Yeah. Completely agree with you there. Cool. Yeah. Um, so I mean really we we've talked about most of the other things I have in the diagram here. Like we've talked about verification, making sure that it's able to check its own work and um yeah, I mean like the the main thing here is we don't really care about what it does it f on its first pass.
If we build a system where it's able to iterate, that's all we really care about as long as it doesn't take billions of tokens to get to that final stage. But like when I'm whenever I'm using cloud code for something, I'm never optimizing for speed. I mean, at least like I don't want it to be unrealistically slow, but any kind of task I have for it, I don't really care if it's something that I have to uh have it work through for a half hour or an hour and a half. Like, I'll send off that request and then I'll just go to another Cloud Code session for whatever else I have to work on or I'll do something, believe it or not, without an agent for a little bit, like if I have to uh record a video.
Um, well, I mean, maybe I'm using an agent in the video, but you get the point. But anyway, like the the point is that I don't really care how long it takes because I just care getting the best results possible. Um, and so yeah, that's why like I I spend a lot of time engineering systems for coding agents to check their own work, whether it's browser automation for a website or they silly example I gave earlier, like a way for it to sort of like play a video game as a human would. And that's like a really fascinating problem for me to solve right now.
It's just like that verification layer at the end for a coding agent, which um also extends to things like security as well. And so like that's not something as interesting to talk about right now, but like security is pretty important to me. It's something that um vibe coders get very burned for. I mean, you hear those horror stories like at least once a month of uh you know like their superb base private or secret key getting leaked in their uh JavaScript files and things like that because they're just completely vive coding. Like I mean that's like the simplest example but yeah like that kind of part of verification is really important as well.
Yeah. And on that whole element of security and what could go wrong when you think about sort of like the permission layer that you're putting around your agents. I see a lot of false sense of security once again where people think that their prompts are a good enough permission layer when really that permission layer needs to be scoped keys or you actually can't touch this at all because I think I was talking to my team and we kind of got to this conclusion of if you have the mindset that anything that the agent can read or can touch it will like you have to assume that it will even if you never ask it to That assumption is what's going to save you from having your database deleted.
Yes. And that's funny you bring up that example specifically because I was just about to say like if you tell it never to to wipe a database, it's still going to do that. Mhm. Like there was a story that went viral um like a month or two ago was someone like really high up at Meta that had their database. I'm still not convinced that's real. I feel like they might have been I don't know cuz people get so much attention when they have stupid stories like that. But but anyway, conspiracy theories with Cole, right? Yeah.
But like it is it is definitely possible and I do know some stories of that actually happening just to a smaller extent. It just feels so weird that or it sounds so stupid that it's like their actual production database was wiped. But I mean even if you have a test database wiped, it can still be a bummer if that slows you down a lot. And so so yeah, it is super important. You never want to assume that just because you tell an agent to not do something, it never will. I mean it's the same thing like if you tell a kid to not do something they just might not listen.
I mean even even adults there actually recently something did happen to us which is kind of why we started talking about this. Okay. We had this this incident where the agent had the right intentions. It was trying to be proactive and it actually saw something on its task list but it misinterpreted it and it ended up sending an email to our entire list with a discount code and it was like not supposed to go out. So, we had to like change the code, update the page, we emailed out an apology. So, if you guys are on the email list and you got that, that's what happened.
But it's just like, you know, I wasn't mad at the person who was kind of responsible for the agent. It was just a really good opportunity for us to think about, okay, why did this happen? And, you know, she wrote up a case study. We sent it to the whole team and everyone was like, okay, that's a really, really good reminder of how careful you have to be. Because, you know, if you connect to an MCP server and you don't limit the permissions, it has everything, you know. Yeah. Yep. Now that's good. Yeah. The main way that I restrict actions from my coding agent is with hooks.
So like cloud code hooks is a really good way because um basically a hook and cloud code is a little piece of code that you can run whenever a certain event happens in the tool. So whenever you start a session, whenever you end a session, right before cloud code uses a tool, you can run some kind of code that does a security check. I mean there's a lot of other things but like I love using hooks for security. And so what you can do in cloud is every time it's about to invoke a tool like it wants to write out to a file or make some requests to the web, you can uh check against that command to make sure it's not trying to mess with a folder you don't want it to touch or run some kind of command you don't want it to run.
And there's a lot of different ways you can check for that that we don't have to get into right now. Um, but that's like one of my favorite ways to make sure it's like not reading my environment variables or it's not running a a delete command for a database. Um, and it it it's really hard to make sure you're you're covering all the loopholes cuz there's a lot of things that coding agents can do. To get around those kind of checks as well. A lot of people have false false sense of security around that as well.
So, you kind of have that like first false sense of security where it's like, well, I told it to never delete my database. And then you have the second level where it's like I block all delete SQL statements. But then there's that third level that you have to like make sure you're engineering for um like for example a coding agent. If you if you don't allow it to call the like delete like remove command to delete a folder, it can still write a script to do that. So it just has to do twostep like write the script and then run the script and then it's still able to remove a file or folder on your computer.
So it's I mean they're less likely to do that. So, it's still like you're getting there if you are at least have like that that second false sense of security, but like you got to be really safe. You got to it's it's actually a tough problem to solve. Yeah, man. AGI is it's it's scary. But I would love to see and maybe you've already got one out, but I would love to see a a Cole Hooks master class because I actually just recorded one and I don't use hooks that much to be honest. Like I really don't.
I think my my main hook that I have is just to give me a noise notification when it's done or when it needs me. But yeah, like I have underutilized hooks for sure. And I'm not sure if that's because they're mainly valuable when you're doing heavy coding, but I would assume that there's a lot of things that I could be doing in my day-to-day where hooks would be really good and I need to definitely look into a little bit more how I can be utilizing them. But anyways, yeah. Yeah, I I definitely should do a master class on hooks because there's a lot of ways that I use them.
Um, yeah, since we're on the topic, like one of the really interesting way to use hooks is you can use them to automatically suggest like you can have cloud code like automatically suggest ways to uh improve your AI layer, like make your rules better, make your skills better. And a lot a lot of tools like Hermes and OpenClaw, they kind of do this. I don't think they like explicitly use hooks, but like OpenClaw for example, every like 10 20 turns, I think it's configurable. It will like kind of compact your conversation and store it as a memory, right?
So that you have like the whole like daily log thing with the uh memory MD file. Like all of that comes from what's essentially a cloud code hook. Like so with the way I use cla code with my second brain is uh every time I have a memory compaction, which I try to avoid those I don't want to get that far into conversation. um or I end a session, it automatically creates a summary of the conversation, puts that in a daily log, and then I have a process every day. It's basically like cloud code dreaming where it's going to look at the daily log and then extract any like really important things to store and sort of like promote to my primary memory file.
Like here are the decisions that I've made recently or like the things that I'm actively working on and where I'm at with them. So like hook hooks actually drives a whole thing like this terminal that that this is like the second time it's popped up. Um, that's actually a hook that just fired there. So, I'm just like testing some other things. I forgot to turn it off, which is unfortunate, but actually now it made for good illustration. I had I had a hook run as I'm talking about it. I'm just testing something else right now.
Yeah. Just just yet another way to make these non-deterministic things as deterministic as possible. So, what do we have next after this verify the work section? Yeah. Yeah. So, really, this is the last thing. So, we've already talked about the the harness, but the the last and honestly probably the most important thing is the system evolution that I talked about just a little bit earlier. And and really the the mindset here is what like I this is out of everything that makes it so you're really directing cloud code instead of just being a user of it, the building the system is the most important thing.
So anytime there's an issue that comes up, instead of just fixing the issue and moving on, it's an opportunity for you to with the help of the coding agent, with the help of cloud code, figuring out like what could we make better so that this doesn't happen again. Like maybe there's a new rule that we can add in our claw.md or there's a new document that we can give it when we're in our planning process or there's maybe an update to our skill that we could make. And I'm being kind of general on purpose here because there's a million different ways that we can improve our system.
And so this is kind of like the example that you gave, Nate, where you had the email go out or at least it went out to way more people than it should have. And so you wrote up that report of like here's what happened. Here's what we can do better. And so it's kind of doing that but like for the agent so that going forward it has that rule so it it doesn't do that thing anymore. Like maybe it uh didn't run all the validation you wanted it to. So now you just like make sure that like that's a part of the rule where it's like this you make sure you don't forget this validation kind of as a silly example but that way every bug becomes a permanent upgrade.
So once you have this kind of system in place you actually almost welcome bugs like I want something to go wrong because then I can make sure it never happens again right like I almost have I almost feel kind of nervous when everything's going too well cuz then it's like oh shoot I have no way to like make my agent better right now. So it's it can kind of become nice. Yeah. Absolutely. Yeah. Just get better over time. I've got an interesting question for you. So, I completely agree. Every single time that you have a failure, you should look at that as data and an opportunity to improve the system.
Now, what what about before you get those failures? How do you think about to your best to the best of your ability finding those edge cases or predicting what edge cases might happen and trying to build in guardrails before the whole testing part? Well, it can never be perfect, which is why I lean on this so much. But generally, when you're looking out for edge cases, um, I mean, cloud code is actually pretty good at it. It's not going to cover like nearly all the edge cases, but even just asking it like how could this go wrong is a question that sometimes people are honestly like nervous to even ask, but it's a really good question once you're done with the implementation.
And like this is a part of like my code review skill that I have built in where it's like ask yourself what could go wrong here and then try to engineer a scenario where you're really testing that. Like if I'm building an automation where I think there might be an edge case where it doesn't handle this kind of input correctly, I'm going to as a part of my agent's code review have it like create that like it'll invoke the application with that input like a web hook or whatever and try to break it and see what happens.
And then if it does break then I mean that's obviously going to be like going back here a part of our verification where um it'll then uh address that thing and then do the tests again right like iterate like you find a problem fix it and then also retest right don't forget to retest because maybe you're fixant to actually address the problem. Mhm. Yeah. Well, I think, you know, something that I've realized after responding to YouTube comments, Q&As's in the community, chatting to you, and and just seeing what's going on when people are learning these kinds of tools is that you really at, you know, the simplest way to describe it, you just have to treat it like your best friend who is the smartest person in the world.
Meaning, you know, treat it like a mentor. It's not going to laugh at you if you ask it something stupid. You just need to be curious and you need to ask ask the questions that you are wondering in your head. And I think when you kind of get over that idea that it can teach you anything and it can for the most part, especially if you ask it right, it can help you figure out the majority of your problems when you have, you know, that that sort of uneasiness because maybe you don't understand what it did.
So that's like a huge mindset shift for anyone I've talked to that is like trying to get into it and doesn't understand it. if they, you know, maybe they text me a question or they drop a question. It's just like the response can a lot of times be, "Have you asked Claude code that?" I know. Yeah. I feel bad saying that, but like, yeah, it comes, it comes to that a lot. Um, yeah. Where it's like, no, you should have just usually how I can be more helpful is like telling them what to ask exactly, like give it give it this link, give it that thing, and then here's how I'd ask it.
But yeah, a lot of times it does come down to that. And I mean, you can't you can't just ask Claude for everything because of like the psychic fancy you mentioned earlier. Sometimes if you're asking it for its opinion like asking a large language model for its opinion is a really slippery slope. But but what you can what we can ask it for is to like understand how something works. Like that's when it can do a really good job. So, like going back to the example earlier, if you're not technical, but you want to try to actually be able to understand the automations and things that it's building for you, like that's a really good thing to ask it because it's not going to like there's no sick fancy there, right?
It's not just trying to appease you. It's it's just helping you understand. The way to appease you is to explain the thing, right? Um, so like that that's a really good case just trying to understand anything. And then like what we were talking about just here with verification like trying to find edge cases. If there's anything where there's like actual empirical data like there is a way to verify that like this this automation doesn't handle this input well. I mean there's no room for sickop fancy there. It's like it it either works or it doesn't and there's not really like any kind of gray area or opinion.
So if you think there might be an edge case or it thinks there might be it can test it and then it's it's black or white. Right. So I want to hear what you think about this because You briefly mentioned the agent teams earlier and I actually find myself using them quite a bit for mainly one specific use case and I want to hear what you think about it. So really the time when I reach for agent teams is when I am trying to help you know I'm trying to decide something but I don't want to just ask for cloud code's opinion like you just said.
And so what I'll do a lot is I'll spin up like a debate panel or like a war room. Nice. And I will say, you know, like one of you guys is a CEO, one is a beginner, one is a college student, and just like a bunch of different personas, sometimes even like seven. And I will just have them all do independent research, form their own opinions, and then I'll have them debate. And then I'll just be able to read the debate, and I'll be able to sometimes I'll say like, "Keep debating until you all come to some sort of consensus." But I do that quite a bit.
And that doesn't mean whatever the agent team spits out, I do. But sometimes it's just really great for me to read through all those opinions. But I want to see do you do you like that? Do you think that's a major flaw? Like what thoughts do you have about that? I do actually like that. Uh I've I've never done that before. You've never done that? You should definitely No. Yeah. I feel like tonight I literally got to try that. It's really fun. Yeah. I like that idea a lot cuz some something that I have done experimentation with that's sort of similar.
I call it adversarial development where basically after a cloud code finishes building something, I'll have a separate cloud code session u where I prompt it specifically to play the devil's advocate. Like I want you to be mean to the other CL code session to like really make sure that it's not just being happy golucky when there are actually some some problems that need to be surfaced and like that works really well. So just generally like pitting large language models against each other is a good idea. Um, I wish I had tried that before. So, yeah, I'll I'll give that a shot.
I think that that's that's a really good use for Asian teams because at that point at that point, it's like you're not relying on it to getting the perfect answer. Like they it's very tokenheavy and the communication is never really perfect. So, that's why I don't really recommend agent teams when you're trying to do like deep development or like building any kinds of like complex automations. But, when it's more like research and just like forming a consensus, I I think that it does really it would do really well for that. I'll try it out. Cool.
Yeah. Let me know if you try it out and what you think. But I've never really talked about that or made a video because I know people would go do it and then be like, "You just killed my 5 hour limit." Right. Yeah. Fortunately. So, um, how much of your limit does it use when you typically do it? I mean, on the 200 buck 200 bucks a month plan anywhere from, you know, 4% to to 10 sometimes. Like, you know, it's not too bad. It's not too bad. But, you know, if you if you say something like a don't stop until everyone agrees and they just keep going, then you could you could run into some some trouble.
But to close us off here, I have to ask, I just did a video about my favorite features in Cloud Code. And I prefaced that whole video and basically said this is not a list of the best features or, you know, the most used or the the most useful. These are basically the way that I use Cloud Code on my day-to-day, the ones that I like the most. And I I had like a numbered list of 12 at the top. But I would love to hear from you to put you on the spot. If you had like a top three based on because I'm I'm assuming we use very differently.
What would you say are like your top three favorites? Yeah. So, uh, Hooks is definitely maybe not like my favorite favorite, but probably the one that like most people wouldn't put in their top three. And that's because of what I've been doing with it for security and then the whole um integration with the second brain. So, it's able to basically like extract summaries and like remember things over time. We definitely need a hooks coal video. Yeah, I honestly I should just do that next week. Yeah, we definitely need it. Yeah. Okay. Yeah, thanks Nate. Yeah, so yeah, hooks.
Hooks is definitely number one. And then uh here because I I mentioned some of the things here like I mean really when I there's kind of two different sorts of cloud code features. you have like the components of the AI layer like rule, skills, hooks and then you just have like general capabilities of the harness like um agent teams and slash by the way and and um dispatch like dynamic workflows things like that right it's either like it's something that you use or it's something that you build on top of um sub aents would be probably like number two um just because like I said there's dangers to using sub aents but just using them to like sprawl out and research a ton of different things.
I use it for that all of the time and especially when I'm working on more complex code bases or building out larger automations. I'm using sub aents to basically like extract context from certain parts of my system, right? Like you're responsible for getting a grounding here of like how are we going to have to mess with the front end in this application? How are we going to have to mess with the back end? Um and then honestly probably like my number of one. So I guess like hooks would be two and sub agents would be three.
Probably my my number one is skills. Even though that's like super super cliche like Yeah. It's got to be skills. It's just the best. Yeah. Yeah. Like skills. Skills dictate everything. Skill. I have a skill for making this diagram. I have a skill for scripting my YouTube videos. I have a skill for building PowerPoints. I have a skill for um I mean you could literally make so versatile. Yeah. Yeah. It's just any kind of reusable prompt. You just make it as a skill. Mhm. And cloud code has done a really good job continuing to evolve just like the way you can parameterize things like they have like um path scope skills now and you can like set if this one is to be invoked only by you or if the agent can decide to do it as well.
Um and then like talking about like verification like getting back to here like having that browser automation skill so it knows how to use a CLI. Um, like that's a whole another thing is like the the skill plus CLI combination is just really really powerful because basically any platform or tool you want your coding agent to be able to use. It's either going to be an MCP server and like those are still good but honestly what I think is even better like more token efficient is having a CLI so it has access to your CRM or GitHub or whatever through the CLI and then the skill it tells it how to use that CLI and then more it more specifically like how you want it to use that like how do you want to this CLI to be integrated in your workflow.
So like that combination I'm leaning on that for everything like my AR archon tool I was talking about earlier like it is a CLI that has a skill that comes with it. So like if you want those more deterministic workflows where you get to pick like when do we have the LLM? When are we just running code then like you build that as a workflow and then now archon with its skill and its CLI becomes a tool that my second brain can call upon whenever it wants to dispatch one of those workflows to go handle a GitHub issue or run this automation whatever it is.
Very cool. I love it. Yeah, I love the list. My top three were skills was number one. Number two, I had status line. Oh, nice. I love just a quality of life thing, you know, just seeing the model, the effort, the window. I love that. And then my number three was routines. I love the uh the cloud routines. I I I just think it's so cool that, you know, I know I know we've got the SDK and whatnot, but it's just nice to be able to schedule something that is just my cloud code going. And yeah, I think those were my top three and I'm sure they'll they'll move around.
But yeah, I appreciate you sharing yours. It was interesting to hear. I'm glad that Hooks made the list, so I'll definitely be keeping my eye out for that video, though. Sounds good. Yeah. All right. What do you use uh routines for? Um well, I've got one going now that is a a trading bot. I had that originally going with an open call agent, but I switched it over to routines just to see how how it would do there. Um but then other things just like it's actually doing worse there. Yeah, it's doing worse there right now, but I don't know if it's I mean the market and everything as well, but I think OpenClaw had just out of the box it had better memory capabilities for that sort of thing.
So, yeah, makes sense. Um, but then, you know, just your other standard stuff like checking in on the team and giving me updates throughout the week and um end of week reports, just very very simple things, but nice to throw the routines in there. So yeah, I really appreciate you walking us through all this stuff today. Is there anything else that you want to leave everyone with? Uh, that's a good question. Yeah, I I I would say that no matter how technical you are, really what it comes down to is you could think of yourself like the product manager for cloud code.
So you don't necessarily have to describe how to build something, but it's important for you to shape the vision, right? Like what are we going to build? And then a lot of people are calling this intent engineering now. I just kind of another buzz word of basically like you want you want to give like the why like cloud code this is why we're building this thing because that really actually it ends up shaping the how quite well. So like that's a big part of your planning process that's going to take you far and like it it it seems kind of silly cuz you really start to get into sort of like the personification of claude code when you're you're telling it why you're doing things but like it actually makes a difference.
you kind of have to like get over yourself and be like it's kind of cringe to treat it like a person, but like that actually is how you get the best results. So just just do it. It actually helps a lot and good plans and good specs going into whatever you're building with claude or automating. Great tip. Great tip. I actually did just yesterday read in the Claude docs on how to prompt 4.8 that it said that it said to give it the context for why you're doing something and it will probably do a better job.
So that's awesome. Cole, where can people find you if they want to watch more of your stuff or get in touch? Yeah, so YouTube channel is the main place for me to put all my content. So, you can just search my name, Cole Medine. Uh, it is not spelled as you'd think. It's me d i n. Sounds like medin. Everyone says it wrong. But yeah, that's that's my YouTube channel. And then uh also doing a lot of posting on LinkedIn as well. Same name obviously. There we go. Oh yeah, I think for the first multiple months I knew you, I thought it was Cole Meen and I was saying I was saying Meden all the time, but nice.
Good to know everyone cleared up. It's Cole Medine. That's right. Yeah, it's a Swedish last name. And uh yeah, Nate, there's there's people that have said it way worse than you. Like someone called me Melden uh live on stage at a chess tournament in high school. Like it's it's…
Transcript truncated. Watch the full video for the complete content.
More from Nate Herk | AI Automation
Get daily recaps from
Nate Herk | AI Automation
AI-powered summaries delivered to your inbox. Save hours every week while staying fully informed.









