7 new OSS projects to whip your agents into shape…

Fireship| 00:06:14|Mar 12, 2026
Chapters10
Describes how AI agents in the terminal are competing for better code and how this shapes modern development.

Seven open-source OSS projects to tame AI agents and build smarter pipelines, from agent templates to testing prompts and memory management.

Summary

Fireship’s Code Report spotlights seven open-source projects poised to sharpen AI agent systems in 2026. Angelic optimism collides with the chaos of AI-assisted coding, as the host argues traditional handcrafted code is fading, and agents can do the heavy lifting. Agency provides ready-made agent templates for roles like front-end and security engineer, enabling quick assembly of product-worthy teams in Claude Code. Prompt Fu functions as a unit-testing framework for prompts, with automated red-teaming to guard against prompt injections and other vulnerabilities. Mirrorish launches a multi-agent prediction engine that simulates an evolving social network of agents reacting to current data. Impeccable is pitched as a front-end design toolkit to simplify UIs, using commands like distill, colorize, and animate. Open Viking tackles context management by storing memory, resources, and skills in a filesystem with a tiered loading system to cut token usage. The list also includes controversial tools like Heretic for “removing woke censorship” and Nano Chat for building a tiny, self-hosted LLM, along with Recall AI for cross-platform meeting tools integration. Fireship emphasizes practical outcomes, from reducing costs to shipping features faster, and invites developers to explore these projects in real life. The video blends hype with concrete references, urging viewers to experiment rather than overfit on theory.

Key Takeaways

  • Agency provides templates for common startup roles, letting you wire up multiple agents quickly in Claude Code.
  • Prompt Fu tests prompts across different models and can perform automated red-team attacks to surface security issues.
  • Mirrorish creates a multi-agent prediction engine that derives insights by simulating a social network of agents reacting to real data.
  • Impeccable offers 17 front-end commands (including distill and colorize) to streamline UI creation without image-deep UX work.
  • Open Viking stores agent memory, resources, and skills in a filesystem with tiered loading to dramatically lower token costs and memory usage.
  • Heretic removes model guardrails using obliteration to run uncensored models like Gemma from Google, enabling unfiltered outputs.
  • Nano Chat lets you train a small language model from scratch for about $100 in GPU time, giving you full control over the model.

Who Is This For?

Developers building AI-powered apps who want ready-made agents, safer prompts, and scalable context management. Ideal for startups exploring rapid prototyping with OSS tools to ship features faster while controlling costs.

Notable Quotes

""The hard truth is that we're not going back to the good old days of handcrafted code. And the only way forward is to embrace the chaos and learn how to enslave the machines.""
Sets the video’s provocative framing and tone about AI-enabled development.
""Agency, which is a free and open-source project that provides agent templates for basically every job role you would find at a startup... you can easily combine all these agents together in claude code.""
Describes the core benefit of the Agency OSS project.
""Prompt Fu lets you test different prompts with different models to optimize what's going to actually work best in your application.""
Defines the purpose of the Prompt Fu tool.
""The single most important skill of the modern Vibe engineer is managing context. If the context is garbage, the output is garbage.""
Emphasizes context management as a key theme.
""Open Viking... organizes an agent's memory, resources, and skills into the file system.""
Explains Open Viking’s filesystem-based context store.

Questions This Video Answers

  • how does agency OSS help assemble AI agents for startup projects
  • what is Prompt Fu and how does it test prompts across models
  • what risks does Prompt Fu’s red-teaming reveal for prompt injection
  • how can Open Viking reduce token costs when using AI agents
  • what is Heretic and why is removing model guardrails controversial for AI apps
Open SourceAgencyPrompt FuMirrorishImpeccableOpen VikingHereticNano ChatRecall AIAI Agents
Full Transcript
Every developer in 2026 has the same problem. You open your editor, you write one line of code, and suddenly a dozen different AI agents are arguing in your terminal about how to do it better. And if you're one of those weirdos like me who actually enjoys the craft of writing code, congratulations. You're officially living in the dark ages of slop overflow. Instead of grinding for hours and earning those sweet dopamine hits line by line, you now just tell the AI what you want and watch it hallucinate an entire codebase. Writing code isn't fun anymore. Layoffs are intensifying. And even the CEO of Replet said that nowadays knowing how to code is actually a disadvantage. Not having a coding experience is becoming an advantage. Building a product is more efficient than ever. Unless you're a stupid programmer who cares about stupid things like architecture and security. Coders get lost in the details. But he's absolutely right. The hard truth is that we're not going back to the good old days of handcrafted code. And the only way forward is to embrace the chaos and learn how to enslave the machines. In today's video, we'll look at seven different open- source projects you've never heard of that will help you whip your AI agents into shape and build highly effective slot pipelines. It is March 12th, 2026 and you're watching the code report. In the past, if you were an indie full stack developer, it meant you had to have skills on the front end, backend. You had to understand DevOps, security, UI, UX design, and a bunch of other BS. But nowadays, you don't need to learn all that stuff. You just need to hire the right agent. And a tool that can help you do that quickly is the agency, which is a free and open- source project that provides agent templates for basically every job role you would find at a startup, like a front-end developer, back-end developer, security engineer, a growth hacker, Twitter engager, and many others. You can easily combine all these agents together in claude code, which can more efficiently help you go from zero to an actual product without needing to directly implement every personality and skill. That's cool, but when you put these agents to work, how do you know your prompts are any good? Well, that's where Prompt Fu comes in. Another open- source tool that was just recently acquired by OpenAI that you can think of like a unit testing framework for your prompts. If you're using AI to build an app that lets the end user interact with AI, also half the battle is figuring out if you're using the best model with the best prompt. But prompt FU lets you test different prompts with different models to optimize what's going to actually work best in your application. On top of that, it can also do automated red team attacks to find out if your app is vulnerable to things like prompt injection, which is important because if your chatbot can be tricked into revealing your API keys by a 14-year-old on Discord, your app is probably going to fail. The failing sucks, but it's a lot easier to not fail when you can predict the future. And Mirrorish can help you do that. It's a multi- aent AI prediction engine that starts by extracting a bunch of data from the internet, like breaking news and financial trends. It then uses that data to create a digital world where multiple agents with independent personalities then react to and discuss the data almost like a miniature evolving artificial social network. Yeah, it might be in Chinese, but if you don't know how to speak Chinese yet, all I can say to you is low hola, you're falling behind. Like for example, if you want an app idea that's guaranteed to make you a billion dollars, you can spin up Micro Fish to analyze trends at the macro and micro level, then predict a strategy that's guaranteed to make you rich. It's really that easy. But here's the problem. You go to build that app and the UI has these dumb purple gradients like every single other vibe coded app. Well, to fix that, you need Impeccable, an open- source project optimized for front-end design. It's a skill that comes with 17 different commands that can help your UI not suck so much. Like, one thing that drives me crazy is that many AI chatbots create UIs that are way too complex. Well, with impeccable, we can use the distill command to simplify everything in one go. Then we can use commands like colorize to add our brand colors. Then slowly add in commands like animate and delight to make the UI look more unique and special. But perhaps the single most important skill of the modern Vibe engineer is managing context. If the context is garbage, the output is garbage. An open- source project trying to make your context better is Open Viking, a database designed specifically for AI agents. Instead of jamming everything into a vector database, Open Viking organizes an agents memory, resources, and skills into the file system. Not only is that a sane way to unify your context, but it also uses a tiered loading system which can dramatically reduce token consumption and save you a bunch of money. And it automatically compresses content and refineses long-term memory, which will make your agent smarter the more you use it. But depending on your project, you may not need an agent that's more smarter. You may need an agent that's more based. And that's where Heretic comes in. Virtually all models out there have guardrails that prevent you from doing fun things like cooking method in your shed or building high yield thermonuclear warheads. Heretic allows you to remove this draconian woke censorship using a technique called obliteration. This approach allows the tool to be completely automatic and doesn't require any expensive post training. All you have to do is take a smart yet highly censored model like Google's Gemma, run this tool from the command line, and now you have a model without the bubble wrap that will obey any command. But maybe that's not even enough to satisfy your unhinged ambitions. In that case, you may want to just build your own LLM from scratch. And believe it or not, you can actually do that with Nano Chat, which implements the entire LLM pipeline, including tokenization, pre-training, fine-tuning for chat, evaluation, and a web UI, so you can actually talk to it. What's crazy though is that you can use it to train your own small language model for about $100 in GPU time. It's not going to be Claw, GPT5, or Gemini, but at least it gives you a model that you have absolute control over. But the only thing that's a bigger waste of time than writing code by hand is going to meetings. And that's why you need to know about Recall AI, the sponsor of today's video. If you've ever tried building AI meeting tools from scratch, you know it's a nightmare trying to maintain separate integrations for Zoom, Google Meet, Microsoft Teams, and all the others. Recall solves this problem by giving you one unified API that works across every meeting platform. You can set up a meeting bot or desktop recording with a few lines of code like I'm doing here and it'll capture transcripts, recordings, and metadata in real time. The thousands of companies like HubSpot and ClickUp use it to handle all their meeting infrastructure. And most teams are able to ship recording and note-taking features to production in a few hours instead of months. Check out recall.ai/fireship to get $100 in free credits to try it out for yourself. This has been the Code Report. Thanks for watching and I will see you in the next one.

Get daily recaps from
Fireship

AI-powered summaries delivered to your inbox. Save hours every week while staying fully informed.