Firecrawl is DEAD - Gemini 3.1 Flash Lite is the New Web Scraping King

Income stream surfers| 00:10:54|May 8, 2026
Chapters20
Discusses lightweight agentic workflows for simple data extraction, emphasizing responsiveness and API cost constraints.

Gemini 3.1 Flashlight finally shines for lightweight web scraping, delivering near-perfect URL accuracy at a fraction of the cost with Gina-based workflows.

Summary

Income stream surfers' video with the host dives into Gemini 3.1 Flashlight and its surprising edge in web scraping. After an initial disappointment with the model, the creator demonstrates a fast, cost-effective approach that pairs Gina (which turns websites into HTML/markdown) with Gemini 3.1 Flashlight to extract structured data. The key payoff is astonishing accuracy: 96/96 URLs returned as HTTP 200, and 36/36 image links are real, all while keeping costs unusually low. The clip also hints at the model’s new audio input capabilities and potential video support, expanding use cases beyond text. Grounding—using Google search to inform results—remains a major cost factor, so the presenter cautions limiting queries to avoid skyrocketing expenses. Finally, Harbor SEO.AI is pitched as the vehicle for deploying this scraping approach at scale, with the creator sharing real-world metrics like 363 pages published and 160,000 impressions to back up the tool’s value. This is a compelling case for rethinking low-cost, high-speed scraping pipelines in 2024+.

Key Takeaways

  • Gemini 3.1 Flashlight achieved 96/96 real URLs (HTTP 200) in a Gina-based scraping workflow.
  • All 36 image links tested were real, demonstrating high accuracy for visual data extraction.
  • Grounding costs can explode (from $10 to $150 per day) if not carefully managed, so limit queries to 1–2 per run.
  • The combined Gina + Gemini 3.1 Flashlight workflow offers fast, inexpensive LLM scraping suitable for repackaging into articles or structured data.
  • Harbor SEO.AI is positioned as a cost-effective platform to operationalize LLM scraping at scale, with real-world metrics like 363 pages published and 160,000 impressions.

Who Is This For?

Essential viewing for developers and SEO teams exploring cheap, fast web scraping pipelines using Gina and Gemini 3.1 Flashlight, and for Harbor users evaluating scalable LLM scraping in production.

Notable Quotes

"96 out of 96 uh URLs that were returned were 200."
Shows perfect hit rate in the tested URLs, underscoring accuracy.
"36 out of 36 URLs returned HTTP 200."
Reinforces the reliability of the scraping approach.
"the cost is 153 uh but not only that, the main reason I actually use this model is because it's super super fast."
Emphasizes efficiency and cost considerations driving the choice.
"this model 3.1 Flash Light is even faster."
Highlights performance advantage over prior iterations.
"it's input output, as you can see here, is 0.25 25 and 150 output"
Mentions pricing structure and batch considerations.

Questions This Video Answers

  • how does Gemini 3.1 Flashlight compare to other scraping models in 2024
  • can Gina turn any site into markdown for LLM scraping
  • what are the cost pitfalls of grounding in large-scale scraping
  • how to implement Gina + Gemini 3.1 Flashlight for cheap web scraping
  • what is Harbor SEO.AI and how does it integrate with LLM scraping pipelines
Gemini 3.1 FlashlightGina (website-to-markdown/HTML tool)LLM scrapingGrounding cost managementHarbor SEO.AIWeb scraping at scaleHTTP 200 accuracyImage extractionAudio input features
Full Transcript
So look at this guys. As it says right here, designed for lightweight agentic workflows, simple data extraction and applications where responsiveness and API costs are the primary constraints. This is super interesting. I'm going to show you why in this video. So as you may or may not have seen, Google just dropped Gemini 3.1 Flashlight. And at first I was not bothered about this model. Mainly because most of the things I do is around coding, right? So, whenever a new model comes out, this is a test that I did of 3.1 flashlights and it was absolutely terrible. This is open code trying to use flashlight. The results were awful, guys. Okay, but there is actually a use case for this model that I'm going to talk about in this video. So, you guys may or may not know that I'm a big fan of Flash models for a very specific reason. This right here, responsiveness and API costs. So just as proof of that, this is um Harbor's Gemini spend, which this is not our only API cost. I have other API costs as well, but this is basically what we spend on three flash. This is currently three flash, right? The cost is 153 uh but not only that, the main reason I actually use this model is because it's super super fast. But this model 3.1 Flash Light is even faster. Okay. So, I'm going to show you how you can use 3.1 flashlight for web scraping or why you should use 3.1 light for web scraping. So, I did a very very interesting experiment which I'll show you in just a second. But basically, I I can just show you here as well. I tested two men. Which is, you know, a site that many people will be familiar with with Gina reader. Right? So, what I did was I fed two men.it to Gina. If you don't know what Gina is, it's basically just a it it turns a website into HTML, right? So if I press get response here, you can see it turns the website into in this case markdown, right? So if you want to extract links or images from a website, then you can use Gina to basically turn your or other people's websites into markdown with images inside it. so that you can extract those images using LLMs. This is known as LLM scraping. Right? So basically what I do is you get some URLs, you feed them into Gina, you get the clean markdown and then you use Gemini 3.1 Flashlight to extract JSON structured data. This is LLM scraping in a very basic format. And it's actually mainly the the the main thing that Harbor does as well as other things is it uses LLM scraping to find people's products, people's information, etc. and put them into the blog posts that we write. Okay, so I actually know quite a lot about this stuff. Now, the first test that Claude did, it did something that I It just does this every single time. I've done hundreds of these tests, right? Uh trying to test different models. it always cuts the kind of context off at 80,000. If you don't feed the full page to Gemini, it will not be able to extract things. So, at first I thought this model was really really bad because it it basically cut down the HTML at like the header. So, it was just showing the header to the model. However, when I realized this, I fixed this and I did the real test here, which unbelievably, right, and this is super super unbelievable. 96% of the URL, oh, sorry, 96 out of 96 uh URLs that were returned were 200, right? As in 200 um code. So, they they are real, right? 36 out of 36 URLs returned HTTP 200. You guys, I this is crazy. Honestly, for the pricing of this, that is absolutely absurd. You can scrape things using Gina Plus uh Gemini 3.1 light flashlight for so cheap, right? So, it's input output, as you can see here, is 0.25 25 and 150 output with also having some discounts on batch pricing and also on cash. So super super interesting the amount of information you can scrape for this price you would be very very surprised and when it has 100% accuracy which I mean through the kind of tests that I did this had 100% accuracy. Yeah. So, this model was just released on the 7th of May, the the official version of this model. It also accepts audio, which I didn't know. That's really really interesting to know. So, you can um you can use this to transcribe audio as well, which is super super nice. Probably with pretty good accuracy and very very quick as well. It looks like you can also input video. I'm not sure about this. I haven't looked into this, but if that's true, that's very very interesting. So this is probably the one of the best models for just general scraping. Now one thing just to note guys is grounding. Okay, grounding is basically using Google search as perplexity does for example and we do a lot of this inside harbor as well. It's probably our main expense to be honest with you. The problem with grounding is it's per query. So just bear this in mind. Okay. If you're using grounding, you need to tell the model to only do one query or two queries because otherwise it'll do 30 queries. And my god, the costs are insane. We went from spending like $10 a day to $150 a day, which luckily I noticed and stopped. But just if you're doing grounding, just be extremely careful, okay? It's probably not worth it. It's probably much better to do just traditional LLM scraping instead of grounding. However, we do do grounding for certain things because I think it works really well for things like keyword research and things like that. Yeah. So, this is kind of the cost at scale. Let's say you scrape a,000 pages. I don't like the word product here. It shouldn't be here. So, just 1,000 pages. 1,000 pages at 8 bucks with batch API being four bucks is really really good. Um, yeah. I mean, the the price on this is crazy. How cheap intelligence has got is is really really interesting. Now I just want to return to this quickly. 96 URLs were real here and 36 out of 36 images were also real. Right? So I just want to reiterate this point because these are not easy links. So you can see the test here. I did this with Claude code. Uh it basically just said set up um the scraping system um and then test the URLs to see if the URLs that are given back from the LLM were real. Right? This is the actual test that I gave it. This was a really really good test for me personally. For context, the likelihood that GPT 5 Nano will hallucinate one of these links is about 50%. Right, which was our old method of doing this. I actually I upgraded to Gemini 3 flash because Gemini uh GBT 5 Nano was just too inconsistent and its hallucination rate was way too high. Even with Gemini Gemini 3 flash, the hallucination rate was about 75%. So to have a 100% real URLs here is extremely interesting at a very very good price and with a million context. This is probably the best new scraping model on the market. So I'm probably going to go and put this in Harbor now. Although to be honest with you, I don't actually use LLM scraping like this anymore. The way that I do LLM scraping is the basically so if we just go back to this the page URL is fed to Gina right and then the instead of extracting structured data for images what it does is it like adds like a number next to the link and then I just use code to like extract the link just because I couldn't get what I was trying to get which was 100% accuracy with um what any of the models that I used basically. So I mean to get 100% accuracy with this model which is extremely cheap is really really interesting. But yeah guys this is definitely a massive upgrade for LLM scraping. Uh you should go and check this out. It's extremely cheap. It's extremely fast and LLM scraping is something that you can basically resell. So what you actually resell is the JSON structured data or in Harbor's case, what we then do is we take that structured data and we write an article, right? So we are selling the articles, but it's the we may as well sell the structured data, right? We're just repackaging the structured data into an article. I really cannot write with a touchpad. I don't know why I even tried. But yeah, with that with all that being said, guys, let me just quickly talk about our sponsor, which is me, Harbor SEO.AI. I'm trying to make this into the best SEO tool on the market. It's extremely cost- effective. It only costs 29 uh euros a month, right? And yeah, I'm really really trying to make this an amazing tool. 363 pages published, 160,000 impressions, 1,500 clicks. These are real numbers from real search consoles anonymously gathered to just display on the pricing page basically. I've recently released agency. I've got the writer into probably the best spot it's ever been in. I mean, we even made eyesuit happy, which I thought was impossible. Um, we managed to actually make consistent content that when they looked at it, they just said this is I mean, they were they were saying they were basically saying they are the most beautiful articles they've ever seen. So, um, I'm really really happy with Harbor and where it is. So, go and check it out in the description. There's a link in the pin comment and in the description. You can get a free trial. Go and check it out today. But all that being said, guys, I'm going to go and put this inside Harbor and see what the effect is on cost mainly. Um, but like I was saying before, most of the cost actually comes from the grounding and not from the LLM scraping itself. So, I don't think this is going to cost me or save me that much money, but it might save other people money. And also just generally this is probably the best LLM scraping model on the market now. So shout out to Google for actually giving me something useful and also for you guys this is also a useful model. With that being said guys thank you so much for watching. If you are watching all the way to the end of the video you're an absolute legend and I'll see you very very soon with some more content. Peace out.

Get daily recaps from
Income stream surfers

AI-powered summaries delivered to your inbox. Save hours every week while staying fully informed.