Google, my favorite tech agency for causes precisely as this one, has executed it as soon as once more. It’s got the worldwide neighborhood of builders supercharged with one new product. This one is named Gemma 4.
What’s the hype? Properly, a totally open-source mannequin that competes with AI fashions 20 instances its measurement. And this one isn’t simply your common AI chatbot. It has been custom-built for superior reasoning and agentic workflows. That means, AI handles your total duties, in your system, even with out the necessity for the web.
Your private LLM, if you’ll.
In fact, that was sufficient to get AI-savvy folks internationally to attempt their arms on it. And the outcomes are nothing in need of revolutionary. Right here, I share a listing of a number of the high such initiatives, that are easy but efficient use circumstances that individuals have managed to convey to life, all due to Gemma 4.
However earlier than we dive in, here’s a little concerning the new AI mannequin by Google for these unaware.
Gemma 4: An Open-source AI Revolution
As I discussed, Gemma 4 is not only one other mannequin you open for chat and shut 5 minutes later. Google calls it its most clever open mannequin household but. And all this firepower is supposed to assume by means of multi-step duties, work with instruments, generate code, and run by yourself {hardware}. That alone is sufficient to make the builders sit up.
After which comes the half that actually fuels the hype: Google says Gemma 4 delivers unusually excessive intelligence for its measurement. It is available in 4 sizes, with the bigger fashions rating among the many high open fashions on the earth whereas competing with methods far greater than them. Meaning builders are all of the sudden getting a mannequin that feels highly effective, versatile, and really usable for actual initiatives. Open, multimodal, agent-ready, and lightweight sufficient to run in locations the place frontier AI often doesn’t. That’s precisely why Gemma 4 is beginning to really feel much less like a mannequin launch and extra like a shift.
You’ll be able to study all concerning the new Gemma 4 right here.
For now, we will have a look at how builders around the globe are placing the succesful mannequin to make use of.
1. Run Claude Code with Gemma 4 for Free
This was a correct “wait, you are able to do that?” second for me.
A developer confirmed methods to use Claude Code coding workflows with Gemma 4 working regionally in your machine. Which mainly means you get Claude’s coding assistant by yourself laptop computer, with out paying per immediate and with out consistently relying on the cloud. The setup makes use of Ollama to run Gemma 4 regionally, and the tweet frames it as a beginner-friendly course of that takes roughly quarter-hour on a laptop computer.
Why is that this cool? As a result of it turns Gemma 4 from “one other AI mannequin launch” into one thing immediately sensible. As a substitute of treating AI like a chatbot tab you open and shut, you’ll be able to plug it right into a coding workflow and let it assist with writing, fixing, and understanding code proper in your system. And sure, the entire enchantment right here is precisely what received folks hyped about Gemma within the first place: no subscriptions, no API key drama, extra privateness, and way more management.
What is occurring right here?
In quite simple phrases:
Claude Code = the coding workflow/interface folks like
Gemma 4 = the mind offering the coding assist
Ollama = the engine that runs the mannequin regionally in your laptop computer
The fundamental setup appears like this
set up Ollama
obtain a Gemma 4 mannequin suited to your machine
set up Claude Code in VS Code
join Claude Code with Gemma 4 and begin coding regionally
2. Run Gemma 4 on an iPhone, Utterly Offline
Once I stated ‘your private LLM’, this was the Gemma 4 undertaking I used to be referring to.
Think about an AI mannequin in your pocket. No web, no cloud connection, and no month-to-month charge. Sharbel on X confirmed simply that – Gemma 4 working instantly on an iPhone. Meaning the AI mannequin is just not sitting on some distant server ready on your request. It’s proper there on the cellphone, dealing with duties regionally like a pocket-sized mind.
The stream is straightforward and wild on the similar time:
obtain Regionally AI
discover Gemma 4 below the ‘Handle Fashions’ possibility
obtain it and use it for on-device reasoning and duties
That opens the door to all types of non-public AI experiences. Assume non-public assistants, offline research instruments, native observe evaluation, and even agentic workflows on the go. And that’s precisely why Gemma 4 has folks so excited.
3. Run Gemma 4 on a Nintendo Change
In case your native LLM in your iPhone wasn’t sufficient, right here comes Gemma 4 working on a Nintendo Change. Sure, an precise gaming console. maddiedreese shared Gemma 4 working regionally on the machine at round 1.5 tokens per second. That pace is clearly not constructed for high-pressure workplace work, however that isn’t the purpose right here. The purpose is {that a} trendy multimodal, agent-ready mannequin can now be squeezed into locations the place AI was by no means actually anticipated to reside.
And that’s precisely why this use case hits so arduous. The workflow itself is straightforward in spirit:
take a compact Gemma 4 mannequin
optimise it sufficient to run on weaker {hardware}
load it onto the Change regionally
use the console as a tiny offline AI machine
Gemma 4 is making one factor very clear right here: highly effective AI is leaving the cloud and coming into private units in all types of weird, great methods. At this fee, builders are mainly treating each display round them like a possible dwelling for an LLM.
4. Use Gemma 4 for Offline Audio Transcription on a Telephone
That is the place issues begin getting critically enjoyable. ai_for_success confirmed Gemma 4 E2B getting used for audio transcription on a Pixel 10 Professional. In plain English, meaning your cellphone can hearken to a brief audio clip and switch it into textual content, regionally, without having a giant cloud setup that sends each request backwards and forwards. The submit notes that it helps as much as 30 seconds for now, which can sound small, however actually, even that is sufficient to present the place that is heading.
Why is that this thrilling? As a result of it takes AI out of the “chatbot field” and turns it into one thing your machine can do in the actual world. The stream is superbly easy:
file or feed in a brief audio clip
let Gemma 4 E2B course of it on-device
get the spoken phrases again as textual content
all with out relying totally on the web
Think about the probabilities it opens up: fast note-taking, voice memos, assembly snippets, lecture highlights, and even simply changing your random burst of genius into textual content earlier than it disappears. It’s not a full-blown studio transcription but. However as a glimpse of what small, native AI can already do on a cellphone, that is completely wild.
5. Flip a Mac Studio into Your Personal Zero-Token AI Workhorse
This one is pure power-user vitality. jessegenet shared Gemma 4 31B working on a Mac Studio, hooked as much as OpenClaw, and the road that actually jumps out is that this: “$0 in token bills now.” That’s the dream, isn’t it? A critical native AI setup that may chat, purpose, and run workflows by yourself machine, feeling that fixed token-ticking behind your head.
What is occurring right here is definitely quite simple:
Mac Studio = the muscle
Gemma 4 31B = the mind
OpenClaw = the workflow/operator layer
Outcome = a neighborhood AI assistant that feels way more like your personal system than a rented chatbot
Why that is such a giant deal: most individuals expertise AI by means of an internet site or app. This setup flips that fully. As a substitute of going to the AI, the AI lives with you, proper in your machine. Prepared for longer chats, {custom} workflows, non-public work, and repeated use with out per-prompt pricing stress from a hosted supplier. That’s when Gemma 4 begins wanting much less like “one other mannequin launch” and extra like the start of a correct private AI workstation.
6. Flip Gemma 4 right into a Actual-Time Imaginative and prescient Assistant in Your Browser
This one is very like a full-time AI assistant that’s approach smarter than the usual AI chatbots you employ day by day. measure_plan constructed an app that mixes Gemma 4’s imaginative and prescient capabilities with Roboflow’s RF-DETR. The result’s a browser-based setup that may have a look at what your digital camera sees and make sense of it in actual time. We are able to study from the submit that Gemma handles the precise understanding, whereas RF-DETR does the first-pass object detection. In different phrases, one mannequin spots what’s within the body, and the opposite explains what’s going on.
That combo opens up a variety of enjoyable prospects actually quick:
RF-DETR finds the objects within the scene
Gemma 4 interprets these objects and provides context
the entire thing runs reside within the browser on a neighborhood machine
The super-cool undertaking exhibits Gemma 4 doing far more than chatting or coding. It’s beginning to act like a visible mind. Level your digital camera someplace, and the system can start figuring out what’s there, following the scene, and describing it again within the language of your selection. Now think about such a system as an assistive software or a sensible digital camera app that helps information you thru a course of that’s fully new to you. The probabilities are merely wild.
7. Make Gemma 4 Deal with Actual-world Duties to Begin Your Day
Think about an AI that checks your calendar at the beginning of the day, after which sends messages that have to be despatched to your contacts, with out you even typing a phrase. OsaurusAI created precisely this in a undertaking with Gemma 4 26B. Working regionally at round 50 tokens per second, the AI was in a position to learn a calendar and textual content contacts. That may be a massive bounce from “AI can chat” to “AI can really do issues for me.”
The concept is straightforward:
Gemma 4 does the pondering
your apps like Calendar and Messages present the info
the AI acts like a correct assistant on high of them
Why this issues: as soon as a mannequin can transfer this quick regionally, it stops feeling like a demo and begins feeling like an actual private agent. The sort that may test your schedule, discover the proper particular person, and show you how to take motion immediately. All of this, with out sending each little request to the cloud.
8. Make Gemma 4 Audit an Total Code Repository on a Tiny Setup
That is the sort of demo that makes builders grin. UnslothAI confirmed Gemma 4 E4B (4-bit) finishing a full repo audit by executing Bash instructions and power calls regionally. The wild half is that it reportedly runs on simply 6GB RAM. That isn’t “AI writes one helper perform.” That’s AI stepping by means of an actual codebase, utilizing instruments, and serving to examine the entire thing, identical to a mini coding agent by yourself machine would.
The setup is superbly easy:
run a compact Gemma 4 mannequin regionally
give it entry to fundamental instruments like Bash
let it examine recordsdata, transfer by means of the repo, and purpose over the code
get a code audit without having an enormous cloud setup
This one is way more relatable because it exhibits Gemma 4 doing precise developer work, not simply code autocomplete cosplay. And the truth that it will possibly occur on such modest {hardware} is precisely what makes Gemma 4 really feel so disruptive. Highly effective AI is one factor. Highly effective AI that matches into extraordinary machines is a revolution in itself.
9. Flip Gemma 4 into an Precise On-Gadget Agent with Agent Abilities
This one is a helpful characteristic that Google itself launched together with the Gemma 4. Omar Sanseviero, who’s the Developer Expertise Lead at Google DeepMind, introduced Agent Abilities for Gemma 4 on X just lately. A lot because the title suggests, Agent Abilities work precisely like the abilities now we have seen with Claude or different AI fashions. It’s an Android app expertise launched with Gemma 4, the place you’ll be able to import totally different expertise and let Gemma 4 E2B purpose by means of and use them instantly on-device. Meaning your cellphone is not only chatting again. It’s beginning to behave extra like an actual native agent.
What makes this thrilling is how easy the concept is:
load expertise into the app
let Gemma 4 perceive the duty
have it use these expertise step-by-step
all regionally on the machine
Agent Abilities takes Gemma 4 past chatbot territory and into one thing way more helpful: AI that may really do issues in your cellphone, not simply discuss them. And since it runs on-device, it additionally pushes the entire “private AI” thought a lot nearer to actuality.
10. Make Gemma 4 Flip Photographs into Songs
I’ve stored probably the most enjoyable for the final. As soon as you’re executed utilizing the brand new Gemma mannequin for all of your work, it’s time to have some enjoyable with it. ai_for_success, in his X submit, shares methods to just do that. He constructed an agent ability that lets Gemma 4 E2B name Lyria 3 and generate songs. Sure, precise songs. The submit says it really works for image-to-song, which implies you’ll be able to present the system a visible, let Gemma perceive it, after which have it set off music era round that vibe.
The stream is tremendous easy:
give it a picture
let Gemma 4 perceive what’s in it
use the agent ability to name Lyria 3
get a track impressed by that visible enter
Why is that this such a cool last instance? As a result of it exhibits Gemma 4 doing what all nice agentic fashions ought to do: not simply reply prompts, however assist create one thing new. One minute, it’s studying pictures. The subsequent minute, it’s making music out of them. That may be a inventive that exhibits a variety of human contact to it.
Additionally Learn:
Conclusion
These initiatives present precisely why Gemma 4 feels greater than a traditional mannequin launch.
From coding assistants and offline iPhone LLMs to video understanding, repo audits, agent expertise, and even image-to-song era, builders are already stretching it in all instructions. Sensible, or for pure enjoyable, Google’s new launch has change into the go-to AI mannequin inside days of its launch. And all of this, for one very potent purpose – it runs regionally, all at no cost.
Such widespread traction early on is often the clearest signal {that a} product has landed properly. Individuals don’t simply check it, they begin constructing with it. Extra importantly, Gemma 4 is exhibiting what the subsequent section of AI may appear to be: extra private, extra native, extra controllable, and much much less depending on big cloud setups for any of your initiatives.
In fact, these are the early experiments. The true wave of Gemma 4 initiatives could solely simply be getting began. So be sure to keep tuned to this area for extra such updates on the brand new Gemma mannequin.
Login to proceed studying and luxuriate in expert-curated content material.
Preserve Studying for Free


