5 Agent Projects to Build with Gemini 3.5 Flash
Google dropped Gemini 3.5 Flash at I/O 2026, and the numbers are eye-popping: it beats the flagship Gemini 3.5 Pro on coding and agentic benchmarks while running four times faster and at half the cost. But this isn’t just another benchmark victory lap — it’s a model purpose-built for the agent era. Developers who treat it as “cheaper auto-complete” are missing the point entirely. Google’s own Antigravity 2.0 platform demonstrates what this shift looks like at scale: agent-first development where models plan, execute, and iterate autonomously. But you don’t need Antigravity’s managed infrastructure to leverage Gemini 3.5 Flash. Here are five projects you can build this weekend that take advantage of the model’s speed, cost profile, and agentic reasoning.
Project 1: Automated Code Review Agent with Git Integration
Time to build: 3-4 hours
What you’ll learn: Agent-loop architecture, Git API integration, diff parsing
Build a bot that watches your GitHub PRs, fetches the diff, and runs Gemini 3.5 Flash against it with a structured prompt. The model’s fast inference means reviews complete in under 3 seconds even for 500-line diffs.
import google.generativeai as genai
from github import Github
genai.configure(api_key="YOUR_API_KEY")
model = genai.GenerativeModel("gemini-3.5-flash")
def review_pr_diff(diff_text: str) -> dict:
prompt = f"""Review this code diff for:
1. Security issues (SQL injection, XSS, auth bypass)
2. Performance regressions (N+1 queries, unbounded loops)
3. Breaking API changes
4. Missing error handling
Return JSON with findings per category. Be specific about file and line.
Diff:
{diff_text}"""
response = model.generate_content(prompt)
return parse_json_response(response.text)
The 4x speed advantage means you can run this synchronously in CI without slowing down builds. At $0.0375 per million input tokens (half of Pro), reviewing 100 PRs/day costs under $2.
Project 2: Competitive Research Agent with Scheduled Web Crawling
Time to build: 4-5 hours
What you’ll learn: Tool-augmented agents, multi-step reasoning, structured output parsing
Build an agent that monitors competitors, crawls their changelogs, and produces weekly intelligence briefs. Gemini 3.5 Flash handles the agentic loop — planning which pages to visit, extracting structured data, and synthesizing insights — without the latency penalty of larger models. The architecture:
- Cron-triggered agent wakes up with a list of competitor URLs
- Agent decides which pages to crawl (changelog, pricing, docs)
- Fetches content, extracts structured data with function calling
- Compares against last week’s crawl to identify changes
- Generates a markdown brief with “what changed” and “why it matters”
tools = [
{
"name": "fetch_url",
"description": "Fetch content from a URL",
"parameters": {"url": "string"}
},
{
"name": "compare_versions",
"description": "Compare two snapshots and return differences",
"parameters": {"old": "string", "new": "string"}
},
{
"name": "save_brief",
"description": "Save the generated brief to disk",
"parameters": {"content": "string", "filename": "string"}
}
]
# Gemini 3.5 Flash handles the agent loop natively
chat = model.start_chat(enable_automatic_function_calling=True)
chat.send_message("Monitor competitors X, Y, Z for new features this week")
Project 3: Personal Finance Audit Agent
Time to build: 5-6 hours
What you’ll learn: Sensitive data handling, multi-format parsing, privacy-first architecture
Build an agent that ingests your bank statements (PDF, CSV), categorizes spending patterns, identifies subscription waste, and generates a monthly savings report. Gemini 3.5 Flash’s instruction-following is precise enough for financial categorization without hallucinating amounts. Critical design choice: this agent runs locally or on a private server. Gemini 3.5 Flash’s efficiency means it can run on consumer hardware with quantization, keeping your financial data off third-party servers. The agent flow:
- Parse PDFs → extract transaction list
- Categorize each transaction (groceries, subscriptions, transport, etc.)
- Flag: duplicate subscriptions, price increases, unused services
- Calculate: “You spent $1,247 on food delivery. Cooking 3 more meals/week saves ~$280/month.”
- Generate actionable report in markdown
Project 4: Documentation Health Monitor for Your Codebase
Time to build: 2-3 hours
What you’ll learn: Embedding-based retrieval, stale-doc detection, CI integration
Connect your repo’s /docs folder to Gemini 3.5 Flash and let it detect stale documentation automatically. The agent compares doc claims against actual code, identifies outdated examples, and proposes edits.
def check_doc_health(doc_path: str, code_path: str):
doc = read_file(doc_path)
code = read_file(code_path)
prompt = f"""Compare this documentation against the actual code.
Documentation:
{doc}
Actual code:
{code}
Identify:
1. Outdated function signatures
2. Missing parameters
3. Examples that no longer work
4. Deprecated API references
For each issue, provide the corrected documentation snippet."""
return model.generate_content(prompt)
Run this in CI on every PR. Gem 3.5 Flash’s speed means doc health checks add ~4 seconds to pipeline time — negligible compared to the cost of shipping broken docs.
Project 5: Multi-Source News Aggregator with Bias Detection
Time to build: 6-8 hours
What you’ll learn: Parallel agent orchestration, source-comparison reasoning, fact-verification patterns
This project deploys multiple Gemini 3.5 Flash agents in parallel — one per news source — then has a “synthesizer” agent compare coverage and flag discrepancies. The result is a daily brief that tells you not just what happened, but what different sources disagree about. Architecture:
- **Crawler agents** (3-5): Each assigned to one source (Reuters, TechCrunch, Ars Technica, etc.), extract top 10 stories
- **Analyzer agents**: Classify each story by topic, sentiment, factual claims
- **Synthesizer agent**: Compare coverage across sources, flag “Reuters says X, TechCrunch says Y,” produce consensus brief
The key insight: Gemini 3.5 Flash’s 4x speed lets you run 5 parallel agent streams in under 8 seconds total. With Gemini 3.5 Pro, this would take 30+ seconds and cost 2-3x more. The agent era demands models that are fast enough to not bottleneck the orchestration layer.
Why Speed Matters for Agents
The shift from “chat” to “agents” changes the cost calculus entirely. In a chat interaction, 500ms vs 2,000ms latency is a UX difference. In an agent loop with 8 sequential tool calls, that’s 4 seconds vs 16 seconds — the difference between “interactive” and “go get coffee.” Gemini 3.5 Flash’s speed enables multi-step reasoning patterns that were previously only practical with local small models or expensive hosted APIs. Combined with Google Antigravity 2.0’s agent orchestration layer, the agent era is genuinely arriving — not as a demo but as a deployable pattern. As we covered in our analysis of agentic orchestration earlier this year, the infrastructure is finally catching up to the ambition. The five projects above all run on the Gemini 3.5 Flash free tier or at costs under $10/month for moderate usage. Start with Project 1 this weekend — an automated code reviewer that catches security issues before they reach production is worth more than any amount of benchmark bragging.
🔗 Related Articles
- Google Antigravity 2.0 Shifts Dev to Agent-First at I/O 2026
- Google I/O 2026 AI Roundup: Every Feature You Actually Need to Know
- OpenAI Adopts Google’s SynthID: Why Watermarking AI Images Is the Easy Part — And Why Verification Is the Real Battle
Discover more from Susiloharjo
Subscribe to get the latest posts sent to your email.