AI Lab - 3600Labs

★ FEATURED · 24 MAY 2026 · 8 MIN READ

Giving an AI agent tools - without burning your hand.

Most "agent" demos collapse the moment they touch a real API. Here's the small set of patterns - sandboxes, allow-lists, approval gates, and deterministic fallbacks - that we use to ship production agents that don't go sideways.

AI AGENTS · TOOL-USE · PRODUCTION · Read field note →

19 MAY 2026 · 6 MIN

RAG that actually answers the question.

Why most knowledge-base bots are bad - and the boring infrastructure (chunking, re-ranking, per-tenant data) that makes them good.

RAG · Read →

11 MAY 2026 · 5 MIN

A WhatsApp AI for the corner store.

The unglamorous architecture behind an assistant that handles enquiries, bookings, and follow-ups on the world's most-used app. India edition.

VOICE AI · Read →

02 MAY 2026 · 4 MIN

Invoice processing without the OCR pain.

Multimodal models flipped the playbook in 2025. Here's a pipeline you can stand up over a weekend - including the human-review step you'll forget.

DOC INTEL · Read →

27 APR 2026 · 7 MIN

Voice AI for appointments: the latency budget.

Every millisecond between user-stop-talking and bot-start-talking matters. The architecture we use to stay under 400ms end-to-end.

VOICE AI · Read →

★ OPEN SOURCE · 21 APR 2026 · RELEASED

prompt-gym - a tiny eval harness we use on every project.

A 200-line library to run regression tests on prompts and agent runs. Catches the "this used to work" failures before they hit production. MIT licensed.

github.com/3600labs/prompt-gym · ★ 1.2k · Star on GitHub →

15 APR 2026 · 6 MIN

AI agents vs chatbots: what businesses actually need.

The taxonomy that helps us recommend the right thing to clients - and stops them from buying an "agent platform" they don't need.

AI AGENTS · Read →

08 APR 2026 · 5 MIN

Sales follow-up agents: 12 patterns that work.

From cold lead nurture to closed-won onboarding. The flows we've shipped, the ones that flopped, and what makes the difference.

AI AGENTS · Read →

01 APR 2026 · 3 MIN

Why we still write SQL by hand.

Yes, the LLM could do it. We don't let it. A small rant on engineering hygiene.

OPINION · Read →

25 MAR 2026 · 9 MIN

The internal-tools renaissance.

Why your team's most-used software in 2027 won't have a Hacker News thread. And why we love it.

OPS · Read →

18 MAR 2026 · 4 MIN

What an FDE actually does in week one.

Hour-by-hour from one of our Forward Deployed Engineers, fresh from a 3-week warehouse deployment. Not glamorous. Effective.

★ FDE · Read →

10 MAR 2026 · 5 MIN

Choosing your AI vendor in 2026.

OpenAI vs Anthropic vs Gemini vs open-source. The framework we use to pick - and when we mix.

ADVISORY · Read →

What we're
shipping & breaking.

One field note in your inbox.
Every Friday.
