Welcome, AI Engineers!

👋🏽 I'm Anup. I'm an AI and Software Engineer building production Agentic AI and Generative AI systems. I work on RAG pipelines, multi-agent architectures, and multi-cloud deployments.

My newsletter, The AI Engineering Brief, covers practical AI engineering for people shipping real systems. X/Twitter/Bluesky: @anup.

Claude Code

Who Still Understands the Code?

AI coding agents make you dramatically faster. The cost they carry is quieter: a slow erosion of how well you understand the software you are shipping. Here is how I have come to think about that trade, and how I try to stay on the right side of it.

24 Jun

AI Engineering

Designing teams for an agentic world

AI coding agents are changing the economics of software development and the shape of engineering organisations. Here is how leaders should rethink build-versus-buy decisions, talent, team structure, platform strategy, and AI governance.

21 Jun

Agent Memory

The Frontier of Agent Memory: From Recall to Experience

Part 3 of a 3-part series on AI agent memory architecture.

03 Jun

How Modern Agent Memory Architectures Work

Part 2 of a 3-part series on AI agent memory architecture.

31 May

Agent Memory

Why Context Is Not Enough

Part 1 of a 3-part series on AI agent memory architecture.

29 May

Agentic Architecture

Welcome to Middle Loop Engineering

Where engineering rigour goes now that AI writes the code

11 May

GPUs

How fast does it serve? Throughput, latency, and picking the right GPU

Part 2 of 2 on inference engineering for AI engineers.

07 May

Zed

My new favourite code editor.

Claude Code

My pair programmer in the terminal.

Obsidian

Where my second brain lives. Daily notes go in, post outlines come out.

Wispr Flow

How I dictate notes and posts when typing would slow me down.

AI for Humans

Educational, light-hearted discussions breaking down AI concepts

Google DeepMind: The Podcast

Research-focused, neuroscience-inspired models, safe and ethical AI

Latent Space

Technical updates, real-world engineering, foundational models

NVIDIA AI Podcast

In-depth interviews on AI transforming industries

VRAM Calculator: 2026 LLM Tool

LLM VRAM Calculator

A free VRAM calculator for self-hosted LLMs that picks the smallest GPU instance (H100, B200, A100) a model fits on at FP16, FP8, or FP4 precision.
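As a rough illustration of the arithmetic behind a tool like this, here is a minimal sketch in Python. It is not the calculator's actual code. The GPU memory figures are nominal, illustrative values, and the `overhead` factor and both function names are assumptions introduced for this example.

```python
# Bytes per parameter for each precision named above.
BYTES_PER_PARAM = {"FP16": 2.0, "FP8": 1.0, "FP4": 0.5}

# Illustrative nominal memory sizes in decimal GB (assumed for this sketch,
# not taken from the calculator itself).
GPU_MEMORY_GB = {"A100-40GB": 40, "H100-80GB": 80, "B200-192GB": 192}


def estimate_vram_gb(params_billions, precision, overhead=1.2):
    """Estimate VRAM as weight size times a hypothetical overhead factor.

    The overhead stands in for KV cache, activations, and runtime buffers;
    1.2 is an assumed placeholder, not a measured value.
    """
    # 1e9 parameters * bytes per parameter / 1e9 bytes per GB
    weights_gb = params_billions * BYTES_PER_PARAM[precision]
    return weights_gb * overhead


def smallest_fitting_gpu(params_billions, precision, overhead=1.2):
    """Return the name of the smallest GPU that fits, or None if none does."""
    need = estimate_vram_gb(params_billions, precision, overhead)
    fitting = [(mem, name) for name, mem in GPU_MEMORY_GB.items() if mem >= need]
    return min(fitting)[1] if fitting else None


if __name__ == "__main__":
    for precision in ("FP16", "FP8", "FP4"):
        print(precision, smallest_fitting_gpu(70, precision))
```

Under these assumptions, dropping to a lower precision shrinks the weight footprint proportionally, which is why the same model can land on a smaller instance at FP4 than at FP16. A real estimate would also need context length and batch size to size the KV cache.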

Eliza Redux: A Voice AI crisis support Agent

Voice AI crisis support chatbot that responds in under a second. Built and deployed in 90 minutes for a hackathon win. Handles natural conversation, breathing exercises, safety checks, and can locate nearby emergency services.

ViT: 2024 - Paper Replication

Vision Transformer (ViT) Paper PyTorch Implementation

PyTorch replication of the Vision Transformer paper. Applied transformer architecture to image classification to deepen my understanding of attention over patches.

EfficientNetB2: 2024 - Paper Replication

EfficientNetB2 Computer Vision Model

An EfficientNetB2 feature-extractor model that classifies images of food as pizza, steak, or sushi. It reaches 95%+ accuracy with an inference time of about 0.03 seconds per image.
