Technical Reference — 21 Chapters

Memory Management Units
and TLBs

A rigorous, implementation-level guide to address translation — from foundational virtual memory concepts through AI/ML accelerator memory systems. Targets systems engineers, OS developers, hardware architects, and ML infrastructure teams.

21Chapters

224SVG Figures

3Architectures

~198KWords

x86-64 ARM64 RISC-V GPU / AI Accelerators

Chapters

Memory Management Basics

23 SVGs292 KB

Virtual Memory Concepts

10 SVGs230 KB

Page Table Structures and Implementation

15 SVGs288 KB

Translation Lookaside Buffer (TLB) - Deep Dive

11 SVGs332 KB

IOMMU and DMA Remapping - Deep Dive

8 SVGs350 KB

Memory Protection and Access Control

10 SVGs297 KB

Page Faults and Exception Handling

10 SVGs756 KB

Advanced MMU Topics - System Integration and Optimization

10 SVGs323 KB

Advanced Page Table Optimizations

10 SVGs350 KB

Hardware Accelerators and External MMU Access

8 SVGs351 KB

Virtual Memory Challenges in AI/ML Accelerators

13 SVGs403 KB

When MMU Architecture Breaks - AI at Scale

9 SVGs264 KB

Machine Learning for Memory Management - The False Hope

7 SVGs130 KB

Software-Managed Memory for AI Workloads

8 SVGs222 KB

Beyond Traditional MMU - Alternative Translation Architectures

9 SVGs187 KB

Advanced TLB Optimization Techniques

11 SVGs215 KB

Page Table Walker Microarchitecture

11 SVGs183 KB

MMU-Level Vulnerabilities: Spectre, Meltdown, and Paging Exploits

12 SVGs206 KB

CXL and the Disaggregated Address Space

13 SVGs198 KB

Confidential Computing and the Untrusted Hypervisor

8 SVGs125 KB

Hardware Memory Safety — CHERI, MTE, and Capability-Based Addressing

8 SVGs127 KB

Reading Paths

🖥️

Systems / OS Developers

Chapters 1–9 form a complete foundation covering paging, faults, reclaim, and optimizations. Chapter 19 extends this to CXL-attached memory.

⚙️

Hardware Architects

Chapters 4, 5, 10, 15, 16, 17, 18 cover translation hardware, IOMMUs, advanced TLB design, PTW microarchitecture, and paging-level vulnerabilities. Chapter 19 covers CXL disaggregation.

🤖

AI/ML Infrastructure

Chapters 11–14 address GPU and accelerator memory, LLM serving, and ML-based optimization. Chapter 20 covers confidential computing for AI workloads. Chapter 21 covers hardware memory safety relevant to AI system integrity.

🔒

Security Researchers

Chapter 6 covers the full protection model; Chapters 5 and 12 cover device and multi-tenant isolation; Chapter 18 covers MMU-level vulnerabilities including Meltdown, Spectre, L1TF/Foreshadow, and MDS in depth; Chapter 20 covers confidential computing, TDX, SEV-SNP, and ARM CCA; Chapter 21 covers hardware memory safety — CHERI, ARM MTE, and capability-based addressing.

Architecture Coverage

Processor Family	Key Structures Covered
x86-64 (Intel / AMD)	CR3, PML4/PDPT/PD/PT, PCID, INVPCID, INVLPG, KPTI, SGX, VT-d, EPT, AMD NPT
ARM64 (ARMv8 / v9)	TTBR0/TTBR1_EL1, ASID, TLBI, TrustZone, Stage-2 (IPA→PA), SMMUv3
RISC-V	satp, Sv39/Sv48/Sv57, ASID, SFENCE.VMA, VMID in hgatp, G-stage translation
GPU / AI Accelerators	NVIDIA UVM, NVLink/NVSwitch peer-to-peer, TPU HBM, Intel Gaudi2, PagedAttention

Format

Each chapter is a self-contained HTML file with:

All SVG diagrams embedded inline
Print-optimised CSS with page-break controls
Linked Table of Contents
Pandoc-standard typography
Sidebar navigation (this sidebar)

Open any chapter directly in a browser, print to PDF, or host as GitHub Pages.

Citations

Content cites peer-reviewed literature, processor architecture manuals, and production system papers:

Intel SDM, AMD64 APM, ARM ARM, RISC-V Privileged Spec
ISCA, MICRO, ASPLOS, USENIX papers
IEEE-style references in every chapter
Minimum 8 references per chapter; AI chapters ≥ 12

Speculative claims about proprietary implementations are avoided.

Memory Management Unitsand TLBs