This index explains what each document covers and suggests reading paths depending on your time budget and role.

Core Documents (Series Pages)

Page | What it covers | Best for
--- | --- | ---
Executive Summary | The story, the numbers, and the 3 pillars | Execs, CTO/VP Eng, PMs
Phase 1: Timeline | Key milestones and why the architecture had to change | Everyone
Phase 2: Architecture | LDC/unitization, multi-active, DB and MQ foundations | Architects, senior engineers
Phase 3: Operations | Capacity planning, full-link stress testing, incident command | Engineering leadership, SRE
Phase 4: Technology Overview | Middle platform, payment flow, risk control, SOFAStack | Architects, ICs
Phase 4: Deep Dive | Internals: RPC, MQ, storage engine, transactions, ML risk control | Deep technical readers
Modern Tech Comparison | Mapping to Kubernetes, Kafka/Pulsar, gRPC, modern DBs, service mesh | Teams modernizing today
Phase 5: Synthesis | Patterns, anti-patterns, KPIs, decision framework | Leaders + architects

Reading Paths (By Time Budget)

10–15 minutes (Executive)

  1. Executive Summary

60–90 minutes (Engineering leadership)

  1. Executive Summary
  2. Phase 1: Timeline
  3. Phase 2: Architecture
  4. Phase 3: Operations
  5. Phase 5: Synthesis

6–10 hours (Full technical deep dive)

Read everything in the 60–90 minute path above, then:

  1. Phase 4: Technology Overview
  2. Modern Tech Comparison
  3. Phase 4: Deep Dive

Quick Reference (What to Copy)

If you are building a high-throughput system:

  • Copy the ideas: unitization/cells, multi-active, deterministic readiness, downgrade strategies, and automation as a product.
  • Don’t blindly copy the tooling: the modern equivalents are often different (Kubernetes, modern DBs, Kafka/Pulsar, gRPC, service mesh).

Notes

This series focuses on repeatable peak reliability, not just peak performance. Peak success is treated as an operational system: architecture + testing + monitoring + playbooks.