From round-robin to KV-cache-aware routing — how AI broke every assumption traditional load balancing was built on.