ML researcher @kalomaze says a KV cache bug caused transformer inference routing errors that exposed model outputs to other users · Digg