TorchTPU: Running PyTorch Natively on TPUs at Google Scale

(developers.googleblog.com)

45 points | by mji 4 hours ago

3 comments

  • in-silico 1 hour ago
    This is great to see.

    I trained some research models using the existing PyTorch/XLA on TPUs, and it was a mess of undocumented behavior and bugs (silently hanging after 8 hours of training!).

    If anyone is trying to use PyTorch on TPU before TorchTPU is released, you can check out the training pipeline that I ended up building to support my research: https://github.com/aklein4/easy-torch-tpu

  • Reubend 1 hour ago
    Sounds good, but my main question is: is this a fork, or a new built-in backend (like MPS)?