This looks like a very similar project to "Diffusion Models Are Real-Time Game Engines"[1] that circulated on HN a few months ago [2], which was playing DOOM. There's some pretty interesting commentary on that post that might also apply to this.
I'd like to do a deeper dive into the two approaches, but on a surface level one interesting note is Oasis specifically mentions using a use-specific ASIC (presumably for inference?):
> When Etched's transformer ASIC, Sohu, is released, we can run models like Oasis in 4K.
I'd like to do a deeper dive into the two approaches, but on a surface level one interesting note is Oasis specifically mentions using a use-specific ASIC (presumably for inference?):
> When Etched's transformer ASIC, Sohu, is released, we can run models like Oasis in 4K.
[1] https://gamengen.github.io/
[2] https://news.ycombinator.com/item?id=41375548