Because my understandings is that, however you get to 100K, the 100,001st token is generated the same way as far as the model is concerned.
Because my understandings is that, however you get to 100K, the 100,001st token is generated the same way as far as the model is concerned.