
Cerebras' benchmark was most likely run under ideal conditions, but I'm not sure it's even possible to test public cloud APIs under ideal conditions: they're shared infrastructure, so you don't know whether any given request is "ideal". I think you can only test these things across a large number of requests, and even that assumes shared resource usage doesn't change much over the measurement window.
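One way to make "test across significant numbers of requests" concrete is to record tokens/s per request and report the distribution rather than a single number. A minimal sketch of the aggregation step (pure arithmetic, no real API calls; the sample figures below are invented for illustration):

```python
# Aggregate per-request throughput measurements into distribution stats.
# All sample numbers are invented, not measurements of any real endpoint.

def throughput_stats(samples):
    """samples: list of (tokens_generated, seconds_elapsed) per request."""
    rates = sorted(t / s for t, s in samples)
    n = len(rates)
    return {
        "mean": sum(rates) / n,
        "p50": rates[n // 2],                      # median-ish
        "p95": rates[min(n - 1, int(n * 0.95))],   # tail throughput
    }

# e.g. five requests of 512 tokens each against a shared endpoint
samples = [(512, 0.6), (512, 0.8), (512, 0.55), (512, 1.2), (512, 0.7)]
print(throughput_stats(samples))
```

On shared infrastructure the spread between p50 and p95 is usually the interesting part, since a single lucky request can look "ideal" without being representative.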


I'm not talking about that. I and many others here have spun up 8x or more H100 clusters and run this exact model. Zero other traffic. You won't come anywhere close to this.


  I'm not talking about that. I and many others here have spun up 8x or more H100 clusters and run this exact model. Zero other traffic. You won't come anywhere close to this.

An 8x H100 cluster can also do fine-tuning, right? Does Cerebras offer fine-tuning support?


In that case I'm misunderstanding you. Are you saying it's "BS" that they're reaching ~1k tokens/s? If so, you may be misunderstanding what a Cerebras machine is. Also, an 8x H100 cluster is still roughly half the price of a single Cerebras machine, and that's even accounting for H100s being massively overpriced. You easily get twice the value in a Cerebras machine; they have nearly 1M cores on a single die.


Ha ha. He probably means "at a batch size of 1", i.e. not even using some amortization tricks to get better numbers.
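For context on the batching trick being alluded to: serving many requests per decode step raises aggregate tokens/s even though no individual stream gets faster. A toy model of that effect (all constants are made up for illustration, not measured on any hardware):

```python
# Toy decode-throughput model: per-step latency grows only slowly with
# batch size (decode is typically memory-bandwidth-bound), so aggregate
# tokens/s rises roughly with batch. Constants below are invented.

def tokens_per_second(batch, base_step_ms=20.0, per_seq_ms=0.5):
    """One token per sequence per step; step time in milliseconds."""
    step_ms = base_step_ms + per_seq_ms * batch
    return batch * 1000.0 / step_ms

for b in (1, 8, 64):
    print(f"batch={b:3d}  aggregate={tokens_per_second(b):7.1f} tok/s  "
          f"per-stream={tokens_per_second(b) / b:5.1f} tok/s")
```

Under this model, batch size 1 gives the worst aggregate number, which is why quoting single-stream speed is the "no amortization tricks" case.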


Ah! That does make more sense!



