efficiency per token has tanked but it's still faster.
given this is the first generation for Cerberas hardware this is the worst it's ever going to be.
when it reaches the main 5.3 codex efficiency at this token rate this kind of articles will seem silly in retrospect
when it reaches the main 5.3 codex efficiency at this token rate this kind of articles will seem silly in retrospect