Chinese large language model startup StepFun's speech model Step-Audio R1.1 (Realtime) ranked first globally in the Speech Reasoning category with an accuracy rate of 96.4 percent, according to data ...