I think this is likely:

Chinese AI developers are increasingly pivoting from general-purpose chatbots towards embedding voice AI assistants in everyday applications as they seek broader commercial uses for generative AI. The growing industry focus on speech models reflects expectations that voice interfaces could become a key gateway for deploying AI across industries. As one of the most intuitive forms of human-computer interaction, voice requires little user training.