Does the project's evaluation dataset only support English, or does it support both English and Chinese?
Does the project's evaluation dataset only support English, or does it support both English and Chinese?