Corali-Lang Benchmark: Detailed Analysis

QWEN 3 NEXT 80B INSTRUCT

The QWEN 3 Next 80B Instruct results are particularly interesting, as across five runs, all responses were consistently the same. The results suggest that the model is still struggling to apply new patterns, preferring to stick with normal English. Contextual understanding of who is speaking remains weak, as evidenced by incorrectly answered true or false multiple-choice questions.

Categories: AI Benchmark

Cookies Notice

Our website use cookies. If you continue to use this site we will assume that you are happy with this.