Corali-Lang Benchmark: Detailed Analysis

GLM-5


The answers show that GLM-5 performs on par with Gemma 4 31B, and it also makes an error in understanding context. In story question number 3, the correct answer is “Scared”: the speaker is Coral, not Corali, so the reply should be in normal English. The model makes one error by answering “Scarad.” That is the wrong form for this context, although the word itself correctly follows the Corali pattern of adding -ad.
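The distinction between a context error and a pattern error can be made explicit with a small scoring helper. This is only a minimal sketch and not the benchmark's actual grading code: the function name `classify_answer` and its labels are hypothetical, and the expected answer and its Corali form are passed in by hand rather than derived from any rule.

```python
def classify_answer(model_answer: str, expected: str, other_language_form: str) -> str:
    """Label one benchmark answer (hypothetical helper, for illustration only).

    expected: the correct answer in the language the speaker actually uses.
    other_language_form: the same word in the language the speaker does NOT use.
    """
    answer = model_answer.strip().lower()
    if answer == expected.strip().lower():
        return "correct"
    if answer == other_language_form.strip().lower():
        # The word form is valid, but it is in the wrong language for this speaker.
        return "context_error"
    # Anything else does not match either known form.
    return "pattern_error"


# Story question 3: the speaker is Coral, so normal English is expected.
print(classify_answer("Scarad", expected="Scared", other_language_form="Scarad"))
```

Under these assumptions, GLM-5's answer to question 3 is labeled a context error rather than a pattern error, which matches the analysis above.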

Strengths of this model
– can understand long contexts without being distracted by the misleading descriptions of Coral’s fashion,
– can learn a new language, Corali, without being fixated on English,
– can absorb information from narratives, not just from explicit rules,
– can read Corali language patterns well and apply them to other English words

Weakness of this model
– makes mistakes in understanding the context

Categories: AI Benchmark
