Corali-Lang Benchmark: Detailed Analysis

In this post, I’ll discuss the Corali-Lang Benchmark results in more depth. This benchmark was created for a Kaggle competition: Measuring Progress Toward AGI – Cognitive Abilities.

The writeup can be found here. The writeup includes a link to the Corali-Lang Benchmark, along with five tasks and an Excel file containing the benchmark results. The data in the Excel file is taken from the task output data in JSON format. To make it easier to read and analyze, I’ve combined it into a single Excel file to analyze each model’s responses. By understanding the model’s responses, we can identify the model’s strengths, weaknesses and why it could fail.

I’ll discuss the model’s strengths and weaknesses in detail on their respective pages, along with screenshots of the results in the Excel file.

Hello, I’m Lusiana!

Welcome to my learning adventure!

I’m interested in learning new things and am currently interested in Artificial intelligence (AI).

The Coralab is my “imaginary” laboratory. I’ll be posting about the things I learn about Artificial intelligence (AI) here.

PS: Btw, this lab is available in dark and light mode. Enjoy!

PPS: Actually, I still can’t believe I’m back writing blog after several years. Usually when I write blog, I don’t write fiction and vice versa, but now I’m doing both, so good luck for me and my energy.

Cookies Notice

Our website use cookies. If you continue to use this site we will assume that you are happy with this.

Corali-Lang Benchmark: Detailed Analysis

TABLE OF CONTENT

ABOUT

DISCLAIMER

Categories

GET IN TOUCH

Cookies Notice

TABLE OF CONTENT

You may also like

What is Inside Runway?

Ask 13 LLMs About Reflective Paragraph: Detailed Analysis

Widgets

ABOUT

DISCLAIMER

Categories

Tags

GET IN TOUCH

Cookies Notice