-
Russian Academy of Sciences
- Moscow, Russia
Pinned Loading
-
mtbbench
mtbbench PublicForked from bunnelab/mtbbench
MTBBench is a benchmark designed to evaluate the reasoning capabilities of multimodal large language models (LLMs) in complex clinical decision-making scenarios. It focuses on two core challenges i…
Python
-
-
-
-
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.

