Repositories
- AMALIA-LLM/alba-benchmark (Public)
  Code for the paper: ALBA: A European Portuguese Benchmark for Evaluating Language and Linguistic Dimensions in Generative LLMs
- AMALIA-LLM/p3b3-benchmark (Public)
  Code for the paper: P3B3: A Multi-Turn Conversational Benchmark for Measuring European and Brazilian Portuguese Variety Bias in LLMs
- AMALIA-LLM/alba_benchmark_viewer (Public)
- AMALIA-LLM/datatrove-amalia (Public, forked from huggingface/datatrove)
  Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
- AMALIA-LLM/inspect_evals (Public archive, forked from UKGovernmentBEIS/inspect_evals)
  Collection of evals for Inspect AI
- AMALIA-LLM/lm-evaluation-harness (Public, forked from EleutherAI/lm-evaluation-harness)
  A fork of lm-eval-harness.