AI Benchmark Engineer | Native Language Specialist - Spanish

LILT

MadridTrabajos en informática25 February 2026

Tam açıklama

About The Opportunity We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encoding edge cases in terminal workflows. We are seeking experienced native-speaking software engineers to design, build, and validate these benchmarks. You will create high-signa…

View full job description on Adzuna →

İlanlara dön İlanlara göz at