AI Benchmark Engineer | Native Language Specialist - Spanish
LILT
Madrid
IT jobs
25 February 2026
Full description
About The Opportunity
We are building a rigorous, verifiable evaluation suite of Terminal-Bench tasks designed to test the limits of large language models on multilingual software challenges. Our goal is to measure multilingual robustness across prompt language effects, non-English data processing, and complex locale/encoding edge cases in terminal workflows. We are seeking experienced native-speaking software engineers to design, build, and validate these benchmarks. You will create high-signa…