All jobs

Senior Software Engineer- Evaluation (f/m/d)

Alephalpha3h ago
HeidelbergHybridFull-timeSenior Level5+ yrs exp

Top focus

Software EngineerSenior Software EngineerSoftware Engineer Ii

Aleph Alpha Research’s mission is to deliver category-defining AI innovation that enables open, accessible, and trustworthy deployment of GenAI in industrial applications. Our organization develops foundational models and next-generation methods that make it easy and affordable for Aleph Alpha’s customers to increase productivity in development, engineering, logistics, and manufacturing processes.

Team Culture At Aleph Alpha, we foster a culture built on ownership, au tonomy, and empowerment. Teams and individual contributors are trusted to take responsibility for their work and drive meaningful impact. We maintain a flat organizational structure with efficient, supportive management that enables quick decision‑making, open communication, and a strong sense of shared purpose.

About the Role As a Senior Software Engineer for LLM Model Evaluation , you will work in the pre-training evaluations team. Our mission is to give meaningful signals during pre-training runs and provide additional metrics to other teams to make informed decisions (ablations).

We are a mix of researchers and engineers, and you will support our engineering efforts. Major points include improving the testability of our code through design and architecture changes, and lowering the time it takes for an end-to-end integration of a new benchmark.

No two days are the same. Things move fast, and your ability to focus and prioritize is what lets you unblock the team day-to-day while designing the tooling and automation that speeds us up long-term. Your Responsibilities/Profile You drive these changes through incremental, hands-on modifications of our code.

Simultaneously, you are expected to work on smaller day-to-day tasks, e.g., maintain our repositories, investigate a spurious benchmark result, or iron out an out-of-memory error. You will have real influence on what gets built and how. Your work directly shapes how quickly we can experiment and improve our models.

Capable, driven and open individual that thrives in a dynamic environment: LLMs are rapidly evolving, and we maintain flat hierarchies and the possibility to make an impact across a wide range of areas. Hence - above all - we are looking for highly talented individuals that thrive in such an environment.

You should add something unique that helps our efforts, but nobody needs to tick a long list of boxes. Core Qualifications Software engineer with ability to write code that other strong engineers want to build on. Ability to incrementally convert a code-base with accumulated complexity into a more testable and explainable state.

Explainer: A lot of decisions we make together. Communicating and convincing the team of your ideas is pivotal skill. Taking initiatives to drive and deliver high-impact work Degree in computer science, engineering, or a related field. Strong Python skills.

Deep interest in and willingness to learn about LLM training. Preferred Qualifications (We encourage you to apply even if you don't check every box!) Experience working with distributed systems. Experience with infrastructure tooling and container orchestration such as docker, Kubernetes, infrastructure as code etc.

Experience with LLM evaluation, benchmark design or evaluation dataset curation. Understanding of foundation model training: how data, scale, and architecture affect capabilities. Familiarity with statistical methods. What we offer Become part of an AI revolution! 30 days of paid vacation Access to a variety of fitness & wellness offerings via Wellhub Mental health support through nilo.health Substantially subsidized company pension plan for your future security Subsidized Germany-wide transportation ticket Budget for additional technical equipment Flexible working hours for better work-life balance and hybrid working model Virtual Stock Option Plan JobRad® Bike Lease

Required skills

PythonDockerKubernetes
Posted on JobRush — the end-to-end AI job-search platform.