All jobs

Applied Scientist - LLM, Alexa Conversational Modelling Intelligence

Amazon Development Center Germany GmbH3h ago
DE, BE, BerlinOnsiteFull-timeMid Level2+ yrs exp

Top focus

Applied Scientist
  • As an Applied Scientist II in the Alexa Conversational Modelling Intelligence team within Alexa AI, you will drive model post-training for Large Language Models that power Alexa+. You'll adopt and adapt state-of-the-art techniques — including supervised fine-tuning, RLHF
  • preference optimization — running rigorous experiments and translating findings into production-ready solutions that directly improve the customer experience for millions of users worldwide. You will own the full model development cycle from data curation through training, evaluation
  • deployment. Your day-to-day will involve developing evaluation methods and metrics, diagnosing model defects
  • iterating on recipes to move concrete quality and efficiency benchmarks. You'll write clean, reproducible code, contribute to shared tooling
  • collaborate closely with scientists and engineers to bring models from experimentation to scale. You are technically curious, experiment-driven
  • motivated by real customer impact. You will also advance the state of the art by publishing at top-tier NLP/ML conferences (ACL, EMNLP, NeurIPS, ICML, ICLR) — contributing to the broader research community while grounding your work in measurable outcomes. Key job responsibilities As an Applied Scientist II in the Alexa Conversational Modelling Intelligence team, you will own the end-to-end model development lifecycle for LLMs that power Alexa+. You'll design and execute training recipes — including supervised fine-tuning, reinforcement learning from human feedback
  • preference optimization — iterating rapidly on data, hyperparameters
  • architectures to move quality and efficiency metrics. Your work will directly shape how millions of customers interact with Alexa daily. You will build robust evaluation frameworks to measure model performance, diagnose failure modes
  • quantify improvements. This includes developing benchmarks, implementing LLM-as-a-judge pipelines
  • conducting rigorous defect analysis to identify where models fall short and why. You'll translate these insights into targeted improvements that close gaps in conversational quality, safety
  • fluency. You will collaborate closely with research scientists and engineers to bring models from experimentation to production at scale. You'll contribute to shared tooling and infrastructure, write clean and reproducible code
  • document your methods so the team can build on your work. You are also expected to advance the state of the art by publishing findings at top-tier NLP/ML venues (ACL, EMNLP, NeurIPS, ICML, ICLR), ensuring your research drives both customer impact and scientific contribution. A day in the life As an Applied Scientist II, your day will involve launching and monitoring training runs, analyzing experiment results
  • iterating on model recipes based on evaluation data. You'll participate in science reviews with fellow researchers, sync with engineering partners on deployment readiness
  • deep-dive into model outputs to understand behavioral patterns. You'll balance hands-on experimentation with collaborative problem-solving — working across the Alexa AI organization to align model improvements with customer-facing goals and product priorities. About the team The Alexa Conversational Modelling Intelligence team builds industry-leading LLM-based conversational technologies that customers love. Our mission is to push the envelope in LLMs for Alexa to deliver the best-possible customer experience. As an Applied Scientist, you'll contribute directly to that mission through model development and experimentation.
  • PhD in computer science, machine learning, engineering
  • related fields - Knowledge of at least one programming language such as Java, C#, JavaScript, Python, Ruby or Perl - Experience in designing experiments and statistical analysis of results - Hands-on experience building, training
  • Have publications on top-tier conferences, such as CVPR, ICCV, ECCV or NeurIPS - Experience working with large, complex data sets - Experience working effectively with science, data processing
  • software engineering teams - Experience in written and verbal communication skills to communicate with technical and non-technical audiences, including senior leadership - Experience building and deploying LLM solutions in production or at scale. - Hands-on experience with Large Language Models training and fine-tuning via pre-training, SFT, and/or RLHF/preference optimization. - Experience with LLM evaluation — building benchmarks, LLM-as-a-judge
  • defect/quality analysis. - Familiarity with modern training/inference infrastructure (e.g., distributed training, RL frameworks, model serving). Amazon is an equal opportunities employer. We believe passionately that employing a diverse workforce is central to our success. We make recruiting decisions based on your experience and skills. We value your passion to discover, invent, simplify and build. Protecting your privacy and the security of your data is a longstanding top priority for Amazon. Please consult our Privacy Notice ( https://www.amazon.jobs/en/privacy_page ) to know more about how we collect, use and transfer the personal data of our candidates. m/w/d Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner.

Required skills

PythonJavaC#JavaScriptRubyPerlmachine learningstatistical analysisLLMdata processingsoftware engineeringdistributed trainingreinforcement learningmodel serving
Posted on JobRush — the end-to-end AI job-search platform.