All jobs

Sr Software Dev Engineer, Stores Foundational AI -SFAI

Amazon.com Services LLC2h ago
United StatesOnsite$168.1K–$227.4KFull-timeSenior Level5+ yrs exp
H-1B verified · 2310 LCAs

Top focus

Software EngineerSoftware Engineer IiSenior Software Engineer
  • We're building a foundational LLM for Amazon Stores that fuses general world knowledge with Amazon e-commerce domain knowledge to provide new and improved shopping experiences for our customers. We are searching for pioneers who are passionate about technology, innovation
  • are ready to make a lasting impact on the industry. You'll be working with talented scientists and engineers to innovate on behalf of our customers. If you're fired up about being part of a dynamic, driven team, then this is your moment to join us on this exciting journey! Key job responsibilities In this role you will leverage your engineering background and expertise to help develop generative AI for shopping. As a Senior Software Development Engineer, you will: Architect and build scalable ML infrastructure that powers the training and deployment of large language models—directly shaping the future of AI-driven shopping experiences for all Amazon customers Drive technical innovation by designing experimentation frameworks and tooling that accelerate breakthrough insights, enabling scientists and engineers to iterate faster and smarter Lead cross-functional initiatives partnering with applied scientists and engineering teams to translate frontier research into production systems that delight customers Mentor and elevate the team through technical leadership, code reviews
  • architectural guidance—raising the bar for engineering excellence across the organization Own impactful projects end-to-end across diverse technologies—from distributed computing and ML operations to prompt engineering—while navigating ambiguity and making strategic trade-offs that balance innovation with delivery A day in the life On any given day, you may work on: Design and build end-to-end RL post-training pipelines (rollout → reward → optimization) at cluster scale Improve RL training stability (PPO / GRPO / RLOO) by monitoring and tuning key metrics such as reward, KL divergence
  • policy stability Optimize RL post-training efficiency (GPU utilization, batching, sequence packing, async rollouts) Partner with research scientists to translate new RL algorithms into scalable, production-ready systems Profile and eliminate bottlenecks across compute, networking
  • storage Build observability systems for training dynamics, system health
  • experiment tracking Collaborate cross-functionally to run experiments, iterate quickly
  • unblock research progress Mentor engineers and contribute to system design and long-term technical roadmap
  • 5+ years of non-internship professional software development experience - 5+ years of programming with at least one software programming language experience - 4+ years of leading design or architecture (design patterns, reliability and scaling) of new and existing systems experience - Experience as a mentor, tech lead or leading an engineering team - Knowledge of Machine Learning and LLM fundamentals, including transformer architecture, training/inference lifecycles
  • optimization techniques - Demonstrated ability to drive technical direction and influence engineering decisions across teams
  • Knowledge of ML frameworks including JAX, PyTorch, vLLM, SGLang, Dynamo, TorchXLA
  • TensorRT - Experience with distributed systems, big data technologies
  • machine learning infrastructure Amazon is an equal opportunity employer and does not discriminate on the basis of protected veteran status, disability
  • other legally protected status. Our inclusive culture empowers Amazonians to deliver the best results for our customers. If you have a disability and need a workplace accommodation or adjustment during the application and hiring process, including support for the interview or onboarding process, please visit https://amazon.jobs/content/en/how-we-hire/accommodations for more information. If the country/region you’re applying in isn’t listed, please contact your Recruiting Partner. The base salary range for this position is listed below. Your Amazon package will include sign-on payments and restricted stock units (RSUs). Final compensation will be determined based on factors including experience, qualifications
  • location. Amazon also offers comprehensive benefits including health insurance (medical, dental, vision, prescription, Basic Life & AD&D insurance and option for Supplemental life plans, EAP, Mental Health Support, Medical Advice Line, Flexible Spending Accounts, Adoption and Surrogacy Reimbursement coverage), 401(k) matching, paid time off
  • parental leave. Learn more about our benefits at https://amazon.jobs/en/benefits . USA, WA, Seattle - 168,100.00 - 227,400.00 USD annually

Required skills

PythonMachine LearningJAXPyTorchvLLMSGLangDynamoTorchXLATensorRTdistributed systemsbig data technologies
Posted on JobRush — the end-to-end AI job-search platform.