AI Frameworks Engineer

Intel•22h ago

United StatesHybridFull-timeMid Level3+ yrs exp

H-1B verified · 239 LCAs

Job Description

Join Intel's AI Frameworks PyTorch team to shape the future of Artificial Intelligence (AI) and High-Performance Computing (HPC). As an AI Frameworks Engineer, you will design, build, and optimize AI software frameworks that enable breakthrough scientific discoveries and machine learning innovations at scale.

You will contribute to the success of the new Argonne AI Center of Excellence, a partnership between Intel and Argonne National Labs. This role shall focus on identifying performance bottlenecks and functional limitations in PyTorch, vLLM, and related AI framework software when running key AI workloads and models at large scale.

After identifying issues that impact functionality, scalability, and performance, you will help design and implement optimizations that maximize the utility of Intel CPUs, GPUs, and AI accelerators. You will work as part of a larger team developing and optimizing next-generation AI framework software, enabling efficient training, inference, and deployment of modern AI models.

Your innovative work will improve performance across Intel architectures and empower HPC and AI applications to achieve higher throughput, lower latency, and stronger developer productivity

Key Responsibilities

Identify performance bottlenecks and additional features necessary to run Argonne AI CoE workloads.
Optimize PyTorch, vLLM, and related AI framework software for Intel CPUs, GPUs, and AI accelerators.
Enable and optimize key AI models, including large language models, generative AI models, and scientific AI workloads.
Profile AI training and inference workloads to identify issues across framework, runtime, kernel, and hardware layers.
Collaborate with cross-functional teams to define technical specifications and software requirements.
Troubleshoot and resolve complex issues across multiple hardware and software stack layers.
Contribute to software innovations that enhance AI framework performance, scalability, and usability.
Partner with engineering and architecture teams to maximize AI model performance on Intel architectures

Qualifications

You must possess the below minimum qualifications to be initially considered for this position.
Preferred qualifications are in addition to the requirements and are considered a plus factor in identifying top candidates.
Minimum Qualifications Bachelor's degree in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, or STEM-related field with 3+ yrs. of experience in software development.
OR Master's degree in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, or STEM-related field with 1+ yrs. of experience in software development.
OR Ph.D. in Computer Science, Computer Engineering, Electrical Engineering, Mathematics
STEM-related field with 3+ months of experience in software development. 3+ years of experience in at least one of the following: AI framework development or optimization.
PyTorch, vLLM, TensorFlow, Hugging Face, DeepSpeed, or related AI software frameworks.
AI model development, enablement, profiling, or performance optimization.
GPU or accelerator software development.
Runtime, compiler, kernel, or backend optimization for AI workloads.
Preferred Qualifications Advanced degree, Master's or PhD, in Computer Science, Computer Engineering, Electrical Engineering, Mathematics, or STEM-related field.
Proficiency in Python, SYCL/CUDA, and C++ programming.
Experience developing in Linux environments.
Background in AI framework internals, model execution, or backend integration.
Experience with PyTorch, vLLM, Hugging Face Transformers, DeepSpeed, or similar AI frameworks.
Experience optimizing AI training or inference workloads for CPUs, GPUs, or accelerators.
Background in performance profiling, memory optimization, throughput improvement, or latency reduction.
Experience enabling or optimizing large language models, generative AI models, or scientific AI models.
Understanding of deep learning algorithms, model architectures, and AI workload patterns.
Strong analytical skills and ability to solve complex software challenges.
Passion for driving meaningful advancements in AI software and scientific computing.
Take the next step in your career and join Intel's team to make a lasting impact in shaping the future of computing.
Apply today to be part of groundbreaking innovations in HPC and AI Job Type: Experienced Hire Shift: Shift 1 (United States of America) Primary Location: US, California, Santa Clara Additional Locations: US, Oregon, Hillsboro, US, Texas, Austin Business group: At the Data Center Group (DCG), we're committed to delivering exceptional products and delighting our customers.
We offer both broad-market Xeon-based solutions and custom x86-based products, ensuring tailored innovation for diverse needs across general-purpose compute, web services, HPC, and AI-accelerated systems.
Our charter encompasses defining business strategy and roadmaps, product management, developing ecosystems and business opportunities, delivering strong financial performance, and reinvigorating x86 leadership.
Join us as we transform the data center segment through workload driven leadership products and close collaboration with our partners.
Posting Statement: All qualified applicants will receive consideration for employment without regard to race, color, religion, religious creed, sex, national origin, ancestry, age, physical or mental disability, medical condition, genetic information, military and veteran status, marital status, pregnancy, gender, gender expression, gender identity, sexual orientation
any other characteristic protected by local law, regulation
Position of Trust N/A Benefits We offer a total compensation package that ranks among the best in the industry.
It consists of competitive pay, stock bonuses, and benefit programs which include health, retirement, and vacation.
Find out more about the benefits of working at Intel .
Annual Salary Range for jobs which could be performed in the US: $149,750.00-275,580.00 USD The range displayed on this job posting reflects the minimum and maximum target compensation for the position across all US locations.
Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
Your recruiter can share more about the specific compensation range for your preferred location during the hiring process.
Work Model for this Role This role will be eligible for our hybrid work model which allows employees to split their time between working on-site at their assigned Intel site and off-site. * Job posting details (such as work model, location or time type) are subject to change. * ADDITIONAL INFORMATION: Intel is committed to Responsible Business Alliance (RBA) compliance and ethical hiring practices.
We do not charge any fees during our hiring process.
Candidates should never be required to pay recruitment fees, medical examination fees, or any other charges as a condition of employment.
If you are asked to pay any fees during our hiring process, please report this immediately to your recruiter.

Required skills

PythonSYCLCUDAC++LinuxPyTorchvLLMTensorFlowHugging FaceDeepSpeed