Director, Site Reliability Engineer

Mastercard | 18h ago
United States | Onsite | Full-time | Director Level | 10+ yrs exp
H-1B sponsor

Top focus

SRE
Our Purpose

Mastercard powers economies and empowers people in 200+ countries and territories worldwide. Together with our customers, we’re helping build a sustainable economy where everyone can prosper. We support a wide range of digital payments choices, making transactions secure, simple, smart and accessible. Our technology and innovation, partnerships and networks combine to deliver a unique set of products and services that help people, businesses and governments realize their greatest potential.

Title and Summary

Director, Site Reliability Engineer

Who is Mastercard?

At Mastercard technology, we work to connect and power an inclusive, digital economy that benefits everyone, everywhere, by making transactions safe, simple, smart and accessible. Using secure data and networks, partnerships and passion, our innovations and solutions help individuals, financial institutions, governments and businesses realize their greatest potential. Our decency quotient, or DQ, drives our culture and everything we do inside and outside of our company. We cultivate a culture of inclusion for all employees that respects their individual strengths, views and experiences. We believe that our differences enable us to be a better team, one that makes better decisions, drives innovation and delivers better business results.

Technology at Mastercard

What we create today will define tomorrow. Revolutionary technologies that reshape the digital economy to be more connected and inclusive than ever before. Safer, faster, more sustainable. And we need the best people to do it. Technologists who are energized by the challenges of a truly global network. With the talent and vision to create the critical systems and products that power global commerce and connect people everywhere to the vital goods and services they need every day. Working at Mastercard means being part of a unique culture. Inclusive and diverse, a rich collaboration of ideas and perspectives. A place that celebrates your strengths, values your experiences and offers you the flexibility to shape a career across disciplines and continents. And the opportunity to work alongside experts and leaders at every level of the business, improving what exists and inventing what’s next.

About the Role

The Business Operations team is seeking a highly motivated and experienced Director, Site Reliability Engineering (SRE) to join our team. You will play a critical role in ensuring the reliability, scalability and performance of our applications, supporting essential services that power Mastercard's global operations. As a thought leader in your field, you will bring technical expertise, a passion for automation and the ability to mentor.

The role of the Business Operations Site Reliability Engineer is to be the production readiness steward for Mastercard products. As Business Operations SRE, we are responsible for ensuring that our platform is stable and healthy. We break down barriers to running our products by fostering developer run ownership and empowering developers to build resilient products. We support our developers during the application build phase in software run principles that include operational design, automation, capacity planning and monitoring that leads to fault-tolerant, scalable products. We see the big picture and help create and enforce operations standards while facilitating an agile and learning culture. We support daily operations with a hyper focus on triage and root cause by understanding the business impact of our products and subsequently performing blameless post-mortems.

The goal of every Business Operations team is to engage early in the development lifecycle to be more proactive and upfront in the development process and to proactively manage production and change activities to maximize customer experience and increase the overall value of supported applications. Business Operations teams also focus on risk management by tying all our activities together with an overarching responsibility for compliance and risk mitigation across all our environments. Ultimately, the role of Business Operations is to align Product and Customer Focused priorities with Operational needs by providing continuous feedback throughout the lifecycle.

As part of the Business Operations team, you will:
  • Develop and execute short-term and medium-term strategic plans for Site Reliability Engineering across Mastercard, aligning functional initiatives with organizational objectives by modifying existing methods to make significant improvements to existing processes, programs, and/or policies.
  • Lead the design and implementation of cross-functional initiatives to automate operations, improve incident management, and enhance system resilience.
  • Establish and maintain governance processes to ensure adherence to reliability standards and best practices.
  • Direct collaboration across engineering, product, and business teams to embed reliability considerations into development and deployment processes.
  • Oversee incident response efforts, ensuring timely resolution and comprehensive root cause analysis.
  • Cultivate a culture of continuous improvement by promoting best practices, innovation, and proactive risk management.
  • Learn about industry trends and emerging technologies related to system automation and resilience.
  • Develop and drive the organization’s strategic vision for system reliability, scalability, and operational efficiency.
  • Manage a team of people leader(s) and/or senior individual contributor(s), conducting goal setting and performance appraisal processes, mentoring and coaching top talent within own team, and fostering a culture of continuous improvement and operational excellence.

Role qualifications:

The ideal candidate will apply leadership skills independently and consistently in complex or nuanced situations to support broader goals. The candidate is recognized as a key contributor and may coach or support others informally.

As a leader, you will:
  • Build diverse, high performing teams with a customer-focused mindset. Attract, grow, and develop exceptional, future-ready talent.
  • Inspire teams to look beyond their function, connect their work to enterprise impact, think end to end, and act in the best interests of the whole company.
  • Anticipate market shifts and use curiosity, innovation, and technology to turn insights into strategies that drive growth and competitive advantage.
  • Lead through ambiguity across diverse markets and regulatory environments, connecting insights and stakeholders to create clarity with sound judgment and cross cultural awareness.
  • Inspire and mobilize people and teams to act with speed, agility, and accountability in driving ambitious business outcomes, with a relentless focus on the customer.
  • Explore new ideas, ways of working and technology. Set clear direction, align stakeholders and remove barriers to progress. Guide teams through uncertainty with clarity, empathy and resilience.

As this is a player/coach role, the ideal candidate is also a subject matter expert and strategic leader in the following skills. They apply the skills in evolving, ambiguous and unforeseeable contexts, regularly coach and mentor others and shape best practices across teams or the organization:
  • Observability - Ability to use scripting and tooling to implement observability solutions, enabling the collection, analysis and visualization of metrics, logs and traces to support incident detection, diagnosis and continuous service improvement.
  • Programming and Scripting - Ability to write and maintain code and scripts to automate tasks, build operational tools and support monitoring, deployment and incident response using languages such as Python, Go, Bash.
  • Systems and Network Administration - Ability to configure, operate, and troubleshoot Linux/Unix systems and network components, applying knowledge of networking concepts, protocols, security, and system reliability.
  • Cloud Computing and Infrastructure - Ability to design, deploy and manage applications and infrastructure on cloud platforms (e.g., AWS, Azure, GCP), ensuring scalability, security, availability and operational efficiency.
  • Reliability and Scalability - Ability to design and operate systems for high availability, fault tolerance, and disaster recovery, while ensuring systems can scale to meet current and future demand.
  • DevOps Practices - Ability to apply DevOps principles and practices, including CI/CD pipelines, containerization, and orchestration, to enable faster, more reliable software delivery and operations.
  • Troubleshooting - Capability to systematically identify, diagnose and resolve technical issues across systems, applications and networks, using analytical methods and tools to restore functionality, minimize disruption and ensure stable operations.
  • Capacity Planning and Performance Optimization - Ability to monitor resource utilization, forecast future capacity needs and optimize system performance to support growth, scalability and efficient infrastructure usage.
  • IT Service Management - Ability to apply IT service management principles to incident, problem and change management, ensuring reliable service delivery, effective incident response and continuous service improvement aligned to business needs.
  • Proactive Monitoring and Improvement (SRE Applications) - The ability to use application reliability signals to anticipate issues, identify risks, and drive preventative improvements that enhance application performance and availability.

Mastercard is a merit-based, inclusive, equal opportunity employer that considers applicants without regard to gender, gender identity, sexual orientation, race, ethnicity, disabled or veteran status, or any other characteristic protected by law. We hire the most qualified candidate for the role. In the US or Canada, if you require accommodations or assistance to complete the online application process or during the recruitment process, please contact reasonable_accommodation@mastercard.com and identify the type of accommodation or assistance you are requesting. Do not include any medical or health information in this email. The Reasonable Accommodations team will respond to your email promptly.

Corporate Security Responsibility

All activities involving access to Mastercard assets, information, and networks come with an inherent risk to the organization and, therefore, it is expected that every person working for, or on behalf of, Mastercard is responsible for information security and must:
  • Abide by Mastercard’s security policies and practices
  • Ensure the confidentiality and integrity of the information being accessed
  • Report any suspected information security violation or breach
  • Complete all periodic mandatory security trainings in accordance with Mastercard’s guidelines

In line with Mastercard’s total compensation philosophy and assuming that the job will be performed in the US, the successful candidate will be offered a competitive base salary and may be eligible for an annual bonus or commissions depending on the role. The base salary offered may vary depending on multiple factors, including but not limited to location, job-related knowledge, skills, and experience.

Mastercard benefits for full time (and certain part time) employees generally include:
  • insurance (including medical, prescription drug, dental, vision, disability, life insurance)
  • flexible spending account and health savings account
  • paid leaves (including 16 weeks of new parent leave and up to 20 days of bereavement leave)
  • 80 hours of Paid Sick and Safe Time, 25 days of vacation time and 5 personal days, pro-rated based on date of hire
  • 10 annual paid U.S. observed holidays
  • 401k with a best-in-class company match
  • deferred compensation for eligible roles
  • fitness reimbursement or on-site fitness facilities
  • eligibility for tuition reimbursement
  • and many more

Mastercard benefits for interns generally include:
  • 56 hours of Paid Sick and Safe Time
  • jury duty leave
  • on-site fitness facilities in some locations

Pay Ranges

O'Fallon, Missouri: $152,000 - $258,000 USD

Required skills

Site Reliability Engineering, automation, incident management, system resilience, risk management
Posted on JobRush — the end-to-end AI job-search platform.