All jobs

Senior Data Platforms SRE

Iqvia18h ago
Bangalore, IndiaOnsiteFull-timeSenior Level5+ yrs exp

Top focus

SreVp DataSenior Data EngineerSenior Data AnalystSenior Data Scientist

Job Overview As a Senior Data Platforms SRE, you will serve as a technical escalation point and resolve higher complexity incidents and technical problems end to end. This role requires advanced troubleshooting expertise in data platforms, operational discipline, and the ability to drive service improvements across the governance team.

You will work directly with Databricks and Snowflake platforms deployed on Azure and AWS, investigating non-trivial technical issues, implementing platform guardrails, and mentoring junior engineers to elevate the team's operational maturity.

The position combines hands-on technical work with service improvement initiatives, including defining operational metrics, tracking incident resolution trends, and contributing to SLA/SLO verification to ensure platform reliability and cost effectiveness.

Level 2 support engineers handle technically complicated issues that exceed the competence of Level 1 support, focusing on incident resolution through deep technical knowledge and advanced troubleshooting skill. Key Responsibilities Advanced Incident Handling & Escalation: Investigate non-trivial job and pipeline failures, performance and capacity symptoms, and recurring platform issues.

Coordinate escalation to platform engineering or vendors with complete diagnostics, including root cause analysis, system logs, and performance metrics to ensure rapid resolution. L2 engineers perform deep diagnoses to trace and resolve issues, managing support tickets to ensure timely resolution of all technical incidents .

Resource & Cost Governance: Perform spend anomaly checks to identify runaway queries, oversized clusters, and job loops that drive uncontrolled costs. Drive remediation actions including shutdowns and guardrails using policy and operational authority, ensuring alignment with financial controls and budget targets.

Platform Guardrails Implementation: Implement and maintain cluster policies and operational restrictions that prevent uncontrolled cost growth, aligned with governance objectives. Configure and enforce workspace-level controls, quota management, and resource limits across Databricks and Snowflake environments.

Environment Lifecycle Execution: Support workspace and environment provisioning and decommissioning activities, including Infrastructure as Code driven provisioning where defined. Execute environment setup, configuration validation, and teardown procedures following established operational standards.

Service Improvement & Metrics: Help define and track operational metrics including incident volumes, resolution trends, and mean time to resolution. Contribute to SLA/SLO verification and improve documentation quality and consistency across the service to support continuous operational improvement.

L2 support engineers maintain detailed documentation and knowledge articles as key activities in their role. Mentoring & Coaching: Mentor junior engineers on troubleshooting methodologies, ticket handling best practices, and technical documentation standards.

Review tickets for completeness, ensure clean handoffs between support tiers, and raise the overall operational maturity of the team through knowledge sharing and skills development. IQVIA is a leading global provider of clinical research services, commercial insights and healthcare intelligence to the life sciences and healthcare industries.

We create intelligent connections to accelerate the development and commercialization of innovative medical treatments to help improve patient outcomes and population health worldwide. Learn more at https://jobs.iqvia.com IQVIA is committed to integrity in our hiring process and maintains a zero tolerance policy for candidate fraud.

All information and credentials submitted in your application must be truthful and complete. Any false statements, misrepresentations, or material omissions during the recruitment process will result in immediate disqualification of your application, or termination of employment if discovered later, in accordance with applicable law.

We appreciate your honesty and professionalism.

Required skills

DatabricksSnowflakeAzureAWSIncident ManagementTroubleshootingInfrastructure as Code
Posted on JobRush — the end-to-end AI job-search platform.