Specialist – Tools & Infrastructure Reliability (March of Giants)

Ubisoft2•1d ago

CanadaOnsiteFull-timeMid Level5+ yrs exp

Apply now

Top focus

Infrastructure EngineerMl Infra Engineer

Ubisoft is a global leader in gaming, with teams across the world creating original and memorable experiences — from Assassin’s Creed and Rainbow Six to Just Dance and more. We believe that diverse perspectives help both players and teams thrive. If you’re passionate about innovation and pushing the boundaries of entertainment, join us on our journey and help create the unknown!
The position holder ensures the continuous viability, stability
performance of operational tools and infrastructure supporting GaaS game development. They design, develop
operate tools and pipelines (build, configuration, versioning, deployment, publishing) to simplify, optimize
automate development processes. They train and support teams in testing, quality, security
automation prior to launch
promote best practices to deliver a reliable, high‑performance gameplay experience. Responsibilities Advising development teams on technology and tooling choices to improve visibility, control
robustness of internal and external services. Training, supporting
guiding development teams in improving continuous integration and continuous deployment systems. Researching, integrating
developing technologies that enhance reliability, performance
productivity. Designing, operating
owning build, configuration, versioning
publishing pipelines (including packaging, signing, SBOM, artifacts). Implementing and supporting CI/CD tooling (automated tests, quality, security), IaC
secure, reproducible, controlled deployments. Maintaining tooling products to deliver exemplary service quality to the project (internal SLOs). Implementing and maintaining game deployment guidelines and documenting infrastructure implementation and technical specifications for network and server systems. Collaborating with development teams to diagnose and resolve issues related to online services. Establishing and maintaining incident‑management processes. Managing Cloud environments using appropriate tools. Developing tools and processes that allow developers to deploy services safely and efficiently. Defining and tracking SLA/SLO/SLI, deploying observability (logs, metrics, traces), managing capacity
contributing to FinOps initiatives.
Education University degree in Computer Science, Computer Engineering, or any relevant field. Experience 5–8 years of experience in software development and system administration. Experience with infrastructure automation (Cloud). Experience managing high‑throughput systems. Experience designing resilient, scalable, and redundant architectures. Experience in software development and optimization. Skills & Knowledge Strong analytical and synthesis skills. Ability to solve complex problems. Ability to adapt quickly to change. Ability to work under pressure. Strong knowledge of distributed systems. Excellent knowledge of Linux and Windows system administration. Programming languages: Python, Go, C#, or C++. CI/CD (GitLab, GitHub, Azure DevOps), IaC (Terraform, CloudFormation), containers & orchestration (Docker, Kubernetes). Observability: Prometheus/Grafana, ELK/EFK, OpenTelemetry (or equivalent). Cloud: AWS, Azure, GCP
networks (DNS, CDN, load balancing, TLS). Assets: Unreal Engine 5 (or similar engine), DevOps methodology, infrastructure automation experience.

Required skills

PythonGoC#C++GitLabGitHubAzure DevOpsTerraformCloudFormationDockerKubernetesPrometheusGrafanaAWSAzure