Specialist – Tools & Infrastructure Reliability (March of Giants)
Ubisoft2•1d ago
CanadaOnsiteFull-timeMid Level5+ yrs exp
Top focus
Infrastructure EngineerMl Infra Engineer
- Ubisoft is a global leader in gaming, with teams across the world creating original and memorable experiences — from Assassin’s Creed and Rainbow Six to Just Dance and more. We believe that diverse perspectives help both players and teams thrive. If you’re passionate about innovation and pushing the boundaries of entertainment, join us on our journey and help create the unknown!
- The position holder ensures the continuous viability, stability
- performance of operational tools and infrastructure supporting GaaS game development. They design, develop
- operate tools and pipelines (build, configuration, versioning, deployment, publishing) to simplify, optimize
- automate development processes. They train and support teams in testing, quality, security
- automation prior to launch
- promote best practices to deliver a reliable, high‑performance gameplay experience. Responsibilities Advising development teams on technology and tooling choices to improve visibility, control
- robustness of internal and external services. Training, supporting
- guiding development teams in improving continuous integration and continuous deployment systems. Researching, integrating
- developing technologies that enhance reliability, performance
- productivity. Designing, operating
- owning build, configuration, versioning
- publishing pipelines (including packaging, signing, SBOM, artifacts). Implementing and supporting CI/CD tooling (automated tests, quality, security), IaC
- secure, reproducible, controlled deployments. Maintaining tooling products to deliver exemplary service quality to the project (internal SLOs). Implementing and maintaining game deployment guidelines and documenting infrastructure implementation and technical specifications for network and server systems. Collaborating with development teams to diagnose and resolve issues related to online services. Establishing and maintaining incident‑management processes. Managing Cloud environments using appropriate tools. Developing tools and processes that allow developers to deploy services safely and efficiently. Defining and tracking SLA/SLO/SLI, deploying observability (logs, metrics, traces), managing capacity
- contributing to FinOps initiatives.
- Education University degree in Computer Science, Computer Engineering, or any relevant field. Experience 5–8 years of experience in software development and system administration. Experience with infrastructure automation (Cloud). Experience managing high‑throughput systems. Experience designing resilient, scalable, and redundant architectures. Experience in software development and optimization. Skills & Knowledge Strong analytical and synthesis skills. Ability to solve complex problems. Ability to adapt quickly to change. Ability to work under pressure. Strong knowledge of distributed systems. Excellent knowledge of Linux and Windows system administration. Programming languages: Python, Go, C#, or C++. CI/CD (GitLab, GitHub, Azure DevOps), IaC (Terraform, CloudFormation), containers & orchestration (Docker, Kubernetes). Observability: Prometheus/Grafana, ELK/EFK, OpenTelemetry (or equivalent). Cloud: AWS, Azure, GCP
- networks (DNS, CDN, load balancing, TLS). Assets: Unreal Engine 5 (or similar engine), DevOps methodology, infrastructure automation experience.
Required skills
PythonGoC#C++GitLabGitHubAzure DevOpsTerraformCloudFormationDockerKubernetesPrometheusGrafanaAWSAzure