
Site Reliability Engineer
Job Description
Posted on: August 15, 2025
Job Title: Site Reliability Engineer
Experience Level: Level 3 (senior): 5-7 years
Location: Montreal (Day 1 onboarding onsite / in-office presence 3x per week)
Duration: 12+ months (extendable contract)
Job Responsibilities
• Maximizing the availability and performance of supported systems through optimized and automated plant management, ongoing problem management, and architecture reviews with dev-side peers
• Reduction of the cost of support (hours of effort) through the elimination of operational issues, optimization and automation of tasks, development of operational tools and driving client self-service to minimize constraints
• Identification and prioritization of technical debt that is impacting client (i.e. software developers) productivity, system reliability or the efficiency of the Ops team
• Collaboration with other SREs in Application Infrastructure to share solutions
• Complex troubleshooting of front-to-back development environment issues
• Maximizing Ops team product knowledge and support capabilities to minimize the escalation rate to the department's feature engineers/developers
• Consulting with clients (the Firm's development community) to maximize productivity, including troubleshooting their issues with using MSDE's solutions
• Experimentation with new tools and techniques
• Being operationally responsive, including sharing the on-call rotation with the rest of the global team (with a time-off-in-lieu system)
Required Qualifications / Skills
• Strong Linux troubleshooting skills
• Task automation experience in any programming language, preferably Python
• Practical experience implementing monitoring / observability solutions using Prometheus and Grafana
• Experience with version control (Bitbucket, GitHub), issue tracking (Jira), continuous integration (Jenkins, Azure DevOps, GitHub Actions), automated testing, or deployment automation
• Excellent communication skills to work with peers / third-party vendors
• Confident collaboration skills
Desired Skills
• Experience with site reliability engineering practices, like service level objectives (SLOs), error budgets, blameless postmortems, toil reduction
• Experience with Docker / Kubernetes
