site reliability engineer jobs
Senior Reliability Engineer
Easily apply4IR SolutionsRemote- $130,000–$185,000 a year
- Full-time
- On call
- Vision care
- Dental care
- Design complex IT/OT architectures—in cloud and on-prem—that are secure, recoverable, and sized appropriately.
- Moneris SolutionsToronto, ON
- $142,000–$186,000 a year
- Full-time
- On call
- Paid time off
- Profit sharing
- Lead and manage SRE engineers supporting the reliability, availability, and performance of business‑critical applications and platforms.
- Moneris SolutionsToronto, ON
- $142,000–$186,000 a year
- Full-time
- On call
- Paid time off
- Profit sharing
- Lead and manage SRE engineers supporting the reliability, availability, and performance of business‑critical applications and platforms.
- Alten CanadaMontréal, QC H2L 2N2
- $100,000 a year
- Full-time +1
- Paid time off
- Language training provided
- RRSP match
- Extended health care
- Chef de file mondial de l’industrie de l’ingénierie et du conseil TI avec plus de 58 000conseiller·e·s à travers le monde, le Groupe ALTEN optimise la…
Integrations & Support Engineer
Easily applyHCB CanadaNorth York, ON M3B 2T5- $75,000–$90,000 a year
- Full-time +1
- On call
- Mileage reimbursement
- Paid time off
- Vision care
- Dental care
- Paid sick leave
- Employee assistance program
- Improve deployment reliability, automation, and operational efficiency with new tooling and scripting.
- The Integration & Support Engineer is the operational and…
View similar jobs with this employerCanadian TireToronto, ON- $79,000–$131,000 a year
- Full-time
- Weekends as needed +3
- Profit sharing
- Collaborate with technology leaders and stakeholders to define the SRE strategy and best practices for ensuring the reliability, scalability, and performance of…
View similar jobs with this employerSenior Safety & Reliability Engineer
Easily applyNewHORIZON AIRCRAFTLindsay, ON- Full-time +1
- Employee stock purchase plan
- Vision care
- Dental care
- Stock options
- Relocation assistance
- Life insurance
- Develop and maintain reliability predictions using industry standards.
- Perform reliability analyses (e.g., FMEA, FTA, RBD) to identify and mitigate potential…
View similar jobs with this employerSenior Safety & Reliability Engineer
Easily applyNewHORIZON AIRCRAFTOttawa, ON- Full-time +1
- Employee stock purchase plan
- Vision care
- Dental care
- Stock options
- Relocation assistance
- Life insurance
- Develop and maintain reliability predictions using industry standards.
- Perform reliability analyses (e.g., FMEA, FTA, RBD) to identify and mitigate potential…
3D Modelling & Production Drawing Support Technologist / Engineer
Easily applyMultiple openingsBlumara Corp.Halifax, NS- Full-time +1
- Paid time off
- Vision care
- Dental care
- Life insurance
- Employee assistance program
- Disability insurance
- Coordinating with engineers, technologists, designers, and project teams.
- Blumara is looking for Engineers and Technologists in Halifax to support 3D modelling…
Ingénieur(e) en fiabilité / Reliability Engineer
Easily applyCOREcruitmentBromont, QC- $85,000 a year
- Full-time
- 4–5 years of experience in reliability engineering, industrial maintenance, or a similar technical environment.
Senior Enterprise Platform Reliability Engineer
Easily applyEssex Weld SolutionsEssex, ON N8M 3G6- $80,000–$90,000 a year
- Full-time +1
- Dental care
- Life insurance
- Employee assistance program
- Paid vacation
- Extended health care
- Company events
- Operational documentation and reliability process development.
- Take ownership of infrastructure reliability and operational continuity.
Senior RPA Automation Engineer
Easily applyFuture ElectronicsKirkland, QC H9H 3L1- From $90,000 a year
- Full-time
- Tuition reimbursement
- Dental care
- Life insurance
- Employee assistance program
- Disability insurance
- On-site gym
- Optimize automations for reliability and performance.
- We are establishing a new enterprise automation capability to improve and optimize business processes…
Senior Pega Production Support Engineer (L3 Support)
Easily applyNewHRConnectionsToronto, ON M5G 1S5- $120,000 a year
- Permanent
- Strong experience supporting enterprise applications on the Pega Platform.
- Hands-on experience with Pega Tracer, Clipboard, Rules Debugging, Job Schedulers,…
- The Land Administration Company IncCanada
- $130,000–$155,000 a year
- Full-time +1
- On call
- Paid time off
- Vision care
- Dental care
- Life insurance
- Disability insurance
- Casual dress
- Define and track service reliability objectives for critical systems.
- Translate technical risks, reliability concerns, and infrastructure investment needs into…
- DexterraHalifax, NS
- $65–$70 a year
- Full-time
- We are seeking a Site Technical Manager to oversee the daily operation and maintenance of multiple facilities.
- Provides verbal and written reports as requested.
- Tangerine BankToronto, ON M2H 0A1
- We offer flexible and accessible banking options, innovative products, and award-winning Client service.
- Our environment is primarily Google Cloud, and this…
By creating a job alert, you agree to our Terms . You can change your consent settings at any time by unsubscribing or as detailed in our terms.
People also searched:
Career Resources:
Job Post Details
Job details
Pay
- $130,000–$185,000 a year
Job type
- Full-time
Shift and schedule
- On call
Benefits
Pulled from the full job description
- Vision care
- Dental care
Full job description
About This Role
We deliver mission-critical IT/OT infrastructure—in cloud and on-prem—for industrial customers that can't afford downtime.
Small team. Hard problems. Practical solutions. No bureaucracy. No blame. No egos.
We ship it, own it, and make it better—blameless but accountable, shoulder to shoulder. We work hard. We stay human. We trust each other. We figure it out.
If you know what to do, delight in building it, and feel the ownership to support it—keep reading.
What You'll Do Customer Delivery
- Design complex IT/OT architectures—in cloud and on-prem—that are secure, recoverable, and sized appropriately
- Work directly with customers to understand their environment and estimate effort
- Own customer solutions end-to-end: requirements design build support
- Build or use reusable modules when it makes sense—build bespoke when it doesn't
- Deploy and manage Kubernetes-based infrastructure and stateful applications across diverse customer environments
Incident Response & Ownership
- Participate in on-call rotation alongside the rest of the team—everyone here supports what we ship
- Own incidents through resolution, then drive root cause analysis that eliminates the class of problem—not just the symptom
- Build the runbooks, alerts, and automation that make the next incident less likely or less painful
Infrastructure & Automation
- Work with Infrastructure-as-Code tools to provision and manage diverse customer environments
- Implement and maintain GitOps workflows for in-cluster deployments
- Ensure all infrastructure and application changes are declarative and version-controlled
- Automate self-healing and system updates—reduce manual intervention and keep environments current
Observability & Reliability
- Build and maintain monitoring, alerting, and dashboards using Prometheus, Loki, and Grafana
- Define SLIs and SLOs that reflect what actually matters to customers
- Surface real problems, reduce noise, and continually improve reliability and team efficiency
Shape the Future
- We don't have everything figured out. You'll help build, create, and shape how we operate
- Contribute to standards, patterns, and processes that make us better—not bureaucracy for its own sake
- Bring the SRE mindset: automate toil, prefer boring/stable systems, and relentlessly improve
What We're Looking For
- 5+ years in SRE, DevOps, or Infrastructure Engineering
- Strong Kubernetes skills in production environments—you'll troubleshoot real clusters, not just tutorials
- Experience with GitOps tooling (ArgoCD, Rancher Fleet, FluxCD, or similar)
- Solid understanding of Infrastructure-as-Code concepts (Terraform, Pulumi, Crossplane, or similar)
- Real incident response experience—you've been on-call, stayed calm, and fixed things under pressure
- Comfort with heterogeneous environments—every customer site is a little different and you need to adapt
- Clear communication skills—you can write a useful runbook, gather requirements on a customer call, and document what you learned
- Ability to operate in ambiguity—we're building clarity, not waiting for it
Strong Plus
- Azure experience (our primary cloud)
- Experience with SUSE ecosystem (SLE Micro, RKE2, Rancher, Longhorn)
- Industrial, manufacturing, or OT environment experience
- Familiarity with Inductive Automation's Ignition platform and MQTT
- Experience in a startup or small-team environment where you wore many hats
The SRE Mindset
This matters here. We need someone who:
- Sees repetitive manual work as a problem to automate, not a fact of life
- Prefers stable, predictable, "boring" production over clever and fragile
- Supports what they create—no throwing things over the wall
- Treats incidents as opportunities for systemic improvement
- Works well on a small team where everyone carries weight
- Stays current with SRE practices, emerging technologies, and cloud/edge trends
A Few Honest Words
This is a startup. Hours can be demanding. Priorities shift. You won't have a team of 30 backing you up.
What you will have: the autonomy to make real decisions, teammates who own their work, and customers who genuinely depend on what we build. We work hard because the work matters—and we have fun doing it.
If you want a structured 9-5, predictability, and a clear ladder—this probably isn't the right fit.
If you want to build, learn, and be part of something that's actually going somewhere—let's talk.
What We Offer
- Comprehensive benefits (Medical, Dental, Vision, 401K)
- Fully remote—work from anywhere in the world
- A team where it's safe to be honest, learn from mistakes, and get better together
Additional Information
We are committed to the principle of equal employment opportunity for all employees and to providing a work environment free from discrimination and harassment.
Pay: $130,000.00-$185,000.00 per year
Work Location: Remote