Manager, Reliability Engineering

Related keywords: remote job java developeraccount manager remote jobdevops remote job

This page contains product affiliate links.

Company Overview

Wiser Solutions, a leader in omnichannel retail intelligence, empowers over 750 global brands and retailers. They provide data collection and analysis with remarkable 98% accuracy, serving as a preferred source for insights into pricing dynamics, promotional effectiveness, competitive activity, and retail execution. By integrating various platforms, Wiser aims to enhance its efficiency and reduce complexity in operations, ultimately facilitating better decision-making for their clients.

Job Overview

The Manager of Reliability Engineering position at Wiser Solutions is a pivotal role responsible for collaborating with engineering and product leadership to outline the long-term roadmap and quarterly deliverables of the team. Located remotely in Canada, this role involves overseeing a team dedicated to the management and development of core cloud infrastructure and ensuring a seamless developer experience.

Key Responsibilities

Lead Developer Experience and Cloud Infrastructure

You will manage a team focused on enhancing the overall experience for developers, ensuring a stable, scalable cloud-native platform.

Build Developer-Centric Platforms

Your responsibilities include designing and improving internal platforms, CI/CD pipelines, and tools to enable product engineers to launch features efficiently and securely.

Foster Developer-Centric Product Thinking

You will be required to gather developer feedback, monitor platform metrics, and iterate based on user needs to ensure the infrastructure serves internal customers effectively.

Drive AI-First Infrastructure Innovation

Leading initiatives aimed at infrastructure automation and self-healing systems will be a crucial part of your role to enhance reliability and reduce operational complications.

Champion Infrastructure Automation

Striving towards the full automation of infrastructure provisioning, deployment, and monitoring will help in ensuring repeatability and reliability.

Implement SRE Best Practices

Introducing Site Reliability Engineering principles to improve system resilience and uptime will be part of your job role.

Lead Platform Consolidation

You will spearhead efforts to unify the platforms that originated from multiple acquisitions while managing any related technical debt.

Enhance Observability and Reliability

Increasing system observability through comprehensive metrics, tracing, and alerting will be essential to detect and resolve production issues swiftly.

Collaborate Across Engineering

Working closely with various teams—application developers, QA, security, and product teams—to ensure a strategic alignment of infrastructure and development platforms is vital.

Improve SLAs and Operational Excellence

This role will necessitate continuous improvement of service level indicators (SLIs), objectives (SLOs), and agreements (SLAs) to exceed uptime goals.

Manage a Distributed High-Performing Team

You will lead a distributed team of infrastructure and Site Reliability Engineering (SRE) engineers, focusing on recruitment, mentoring, and team development.


🎁 Get your FREE ebook!

Share this page using the buttons below and download our e-book "Essential Soft Skills for Today’s World" instantly.

Once shared, you’ll see the download button on any page you visit!

✅ Thanks for sharing!

You can now download your ebook:

📥 Download "Essential Soft Skills for Today’s World"

Foster a Culture of DevOps and Ownership

Promoting ownership amongst development teams through tools and platforms that support full lifecycle responsibilities will be encouraged.

Technologies Used

Wiser Solutions utilizes a variety of technologies to support their engineering practices:

  • Cloud & Infrastructure: AWS, Docker, Kubernetes (EKS), Terraform, Vault
  • Observability & Reliability: Prometheus, Grafana, New Relic, PagerDuty
  • CI/CD & DevEx: GitHub, GitHub Actions, Concourse, ArgoCD
  • Languages & Data: Node.js, Java, Python, PostgreSQL, MongoDB, RabbitMQ, Elasticsearch

Required Qualifications

Candidates should have:

  • 10+ years of professional experience in infrastructure, SRE, or platform engineering, with 2+ years in a leadership role.
  • A Bachelor's degree in Computer Science, Engineering, or a related field is preferred.
  • Proven experience in managing high-performing teams and the ability to coach and mentor engineers.
  • Proficiency in sprint planning and executing cross-functional collaboration with excellent communication skills.
  • An understanding of treating internal platforms as products and applying product management principles to infrastructure services.
  • Strong operational expertise within high-availability systems, distributed architectures, and incident response.
  • Experience managing CI/CD systems and developer tooling to enhance engineering productivity.

Bonus Points

Candidates with experience in the following areas will have an advantage:

  • Large-scale PostgreSQL, MongoDB, RabbitMQ, or Elasticsearch deployments.
  • Managing data center environments.
  • Familiarity with Windows Server infrastructure in hybrid cloud setups.
  • Historical exposure to time-series databases or event-sourced system architectures.

Career Prospects and Salary

Although specific salary information is not disclosed in the provided text, the role as Manager, Reliability Engineering suggests a competitive salary based on experience and industry standards in IT management. Given the responsibilities and required qualifications, candidates can expect a compensation package that reflects their skill level, taking into account the remote nature of the job and location.

Conclusion

This position at Wiser Solutions is a compelling opportunity for seasoned professionals seeking a challenging role in a reputable company, focusing on enhancing the developer experience in a cloud environment. If you are well-versed in the required technologies and have a passion for improving infrastructure processes, this role may be the perfect next step in your career.



This job offer was originally published on jobicy.com

Wiser Solutions

Remote- Canada

Software development

Full-time

June 20, 2025

4 views

0 clicks on Apply Now


Similar job offers


This job offer summary has been generated using automated technology. While we strive for accuracy, it may not always fully capture the nuances and details of the original job posting. We recommend reviewing the complete job listing before making any decisions or applications.