Related keywords: mental health remote jobdata engineer remote jobprogramming remote job
The position of Staff Site Reliability Engineer at Achievers focuses on the intersection of software engineering and operations, dedicated to building and maintaining reliable, scalable cloud systems. This role is pivotal for the management and enhancement of Achievers' global infrastructure, which relies heavily on modern technologies like Google Cloud Platform (GCP) and Kubernetes (GKE). Furthermore, the integration of AI-driven workflows is a key aspect of this position, making it an exciting opportunity for candidates with a passion for innovative technology.
In this role, the Staff Site Reliability Engineer will lead several critical initiatives:
Architectural Leadership: Design and improve global, high-availability infrastructures utilizing GCP and GKE, ensuring that systems can handle high concurrency and are resilient against failures.
AI & Automation Strategy: Implement solutions that reduce repetitive operational tasks. This includes developing AI-integrated workflows such as automated alerts, bots for incident triage, and methods to handle infrastructure drift via automation.
Cross-Functional Influence: Actively collaborate with various teams including Product, Engineering, and Leadership to identify long-term reliability risks while managing complex change.
Infrastructure-as-Code (IaC): Establish best practices using Infrastructure-as-Code tools like Terraform, enhancing workflows for development teams.
System Resiliency: Lead high-level initiatives in designing disaster recovery methods, networking across multiple regions, and creating architecture that adheres to zero-trust security standards.
Technical Mentorship: Guide and mentor fellow SRE members through design reviews and promote best practices to elevate the team's skill set and capabilities.
The ideal candidate will possess the following skills and qualifications:
Experience: Approximately 15 years of systems engineering experience is necessary. This should include in-depth knowledge of Linux kernels, TCP/IP networking, and cloud-native architecture.
GCP Proficiency: Hands-on experience managing production workloads on Google Cloud Platform and GKE is essential.
AI and Automation Integration: Proven ability or a strong vision for utilizing AI tools to automate System Reliability Engineering tasks will set candidates apart.
Programming Skills: Proficiency in programming languages such as Python or Go for developing internal tools and automation frameworks.
Observability Mastery: A comprehensive knowledge of observability frameworks, with the ability to leverage these tools for data-driven decision-making.
Database Management: A strong foundation in managing relational databases like MySQL and MongoDB at scale.
Communication Skills: Exceptional communication abilities are critical for converting complex technical infrastructure challenges into actionable insights for non-technical stakeholders.
While not required, applicants with the following qualifications may have a competitive edge:
Practical experience with Service Mesh (e.g., Istio) and advanced features of GCP Networking.
Background in migrating legacy automation systems to AI-augmented CI/CD workflows.
Achievers prides itself on fostering a culture of recognition and personal growth. The company emphasizes the importance of their employee recognition and rewards platform in building a culture where employees feel seen and valued every day. The team is composed of passionate builders who are committed to innovation and personal development. With over 4.3 million users across 190 countries, Achievers offers a diverse environment with significant opportunities.
Salary: The salary range for this position is between $124,000 and $170,000, reflecting the experience, skills, and market data of the candidate.
Annual Compensation Review: Compensation is reviewed at least annually based on individual performance and role impact.
Health Benefits: Comprehensive health insurance and life coverage begin on the first day of employment.
Flexible Vacation: Employees can recharge with flexible vacation options to maintain a work-life balance.
Parental Leave: Generous parental leave policies accompanied by a top-up from the employer.
Retirement Contributions: Employer-matched RRSP contributions enhance future financial security.
Professional Development: Opportunities for professional growth through programs such as LinkedIn Learning and mentorship.
Employee Assistance Program: Comprehensive mental health and legal counseling services offered to employees and their families.
Diversity and Inclusion: Achievers promotes a diverse workplace where everyone can do their best work, ensuring an accessible recruitment process for candidates from all backgrounds.
The working model is primarily remote, but employees are encouraged to spend time at the beautiful Liberty Village office in Toronto to foster collaboration and relationships. Regular events designed to promote connection, belonging, and well-being within the team are also part of the company culture.
The Staff Site Reliability Engineer role at Achievers presents an exceptional opportunity for a seasoned professional seeking to drive impactful technological advancements. The competitive salary and comprehensive benefits, partnered with a nurturing work culture, make this position appealing to any job seeker eager to make a difference in a dynamic environment.
This job offer was originally published on remoteOK.com
January 31, 2026
15 views
0 clicks on Apply Now
This job offer summary has been generated using automated technology. While we strive for accuracy, it may not always fully capture the nuances and details of the original job posting. We recommend reviewing the complete job listing before making any decisions or applications.