Senior Site Reliability Engineer
Senior Site Reliability Engineer Write your story with Zip Join Zip’s Technology function, responsible for building and... more info
Senior Site Reliability Engineer - Remote

Airlock Digital is the global leader in application control and allowlisting, providing cutting-edge solutions that protect organizations worldwide from cyber threats. Our innovative platform allows businesses to "Allow what you trust. Prevent what you don't," helping them safeguard critical systems with confidence. We are expanding rapidly across Australia, North America, and EMEA to support our growing global customer base.

Our team members are friendly, collaborative, and humble rock stars – we work hard and are always willing to help each other out. We are proud to have been recognized as one of Australia's Greatest Places to Work in 2024. Our leadership team emphasizes flexibility, understanding that team members thrive when they can work in an environment that suits them.

What We Are Looking For

We are seeking to appoint a Senior Site Reliability Engineer to join our dynamic team and ensure the reliability and scalability of our systems and services. This opportunity will suit someone who has a passion for optimizing performance and thrives in a collaborative environment.

A brief view into what the role entails:

- Design, implement, and maintain highly available, scalable, and fault-tolerant systems and services.
- Introduce best practices around observability, SLOs, and reliability.
- Continuously monitor the performance, availability, and security of Airlock Digital systems and services, proactively identifying and resolving issues.
- Identify areas for improvement across the organization and drive engineering-wide technical change in the field of site reliability.
- Collaborate with cross-functional teams to implement and maintain deployment pipelines, monitoring tools, and automated testing frameworks.
- Develop and maintain documentation of systems, processes, and procedures to ensure knowledge transfer and continuity.
- Lead incident response, root cause analysis, and post-mortem activities to identify and address underlying issues.
- Work with Software Developers to design and implement scalable and resilient application services and infrastructure.
- Participate in on-call rotation to ensure 24/7 support for critical systems and services.

What You’ll Need

- 5+ years of hands-on experience in Site Reliability and Observability Engineering, DevOps, or Infrastructure Engineering.
- Commercial experience in at least one programming language such as Python or Go.
- Solid experience with automation tools such as Ansible and containerization tools like Docker and Podman.
- Deep understanding of distributed systems, networking, operating systems, and cloud computing.
- Proven ability to work autonomously, managing tasks independently with minimal direction.
- Strong troubleshooting and problem-solving skills, with experience in incident response and root cause analysis.
- Systematic problem-solving approach, effective communication skills, and a sense of ownership and drive.
- Experience with Splunk for log management and data analysis.
- Excellent communication skills and the ability to share ideas and opinions in a collaborative environment.

Applicants will need to have unrestricted rights to work in Australia to be eligible for this role. Successful applicants will need to be willing to obtain a National Police Check as part of the recruitment process.

Bonus Points (nice to have, but not essential):

- Experience in leading and mentoring team members.
- Familiarity with monitoring tools such as Datadog, New Relic, Dynatrace, or Grafana.
- Experience with Zabbix for monitoring and analytics.
- Previous experience working in a start-up or scale-up environment.

What We Offer

- Flexible Work Environment: This role can be either a hybrid position working remotely and in person at the Adelaide office, or a fully remote role, based anywhere in Australia.
- Paid Volunteering Time: We encourage giving back to the community with dedicated paid volunteering hours.
- Birthday Leave: We offer team members the opportunity to celebrate their birthday annually with a paid day off during their birthday month.
- Paid Parental Leave: We offer 6 weeks of paid parental leave for team members who have been with us for at least 6 months.
- Home Office Allowance: A $1,200 allowance to help you set up your home office.

We believe in supporting our team members both personally and professionally, fostering an environment where everyone can thrive.

Please note: All interviews are conducted virtually. We encourage you to apply as soon as possible. No contact from recruitment agencies, thank you.
Senior Site Reliability Engineer (Product SRE) Location: Melbourne, AU / Sydney, AU / Brisbane, AU We welcome applications... more info
Career Opportunities: Senior Site Reliability Engineer - .Net/C# focused (1017745) Requisition ID 1017745 - Posted - WooliesX... more info