About the Role:
As a Sr Principal Engineer in our IT Resiliency Office, you will play a pivotal role in ensuring the reliability and resilience of our technology infrastructure. You will lead the charge in designing, implementing, and maintaining robust systems that can withstand disruptions and recover swiftly.
Key Responsibilities:
- Resiliency Architecture: Design and implement resilient solutions for on-premises and cloud environments.
- Chaos Engineering: Drive chaos engineering initiatives to proactively identify and mitigate potential system vulnerabilities.
- Monitoring and Alerting: Establish and maintain standards for system monitoring and alerting, ensuring rapid detection and response to incidents.
- Collaboration: Work with cross-functional teams to align resiliency efforts and prioritize initiatives.
- Automation: Use infrastructure as code (IaC) tools (e.g., Ansible) to automate infrastructure provisioning and configuration for improved efficiency and consistency.
- Incident Response: Participate in incident response activities, analyzing root causes and implementing preventive measures.
- Standards and Best Practices: Develop and promote best practices for resiliency engineering across the organization.
- Reporting and Documentation: Provide regular reports on resiliency activities, risks, and improvements to leadership.
Qualifications and Experience:
- Bachelor’s degree in engineering, computer science, or a similar discipline
- 5-10 years of experience in platform engineering, DevOps, and infrastructure automation
- Strong understanding of cloud technologies (AWS, Azure, GCP) and on-premises infrastructure
- Familiarity with infrastructure as code (IaC) tools, such as Ansible and Terraform
- Proven ability to design and implement highly available and fault-tolerant systems
- Knowledge of chaos engineering principles and practices
- Excellent problem-solving and troubleshooting skills
- Strong communication and collaboration skills
What We Offer:
- Competitive salary and benefits package
- Opportunity to work on cutting-edge technologies
- A collaborative and innovative work environment
- Potential for career growth and development
Join a Global Leader:
If you are a passionate and experienced resiliency engineer, we invite you to join our team and contribute to building a more resilient future.
Please submit your resume online. The digitalxnode evaluation team will contact you if your profile is shortlisted. We will keep your data in our repository, and our team may contact you about other positions.