Location: Raleigh, NC, USA
**The position is described below. If you want to apply, click the Apply Now button at the top or bottom of this page. After you click Apply Now and complete your application, you'll be invited to create a profile, which will let you see your application status and any communications. If you already have a profile with us, you can log in to check status.**
Need Help?
_If you have a disability and need assistance with the application, you can request a reasonable accommodation. Send an email to Accessibility (...@truist.com?subject=Accommodation request)_
_(accommodation requests only; other inquiries won't receive a response)._
**Regular or Temporary:**
Regular
**Language Fluency:** English (Required)
**Work Shift:**
1st shift (United States of America)
**Please review the following job description:**
The Head of Cloud Automation and Observability is responsible for driving the automation and observability strategy across the organization's cloud infrastructure. This role ensures that cloud environments are efficient, scalable, and highly reliable through the implementation of cutting-edge automation frameworks and observability practices. The successful candidate will lead a team focused on enhancing cloud operations by automating repetitive tasks, improving monitoring capabilities, and ensuring the seamless operation of cloud services.
**ESSENTIAL DUTIES AND RESPONSIBILITIES**
The following is a summary of the essential functions for this job. Other duties, both major and minor, may be performed that are not mentioned below. Specific activities may change from time to time.
Primary Roles & Responsibilities
Automation Strategy & Implementation
o Develop and execute a comprehensive automation strategy for cloud infrastructure, focusing on efficiency, scalability, and resilience.
o Lead the design, development, and deployment of automation tools and frameworks to streamline cloud operations, including provisioning, configuration management, and incident response.
o Collaborate with DevOps, engineering, and IT teams to identify opportunities for automation and to implement Infrastructure as Code (IaC) practices using tools such as Terraform, Ansible, or CloudFormation.
Observability Strategy & Monitoring
o Establish and lead the observability strategy for cloud environments, ensuring end-to-end visibility into system performance, availability, and health.
o Implement and maintain advanced monitoring, logging, and tracing solutions to proactively detect and resolve issues in cloud services.
o Work closely with SRE (Site Reliability Engineering) teams to set up alerting mechanisms and dashboards that provide actionable insights into the performance of cloud resources.
Operational Excellence
o Drive continuous improvement of cloud operations by automating routine tasks, reducing manual intervention, and minimizing human error.
o Ensure that all cloud environments are monitored 24/7, with robust alerting systems in place to respond quickly to incidents.
o Develop and enforce best practices for cloud automation and observability, including the use of CI/CD pipelines, automated testing, and continuous delivery practices.
Collaboration & Stakeholder Management
o Collaborate with cross-functional teams, including IT, security, and application development teams, to ensure alignment on automation and observability goals.
o Provide leadership and guidance to teams on best practices for cloud automation, monitoring, and observability.
o Act as a key advisor to senior leadership on cloud automation and observability trends, tools, and technologies.
Innovation & Continuous Improvement
o Stay updated on the latest trends and technologies in cloud automation, DevOps, and observability to drive innovation within the organization.
o Evaluate and implement new tools and technologies that enhance automation and observability capabilities.
o Promote a culture of continuous learning and improvement, encouraging the adoption of new techniques and tools that drive operational excellence.
Team Leadership & Development
o Lead and mentor a team of cloud automation engineers, observability specialists, and SREs, fostering a culture of collaboration and innovation.
o Provide ongoing training and development opportunities to ensure the team stays at the forefront of cloud automation and observability practices.
o Develop and manage performance metrics to assess the effectiveness of automation and observability initiatives.
Continuous Improvement & Innovation
o Stay abreast of emerging trends, technologies, and regulations in cloud computing to continually improve the cloud governance framework.
o Implement automation tools and processes to enhance governance, compliance monitoring, and risk management in cloud environments.
o Drive initiatives to improve cloud cost management and operational efficiency through effective governance.
Team Leadership & Development
o Lead and mentor a team of cloud governance specialists and analysts, fostering a culture of compliance, innovation, and continuous improvement.
o Develop training programs to ensure that all relevant stakeholders understand and adhere to cloud governance policies.
**QUALIFICATIONS**
Required Qualifications:
The requirements listed below are representative of the knowledge, skill and/or ability required. Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.
1. Bachelor's degree and ten to twenty-five years of related experience, or an equivalent combination of education and experience.
2. Experience managing technology or technology process teams for more than ten years, or managing teams of thirty or more technologists.
3. Excellent knowledge of technical management and data governance.
4. Knowledge of current trends in the IT hardware and systems software fields.
5. Database management skills with the ability to produce reports.
6. Familiarity with the support and troubleshooting of personal computers and tablet devices.
7. Analyze situations, evaluate alternatives, and implement robust solutions.
8. Interpret guidelines and analyze information to adapt or modify processes in response to changing circumstances.
9. Duties may require non-routine analysis, research, and follow-through.
10. The position requires strong problem-solving and analytical skills, with the ability to work independently and exercise sound judgment.
11. The ability to make commitments and be held accountable for them, organizing workloads to meet deadlines.
12. Exhibit adaptability to accept or bring about change when needed.
13. Strong written and verbal communication skills.
14. The ability to excel in a team environment and advance overall team objectives.
15. The ability to ensure customer satisfaction by delivering excellence in products and service.
16. Ability to work and communicate with peers, vendors, and internal staff, including software program leadership.
17. Consistently demonstrate a professional, positive, and approachable attitude, demeanor, and discretion.
18. Demonstrate sensitivity in handling confidential information.
19. Formulate and clearly communicate ideas to others.
20. Fluency in English.
21. Financial responsibility may include working within a budget to complete projects, negotiating and contracting with vendors, and assisting with budget development.
22. Purchase equipment and supplies as provided for in the budget.
23. Ability to manage personnel with little supervision.
Preferred Qualifications:
1. Bachelor's degree in Computer Science, Information Technology, or a related field. A Master's degree or equivalent experience is preferred.
2. 10+ years of experience in IT, with at least 5 years in cloud automation, observability, or DevOps roles.
3. Proven experience in automating cloud infrastructure in multi-cloud environments, particularly with AWS or Azure.
4. Strong background in building and managing observability frameworks, including experience with tools like Prometheus, Grafana, ELK Stack, or similar.
5. Deep understanding of cloud architecture, infrastructure as code (IaC), and CI/CD pipelines.
6. Expertise in cloud-native monitoring, logging, and tracing tools and methodologies.
7. Strong leadership skills with the ability to influence and drive change across technical teams.
8. Excellent communication and collaboration skills, with the ability to work effectively with diverse teams.
9. Certifications such as AWS Certified DevOps Engineer, Certified Kubernetes Administrator (CKA), or similar are highly desirable.
**General Description of Available Benefits for Eligible Employees of Truist Financial Corporation:** All regular teammates (not temporary or contingent workers) working 20 hours or more per week are eligible for benefits, though eligibility for specific benefits may be determined by the division of Truist offering the position. Truist offers medical, dental, vision, life insurance, disability, accidental death and dismemberment, tax-preferred savings accounts, and a 401k plan to teammates. Teammates also receive no less than 10 days of vacation (prorated based on date of hire and by full-time or part-time status) during their first year of employment, along with 10 sick days (also prorated), and paid holidays. For more details on Truist's generous benefit plans, please visit our Benefits site. Depending on the position and division, this job may also be eligible for Truist's defined benefit pension plan, restricted stock units, and/or a deferred compensation plan. As you advance through the hiring process, you will also learn more about the specific benefits available for any non-temporary position for which you apply, based on full-time or part-time status, position, and division of work.
**_Truist supports a diverse workforce and is an Equal Opportunity Employer that does not discriminate against individuals on the basis of race, gender, color, religion, citizenship or national origin, age, sexual orientation, gender identity, disability, veteran status or other classification protected by law. Truist is a Drug Free Workplace._**
EEO is the Law
Pay Transparency Nondiscrimination Provision (English_formattedESQA508c.pdf)
E-Verify