Director - GenAI Platform Engineering (Hybrid)
We're determined to make a difference and are proud to be an insurance company that goes well beyond coverages and policies. Working here means having every opportunity to achieve your goals – and to help others accomplish theirs, too.Join our team as we help shape the future.
At the Hartford, we are seeking Director of platform engineering (Generative AI), who will be responsible for building our GenAIOps (Internal Developer platform) to streamline, monitor, and manage all GenAI production workloads and to address the operational challenges of running, maintaining and improving GenAI systems in production. We believe GenAI Ops is a specialized discipline and brings in flavors of multiple best practices and design solutions from DevOps, DataOps and MLOps.
The Director, GenAI Platform Engineering in the Enterprise Data Science & Analytics organization is considered a senior leader responsible for leading a team of highly skilled data, ML and cloud engineers designing scalable, secure, resilient GenAI platforms, recommending and selecting platform services, tools and associated architectures. The ideal candidate is an experienced people leader leading teams in ML, cloud and/or software engineering. The person must be able to partner across leadership, including executives, lines of business Tech. leads, Data Science team and lead the Platform Solution Architecture team translating business requirements to platform design. This role requires versatility and expertise across a wide range of skills. Someone with a diverse background/experience in Cloud engineering, DevOps, System design, & ML engineering will fit into this role seamlessly.
This role will have a Hybrid work arrangement, with the expectation of working in an office location (Charlotte, NC; Chicago, IL; Hartford, CT; Columbus, OH) 3 days a week (Tuesday through Thursday). Candidates must be authorized to work in the US without company sponsorship. The company will not support the STEM OPT I-983 Training Plan endorsement for this position.
Job Description
- Lead our Enterprise GenAI Platform engineering team, supporting internal customers leveraging data/cloud platform services, from idea generation through platform design including analysis, design, cost benefit, prototyping and hand over to engineering and data science team for implementation.
- Lead the team defining platform delivery standards, enforce operational excellence on delivery. Standardize methodologies, tools and systems for efficiency and effectiveness across all portfolios.
- This team also champion orchestration, automation and maintenance of our platform services in accordance with our EA/ML engineering platform strategy.
- Guide, architect and orchestrate immutable infrastructure that leverages programmable infrastructure, as code, and operationalize appropriate configuration management tools.
- Establish platform charge/show back, forecasting and budgeting processes to efficiently drive platform FinOps. Partner with lines of businesses to help them realize and optimize expenditure on GenAI platforms.
- Responsible for developing and enforcing platform governance, architecture, operating procedures, monitoring and system standards.
- Provides technical and team leadership for the Platform Engineering Team, including coaching, recruiting, mentoring and setting appropriate expectations and goals.
- Leads workload and workforce management and optimization activities of a team of cloud engineers, data engineers and ML engineers.
- Directly involved in capital budgeting, project and program planning and establishing strategic and annual plans for both GenAI platforms and the team.
- Brings thought leadership by collaborating with EDS LOBs, Enterprise Architecture, Infrastructure, security and other ecosystem partners.
- Conducts research on emerging GenAI/LLM technologies in support of development efforts and recommend technologies that will effectively govern and increase effectiveness of the platform.
- Identifies opportunities to enhance the platform to bring in responsible AI guardrails, observability, and experiment tracking, HITL feedback and re-play for finetuning the model.
- Bring maturity and next generation vision into the platform as we scale it to more & more complex undertakings using MultiModal, autonomous agents, and complex cognitive architecture.
Qualifications:
- Bachelor's degree in Computer Science, Computer Engineering, or a technical field.
- 10+ years of experience with AWS cloud.
- 10+ years of work experience in ML engineering, Platform Engineering or Ent. Architecture teams.
- 8+ years of leadership experience, leading teams in software development, DevOps, or ML engineering.
- 3+ years of experience in architecting production workloads in cloud infrastructure (AWS preferred, Azure, etc.).
- Experience with CI/CD pipelines, Automated Testing, Automated Deployments, Unit Testing and Integration Testing tools.
- Proven leadership experience in platform architecture or software engineering or infrastructure, designing highly scalable, fault tolerant technology solutions.
- Strong written and verbal communication skills.
- Strong Technical Knowledge and understanding the core concepts behind Dev/ML Ops, open-source LLMOps orchestration frameworks, LLM finetuning, metrics and evaluation.
- Experience with an Enterprise Architecture and governance practice.
- Ability to understand and align technical deliverables to the departmental and organization strategies and objectives.
- Leader and a team player with transformation mindset.
- Product and platform engineering mindset, to prioritize roadmaps, define features and capabilities, definition of done and ability to present to executive leaders both within the Enterprise Data Services organization and to business partners.
- Understanding of Lean and agile principals, experience with Scaled Agile ways of working.
- Provide thought-leadership to dynamic and collaborative teams, demonstrating excellent interpersonal skills and time management capabilities.
- Planning, organization, compromise, and risk mitigation.
- Industry awareness about evolving design patterns for Cloud and Software-as-a-service provider-based solution in GenAI space.
- Critical thinking and creativity for optimized solutions.
- Hands on expertise in Dev/ML Ops practices, technology solutions including both cloud and ML platforms.
- Experience with FinOps including KPIs, cost modeling, show/bill back, cost optimization and automation.
- Hands on experience in observability/telemetry, testing automation, SRE principals driving high level of automation and continuous improvement in internal customer experience.
- Experience with LangSmith, LangGraph DsPy, CrewAI is a plus.
Preferred Qualifications:
- AWS expertise: AWS Solution Architect Professional, certifications preferred.
- Hands on experience with GenAI orchestration tools like Langchain, LlamaIndex, Fine tuning RAG and vector databases.
- Experience with large-scale MLOps & platform design & implementation.
Compensation
The listed annualized base pay range is primarily based on analysis of similar positions in the external market. Actual base pay could vary and may be above or below the listed range based on factors including but not limited to performance, proficiency and demonstration of competencies required for the role. The base pay is just one component of The Hartford's total compensation package for employees. Other rewards may include short-term or annual bonuses, long-term incentives, and on-the-spot recognition. The annualized base pay range for this role is:
$151,600 - $227,400
Equal Opportunity Employer/Females/Minorities/Veterans/Disability/Sexual Orientation/Gender Identity or Expression/Religion/Age
#J-18808-Ljbffr