Senior Site Reliability Engineer
: Job Details :


Senior Site Reliability Engineer

Prove

Location: Chicago,IL, USA

Date: 2024-12-01T16:02:34Z

Job Description:
Title: Senior Site Reliability EngineerDepartment: Internal OperationsReports To: Manager, Site ReliabilityFLSA Status: ExemptLocation: Chicago, IL or Denver, COJob Summary:The Senior Site Reliability Engineer is responsiblefor bringing a software engineering approach to Prove operations. Using software as a tool to manage systems, solve problems, and automate operations tasks. You will own and lead the efforts for creating and supporting scalable and highly reliable software systems. Leveraging your knowledge and passion to improve Prove's customers' reliability experience and helping development teams to meet SLOs. You will be responsible for the services availability, reliability, capacity planning and disaster recovery, managing large systems through code.Key Responsibilities:
  • Lead projects to find innovative solutions to challenges.
  • Collaborate with senior leaders on strategic planning (e.g., resourcing, roadmap creation and execution).
  • Maintain, enforce, and provide input on processes to ensure infrastructure is maintained to be fault-tolerant, scalable, and reusable.
  • Collaborate in cross-functional teams, both technical and non-technical.
  • Promote and cultivate ownership of design, execution, and deployment of product features.
  • Ability to react quickly to changing customer and business needs.
  • Be familiar with the Site Reliability Engineering principles like error budgets and toil.
  • Extensive knowledge in relation to how software is deployed, configured, and monitored.
  • Experience around creating observable production software applications and services which facilitates answering questions in relation to unknowns.
  • Experience working with canaries and experiments.
  • Excellent communication skills to provide excellent advice and feedback to other engineers in relation to reliability and scalability.
  • Promote, maintain, and enhance our cultural values of humility, passion, inclusion, and leadership.
  • Understand the application software architecture and data flow with particular interest in all aspects that can affect performance and reliability.
  • A culture of identifying repetitive tasks, proposing, and implementing automation solutions to remove toil.
  • Exhibit a strong curiosity and passion for expanding and deepening knowledge.
  • Have a culture of leading and owning software and services (you write it you wear it) from start to end, delivering quality for users as long as documentation and training for users and team members.
  • Mentoring colleagues on different subjects related to the SRE work.
  • Leading multi-team efforts and communities in relation to the use of technologies and good practices.
  • Familiarity with chaos engineering and capacity planning.
  • Ability to deliver production ready code for operations.
  • Strong passion for producing documentation and training material for other teams.
  • Working within On Call shifts for SRE supported environments (typically every 6 weeks).Qualifications and Experience:
    • 6 to 8 years of production engineering experience OR software engineering experience with sufficient production exposure.
    • 2+ years of experience with web application maintenance, leveraging containerized workflows such as Kubernetes or Docker.
    • Expertise in applications and services telemetry using standards like Open Telemetry.
    • Good coding and automation skills around tools running in production. 2+ years of experience with higher level languages such as Java, Go or Python.
    • 2+ years of experience with operating systems and TCP/IP network fundamentals.
    • Bachelor's degree in computer science or related field.
    • Promote, maintain, and enhance our cultural values of humility, passion, inclusion, and leadership.
    • Strong passion for learning about our products and markets through in-house and external training.
    • Experience in high growth / pre-IPO Technology companies is a plus.This position description should not be considered the final description of the position. The position description is not intended to be an all-inclusive list of duties and standards of the positions. It should be assumed that we would, to some extent, structure responsibilities in accordance with the successful candidate's capabilities and changing business conditions. Incumbents will follow any other instructions, and perform any other related duties, as assigned by their supervisor.The anticipated salary range for this role is $160,000 - $170,000, company bonus and stock options. Offered salary will be determined by the applicant's education, experience, knowledge, skills, geo-location, and abilities, as well as internal equity and alignment with market data. #J-18808-Ljbffr
Apply Now!

Similar Jobs (0)