Kaseya® is the leading provider of complete IT infrastructure and security management solutions for Managed Service Providers (MSPs) and internal IT organizations worldwide powered by AI. Kaseya's best-in-breed technologies allow organizations to efficiently manage and secure IT to drive sustained business success. Kaseya has achieved sustained, strong double-digit growth over the past several years and is backed by Insight Venture Partners www.insightpartners.com), a leading global private equity firm investing in high-growth technology and software companies that drive transformative change in the industries they serve. Founded in 2000, Kaseya currently serves customers in over 20 countries across a wide variety of industries and manages over 15 million endpoints worldwide. To learn more about our company and our award-winning solutions, go to www.Kaseya.com and for more information on Kaseya's culture, please click here: Kaseya Culture. Kaseya is not your typical company. We are not afraid to tell you exactly who we are and our expectations. We have achieved record levels of success being BOLD, being GRITTY, being ACCOUNTABLE. The thousands of people that succeed at Kaseya are prepared to go above and beyond for the betterment of our customers, and the betterment of their careers and long-term financial wealth. Senior Linux Monitoring and Observability Engineer Does This Describe You: You're a creative thinker who loves teamwork. A Look Inside the Job:
- Participate in the architecture, management, and operation of the monitoring infrastructure systems including Zabbix, Alerta, Kafka, OpenSearch, Jaeger, and VictoriaMetrics
- Work with large fleets of hosts, and data at scale for both bare-metal and virtualization architectures
- Learn what makes distributed systems tick by collecting and analyzing metrics
- Instrument relevant data with visualizations and dashboards (Grafana/Opensearch-Dashboards)
- Contribute to documentation, monitoring plans and alert configuration strategies
- Provide ad-hoc support to other engineering groups in the company via Teams/Jira tickets
- Ensure system and service stability, scalability, security, and performance
- Solve complex scaling challenges
- Limited on-call responsibilities (It's not bad, trust me.)
- Understanding SRE or DevOps culture is desirable
About You:
- 10+ years of experience with Linux administration (Ubuntu preferred, but it doesn't matter.)
- 10+ years of experience with monitoring platforms at scale.
- Comfortable working with observability tools of some kind (logs/metrics/traces)
- Willingness to participate in a small team that consists of mostly remote members
- Eager to learn things quickly
- Ability to effectively manage time and push multiple projects at once
- Intermediate knowledge of at least some of our architecture components
- Comfortable working within Git based deployment workflows, using related tooling
- Solid configuration management skills (Ansible and Terraform primarily, Foreman/Puppet, Salt)
- Experience with metric query language (PromQL/MetricQL)
- Experience with scripting and system automation (Bash, Python, Go, Perl, Awk, etc.)
- Experience with Kubernetes or other microservice based environments
- Work well in an energized environment
- Excellent documentation skills (Wiki's, ticket details, blog posts, etc)
- Excellent diagnostic expertise including problem investigation, root-cause analysis, and resolution skills
- Comfortable working with open source projects and supporting them directly as needed (bugfixes /new features)
- The desire and ability to see a problem through to a complete solution
Join the Kaseya growth rocket ship and see how we are #ChangingLives ! Additional information Kaseya provides equal employment opportunity to all employees and applicants without regard to race, religion, age, ancestry, gender, sex, sexual orientation, national origin, citizenship status, physical or mental disability, veteran status, marital status, or any other characteristic protected by applicable law.