Who We Are
Sabel Systems Technology Solutions, LLC is a leading solution provider for innovative and agile Digital Engineering and Acquisition Technical Stack design, implementation, and support. We have projects across the DoD, Industry, and Academia. We operate in secure public clouds, on-premises clouds, and hybrid clouds. We offer large-business opportunities and training with the agility and people-first culture of a small business. You will be joining a dynamic and highly motivated team with one goal: get quality, secure solutions into the customers' hands as soon as possible.
Who We Need
The Monitoring and Observability Expert position is a full-time, remote position. The primary responsibility of this position is to develop the instrumentation that brings in telemetry from applications and to build dashboards for many applications across several different platforms, architectures, and languages.
What You'll Do
Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions of this position.
- Provide remote technical support and incident and problem management for Sabel's Cloud Platform, computer operations, and networks, including software support and administration, installation, setup, error messages, system status, and downtime procedures.
- Provide technical context and leadership to the assigned project team.
- Collaborate with the rest of the Engineering team to support the customer mission.
- Instrument applications, servers, and systems to pull in telemetry for actionable monitoring data.
- Build usable dashboards with actionable data for audiences ranging from regular users to executives.
- Build tests for detection of issues to improve platform reliability.
- Build scripts to load test the platform.
- Support database and API communication development activities.
- Prepare technical documentation, including standard procedures, reports, research, and cost analysis.
- Provide technical support for DoD-specific on-prem and virtual cloud hardware.
- Assist with management of infrastructure support to include servers, networks, and applications.
- Assist the technical lead with projects from implementation through testing and production.
- Communicate effectively with users at all skill levels and document unique issues and solutions in a support knowledge base or in procedure/policy documents.
- Update and maintain electronic records with accurate data.
- On-call, evening, or weekend hours will be required as needed for support and maintenance issues.
- Domestic travel could be required.
- Perform other duties and special projects as assigned.
Your Qualifications
Required
- Must be customer-service and results-oriented: an experienced problem solver who seeks assistance when necessary, a self-starter with excellent oral and written communication skills, able to handle multiple tasks simultaneously, and an experienced decision-maker.
- Able to work effectively on their own without direct supervision.
- Able to work as part of a distributed team spanning multiple time-zones and regions.
- 10+ years of experience with computer systems support, server installation and support, network administration, network installation, and monitoring/observability platforms.
- Experience troubleshooting complex network and systems infrastructure issues.
- Experience with enterprise-level monitoring or observability platforms.
- Experience monitoring user experience down to the click level.
- Experience with Linux Administration.
- Experience installing software, patches, and updates for K8s.
- Experience with Active Directory and Windows Servers is a plus.
- Experience with Platform One Big Bang, Party Bus, and Iron Bank. (Preferred)
- Must have experience with scripting (Python, Bash, Go) for automation tasks.
- Must have CompTIA Security+ (CE) certification.
- Must be able to obtain a security clearance (Secret/Top Secret).
- Bachelor's degree in computer science or related field.
- Experience in two or more of the following areas:
  - Experience with cloud platforms such as Amazon Web Services.
  - Experience with DoD-specific cloud platforms such as ARCUS, CloudOne, or RogueOne.
  - Experience with cybersecurity within the DoD.
  - Experience with Docker or Kubernetes.
  - Experience with Elastic SIEM offerings.
  - Experience with Riverbed or Aternity.
  - Experience with JMeter or another load testing platform.
  - Experience with Jenkins or another application testing platform.