Performance and Monitoring Engineer, Senior
: Job Details :


Performance and Monitoring Engineer, Senior

Rooms To Go

Location: Seffner,FL, USA

Date: 2024-12-15T07:14:36Z

Job Description:
This role will be responsible for the monitoring, analysis, troubleshooting and reporting for Rooms To Go's overall operational performance. This includes but not limited to Infrastructure, Application, Network and Security. This individual will take an active role in driving performance enhancements, and leading targeted process improvement initiatives.The role defines the metrics, data collection methods, and reporting mechanisms as well as implementation of an overall performance management strategy. Ensures the effective capture of all logging and monitoring of all aspects of system and application behavior to facilitate fast detection and resolution of issues. This role will be the SME in troubleshooting all performance issue across the Enterprise. This role will work closely with IT, Application Development, Project Management and external vendors ensuring the consistent tracking and reporting of metrics and performance data across the Enterprise. This role will also be the SME in measuring and reporting the financial performance of the Enterprise in the Cloud, by supporting cost transparency efforts, and helping to develop mature cost metrics and benchmarking.The individual should possess a passion for operational excellence and a high level of interpersonal and soft skills necessary to work collaboratively across the Rooms To Go Enterprise.Responsibilities:
  • Define and maintain IT's performance monitoring and reporting strategy (processes, tools, & templates); develop enhanced reporting capabilities through standardization and automation.
  • Collect, consolidate and validate performance data for inclusion in IT and business communications, including weekly reports, monthly scorecards, and executive presentations
  • Proactively analyze trends in performance across IT; collaborate with process owners and stakeholders to identify and implement process improvements to increase operational efficiency and customer satisfaction.
  • Analyze and recommend performance improvements for Rooms To Go Enterprise for capacity, availability, performance, support and security.
  • Participate in efforts to improve cost transparency; develop more robust cost metrics and benchmarking capabilities; assist in efforts supporting IT cost leadership.
  • Stays informed of production changes that could affect functionality and alerting.
  • Ability to coordinate across teams, working closely with peers to ensure the appropriate focus and sense of urgency is applied to all issues
  • Troubleshooting using logs, alerts and external data sources to determine network, application, or security issues. The ability to corelate data to determine root cause.
  • Network monitoring and management which includes discovering network components and software, analyzing network traffic, monitoring network equipment for indications of network congestion, faulty network interfaces, and faulty transmission media. The role will also include responsibility for security operations on the network including detecting, monitoring, analyzing threats and reporting security incidents.
  • Work with application architects to improve scalability and performance, and MBTF.
  • Stays informed of emerging cloud technologies and evaluates the value to Rooms To Go operations.
  • Accurately troubleshoots, reproduces, and documents issues and other pertinent information in Incident or Problem tickets.
  • Handles incident queue and performs various tasks as assigned and determines business impact.
  • Handles ad hoc requests and take on new procedures as required.
  • When working on projects, identify and track project issues and dependencies, ensure follow-through, and appropriate actions are taken to complete project on time.
  • Recommend, implement and manage cloud Automation using native Cloud tools.
  • Design and implement load testing and application performance monitoring for applications
  • Lead troubleshooting calls for performance issues.
  • Provide runbooks for other departments to execute.
  • Recommend ideas to streamline operations, improve operations, create processes to proactively determine potential issues.
  • Provide training and mentoring of other engineers.
  • Drive overall improvement of operation results for Infrastructure, Application, Network and Security, due to application of application performance monitoring tools and techniques.
  • Drive overall cost management of infrastructure in the Cloud.
  • Drive overall improvement in business and operational results due to increases in speed and uniformity of automated service creation.
Qualifications:
  • Bachelor's degree in computer science or information systems (Master's Degree preferred) or an equivalent combination of education, work experience and/or applicable certifications.
  • A minimum of five years of experience related to Performance analysis and monitoring across multiple areas including Infrastructure, Application, Network and Security for medium to large scale companies.
  • Expert knowledge of IT performance metrics. Experience with data management, report design, data visualization and presentation techniques
  • Experience with one or more Cloud platforms; Microsoft Azure, Amazon Web Service (AWS), Google Cloud or IBM Cloud as it relates to performance, monitoring and cost management.
  • Expert experience with Application and Network Performance Management Tools
  • Experience with Network or Security Operations Center.
  • Experience with BI reporting tools such as MS PowerBI
  • Familiarity with financial data.
  • Candidate must have expert knowledge of at least one scripting language such as JavaScript, bash, or PowerShell.
  • Expert experience with networking fundamentals, including TCP/IP, UDP, DNS, DHCP, VLANS, routing.
  • Expert experience with networking elements such as load balancers, proxies, routers and switches.
  • Network Security elements including intrusion protection systems, anti-virus, proxies, and firewalls.
  • Datacenter fundamentals, server hardware systems, KVM, UPS.
  • Candidate should have advanced knowledge of troubleshooting performance issues with complex large-scale multi-tier and distributed application infrastructures.
  • Superior analytical and problem-solving skills.
  • Self-motivator with the ability to work effectively with minimal supervision.
  • Excellent written and verbal communication.
  • Strong organizational skills and attention to detail.
  • Ability to lead using informal authority to drive organizational objectives.
  • Ability to work with diverse, cross-functional teams, including external vendors.
Founded in 1991, Rooms To Go is one of the largest and fastest growing furniture retailers in the US with over 250 stores in ten southern states, eight distribution centers and an e-commerce business. Rooms To Go offers a collaborative work environment within a supportive culture.Rooms To Go Benefits:
  • Medical, dental, and vision insurance
  • 401(k) with company match
  • Associate discounts including furniture
  • Company paid life and disability insurance
  • Paid time off
  • Employee Assistance Program
  • Wellness Programs
  • And more!
Rooms To Go is an equal opportunity employer. We do not discriminate in hiring or employment against any individual on the basis of race, color, gender, national origin, ancestry, religion, physical or mental disability, age, veteran status, sexual orientation, gender identity or expression, marital status, pregnancy, citizenship, or any other factor protected by anti-discrimination lawsApplicants must be authorized to work in the U.S.#LI-BZ1
Apply Now!

Similar Jobs (0)