Job Description
We are looking for someone with strong PM and data management technical skills: hands-on experience with SQL, strong Excel skills, and experience scripting automation to populate dashboards. At the same time, we need someone who can work cross-organizationally to drive actions and manage accountability.
Experience in risk management, systems infrastructure architecture, and capacity planning is required; this is going to be key. Azure experience is a plus, but if you do not have hands-on experience with Azure, we still want to talk to you!
Capacity Risk Management
• Collect and analyze capacity risk metrics
• Conduct periodic capacity risk assessments and publish reports based on the capacity risk management framework
• Program manage capacity risk management actions across the organization to ensure they are closed and that capacity is managed to established thresholds
• Develop and maintain capacity risk management processes and policies
• Track availability of required functionality for capacity risk management
Hardware Decommissioning
• Program manage the execution of the hardware decommissioning process and plans
• Track availability of the required functionality for customer migrations and hardware decommissioning
• Publish regular reports for hardware decommissioning and program status
Hardware Reliability Program (as needed)
• Track hardware issues to resolution and drive root cause analysis with Azure teams and OEMs
• Develop and implement hardware reliability metrics
• Understand Azure’s infrastructure and the tools involved
• Monitor and analyze hardware failure data
• Develop feature requirements to implement hardware reliability metrics and failure patterns
• Develop hardware requirements for hardware diagnostics and instrumentation
• Publish periodic hardware reliability reports
• Track availability of required functionality for the hardware reliability program
Requirements
Qualifications:
• Cross-organizational project management experience
• Experience with data analysis and building analytical models, including SQL Server (writing queries) and Excel (pivot tables)
• Experience with systems capacity planning
• Experience with server hardware and networking
• Understanding of system troubleshooting methods
• Experience working with OEMs
• Experience working with and troubleshooting 1000 servers at the enterprise level
• Hardware and software telemetry experience
• Process development and management experience