Job Title: Infrastructure Platform Leader
Berwyn, PA, US, 19312
The Platform IT Operations Leader is responsible for the reliability, scalability, and compliance of all enterprise computing platforms supporting the company’s global operations — spanning cloud, data centers, applications, and shared services. This role oversees global infrastructure operations, cloud operations, application hosting, and platform reliability engineering, ensuring that systems meet the performance, security, and regulatory expectations of both commercial and regulated business units.
As part of the IT senior leadership team, this role will lead global operations teams and managed service providers (MSPs) to deliver always-on, compliant services that enable digital transformation, manufacturing excellence, and customer trust.
HOW YOU WILL MAKE AN IMPACT:
- Define and execute the global Platform Operations strategy covering compute, storage, cloud, database, backup, monitoring, and automation.
- Establish and maintain operating models, SLOs/SLAs, and governance frameworks aligned to ITIL and service reliability principles.
- Partner with Enterprise Architecture, Cybersecurity, and Application Development to design resilient, scalable, and compliant platform services.
- Lead operational readiness for new platforms and projects, including M&A integrations and global rollouts.
- Oversee operations of hybrid computing environments across data centers, Azure/AWS clouds, and edge/plant locations.
- Manage core infrastructure services: servers, virtualization (VMware, Hyper-V), storage (SAN/NAS/Object), backup/recovery, DNS/DHCP, and monitoring.
- Define and maintain cloud landing zones, identity integration, and platform guardrails in partnership with Security and Architecture.
- Drive infrastructure-as-code (IaC), automation, and self-service provisioning to increase agility and reduce operational burden.
- Partner with Network and Security teams to ensure secure, performant connectivity and resilience.
- Ensure reliability and performance of enterprise platforms including ERP (SAP/Oracle), MES/PLM, CRM, data lake/warehouse, and business applications.
- Oversee database operations (SQL, Oracle, PostgreSQL, cloud-native DBs) including backup, recovery, patching, and performance optimization.
- Establish platform lifecycle management, patch cadence, and upgrade programs aligned with vendor roadmaps.
- Build and lead a Platform Reliability Engineering (PRE) or Site Reliability Engineering (SRE) function focused on proactive monitoring, automation, and resilience.
- Implement enterprise monitoring, observability, and event management tools (e.g., Splunk, ServiceNow, Datadog, Azure Monitor, Dynatrace).
- Drive automation for provisioning, patching, and incident response; leverage AI/ML for predictive operations (AIOps).
- Develop KPIs for service uptime, incident resolution, and change success rates; deliver continuous improvement against targets.
- Partner with the CISO to enforce secure configurations, patch management, and vulnerability remediation across all platforms.
- Ensure operational compliance with SOX ITGC, NIST 800-171, CMMC, DFARS, ITAR/EAR, and ISO 27001 requirements.
- Maintain documented processes for audit readiness, evidence collection, and control testing.
- Lead disaster recovery and business continuity planning for critical platforms and support disaster recovery and business continuity plans for business unit based services.
- Lead global teams (employees and MSPs) delivering platform operations 24x7.
- Manage vendor relationships and service delivery performance across cloud, hosting, and MSP providers.
- Own operating budgets for compute, storage, cloud consumption, and platform services; drive FinOps and cost transparency initiatives.
- Foster a culture of ownership, accountability, and technical excellence within the global operations team.
WHAT YOU WILL BRING TO THE ROLE:
- BS in Computer Science, Engineering, or Information Systems; MS or MBA preferred.
- 15+ years in IT infrastructure and operations with at least 7+ years in global leadership roles.
- Experience managing hybrid cloud environments and large-scale enterprise workloads.
- Strong background in manufacturing IT, with understanding of regulated environments (defense, aerospace, energy, or medical).
- Proven track record of service delivery excellence, vendor management, and global team leadership.
- Cloud platforms: Azure, AWS (landing zones, monitoring, cost management).
- Virtualization & compute: VMware, Hyper-V, Nutanix, container orchestration (Kubernetes).
- Storage & backup: SAN/NAS, cloud storage, backup tools (Commvault, Veeam, Rubrik).
- Databases: SQL Server, Oracle, PostgreSQL, cloud-native DBs, backup/recovery.
- Monitoring & ITSM: ServiceNow, Splunk, Datadog, AppDynamics, Dynatrace, SCOM.
- Security: Patch management, endpoint hardening, privileged access controls.
- Automation: Terraform, Ansible, PowerShell, Azure Automation, CI/CD pipelines.
- Compliance: SOX ITGC, NIST 800-171/CMMC, DFARS, ITAR/EAR, CE/CE+, NIS2.
- Strategic and pragmatic leadership balancing innovation with stability.
- Strong communication and executive reporting skills.
- Financial acumen and experience with cloud cost optimization.
- Ability to collaborate effectively across IT, OT, and business units.
- Certifications (Preferred): ITIL v4, Azure/AWS Certified Architect, VMware Certified Professional (VCP), CISM or CISSP, TOGAF, FinOps Practitioner.
- Eligibility to work with export-controlled information; ability to obtain/maintain relevant clearances when required by customer programs.
- Willingness to visit manufacturing sites and suppliers globally.
Nearest Major Market: Philadelphia