Senior Platform Engineer
As a Senior Platform Engineer, you will play a pivotal role in executing the technical vision set by the Head of Infrastructure, with a strong emphasis on operating, stabilizing, and incrementally modernizing our on-premises infrastructure. This role requires a balance between hands-on ownership of existing platforms and legacy systems, along with the design and implementation of new, more automated, and scalable capabilities.
Working closely with Software Engineering, you will help build and evolve internal platforms and shared services that enhance reliability, security, and developer experience across both legacy and modern environments. This position demands thoughtful integration, bridging current systems with newer platform methodologies such as infrastructure-as-code, automation, and container-based workloads when appropriate.
Your efforts will concentrate on reducing operational friction, standardizing workflows, and introducing self-service capabilities while respecting regulatory, security, and on-premises constraints. By treating infrastructure and platforms as long-lived products and aligning closely with the Head of Infrastructure’s direction, you will contribute to ensuring a stable, secure foundation, while progressively enabling the next generation of our technical platform.
Job Requirements
- Fluent in English (written and spoken); German language skills are a plus.
- Practical experience with Kubernetes, including introducing and operating it incrementally in on-prem environments alongside existing platforms.
- Experience using Terraform for infrastructure-as-code to manage shared services and infrastructure components in a controlled, versioned manner.
- Hands-on experience operating on-prem Linux infrastructure in production, with ownership of availability, performance, and reliability.
- Strong experience with configuration management using Puppet and Ansible.
- Production experience operating and troubleshooting Apache and Tomcat application stacks.
- Solid operational experience with PostgreSQL, including administration, backups, performance considerations, and incident support.
- Hands-on experience operating Ceph storage, including capacity management, performance analysis, and failure handling.
- Experience supporting CI/CD and release processes, ideally using GitLab, across both manual and automated workflows.
- Familiarity with artifact repositories such as Artifactory.
- Strong automation and scripting skills, primarily using Python.
- Experience implementing and operating monitoring and observability tooling such as Zabbix and Grafana; knowledge of the Elastic Stack is a plus.
- Experience participating in on-call rotations, incident response, root-cause analysis, and remediation.
- Solid understanding of on-prem networking concepts, including VLANs, load balancing, firewalls, and DNS.
- Familiarity with security and compliance requirements in regulated environments, including certificate management, TLS, and auditability.
- Experience improving and operating release and change management processes in production systems.
- Ability to produce and maintain clear operational documentation and runbooks for day-to-day operations and on-call support.
- Demonstrated ability to integrate legacy systems with modern platform approaches without disrupting production workloads.
Job Responsibilities
- Introduce and operate Kubernetes-based workloads where suitable, integrating them with existing on-prem infrastructure and operational processes.
- Incrementally modernize the platform by migrating appropriate services and workflows toward containerized and declarative approaches without disrupting existing production systems.
- Design and evolve hybrid operational models where legacy VM-based services and Kubernetes workloads coexist consistently.
- Operate, maintain, and modernize on-premises infrastructure and platforms, ensuring availability, security, and performance.
- Oversee day-to-day operations, including troubleshooting, patching, upgrades, and capacity management for Tomcat/Apache applications, PostgreSQL, and supporting services.
- Manage configuration management with Puppet and Ansible to ensure consistent, secure, and auditable system changes.
- Operate and support Ceph storage, including capacity monitoring, performance analysis, and remediation of failures.
- Design and operate internal platforms and shared services for application deployment and runtime operations, spanning both VM-based and newer platform components.
- Support and execute application releases, collaborating on both manual and automated processes to improve reliability and repeatability.
- Build and maintain automation and tooling primarily in Python to reduce manual effort and enhance operational consistency.
- Implement and operate monitoring, alerting, and incident response using Zabbix and related observability tools.
- Participate in an on-call rotation (including weekends), handling incidents, performing root-cause analysis, and driving corrective actions.
- Identify and address operational risks and improvement opportunities across application runtime, databases, storage, and CI/CD tooling.
- Embed security, compliance, and audit requirements (ISO 27001, SOC 2, GDPR, etc.) into daily operations and system configurations.
- Maintain operational documentation, runbooks, and release procedures to support consistent execution and on-call readiness.
- Evaluate tools and automation with a pragmatic, on-prem-first approach, recommending changes that improve stability, security, and maintainability.
Job Benefits
- Competitive salary and 5+ extra holidays (30 days total).
- Hybrid working model with flexible hours.
- Prime office location in the heart of Zurich, complete with a roof terrace.
- Inclusive international team spirit with ambitious colleagues and a strong drive to achieve our goals.
- Opportunity to develop and learn within a highly talented and experienced team.
- Engage in products with real-world impact on digital privacy, security, and trust.
- Participate in semi-annual international company offsite events in Portugal, Switzerland, and throughout Europe.
- Convenient parking available right at the office.
- Spacious office featuring a leisure room and table football, along with complimentary snacks.
- Join a company committed to sustainability; our data centers operate in a climate-neutral manner.
Apply online using the form below. Please note that only applications matching the job profile will be considered.