Ähnliche Jobs

Senior Operation Engineer (m/w/d) 3 Jahre Kubernetes On Premise Erfahrung - Remote & FFM oder Berlin

Senior Operation Engineer (m/w/d) 3 Jahre Kubernetes On Premise Erfahrung - Remote & FFM oder Berlin

Senior Operation Engineer (m/w/d) 3 Jahre Kubernetes On Premise Erfahrung - Remote & FFM oder Berlin

Senior Operation Engineer (m/w/d) 3 Jahre Kubernetes On Premise Erfahrung - Remote & FFM oder Berlin

percision services GmbH

IT-Dienstleister

Berlin

  • Art der Beschäftigung: Vollzeit
  • Hybrid

Senior Operation Engineer (m/w/d) 3 Jahre Kubernetes On Premise Erfahrung - Remote & FFM oder Berlin

Über diesen Job

Senior Operation Engineer (m/w/d) 3 Jahre Kubernetes On Premise Erfahrung - Remote & FFM oder Berlin

Projektnummer
#9335
Region
Remote & FFM oder Berlin
Zeitraum
18.05.2026 bis Ende 2026 + Option
Teilen:

Im Rahmen eines innovativen Plattform Projektes im Energiesektor suchen wir im Auftrag unseres Kunden nach Unterstützung als Senior Operation Engineer (m/w/d) mit 3 Jahren On Premise Erfahrung im Kubernetes Umfeld. Die Tätigkeit erfolgt weitestgehend Remote und nach Absprache ca. 1 mal im Monat für paar Tage am Stück in Frankfurt oder Berlin.

Project Description

The team is building an internal platform for software product developers to accelerate the development and delivery of software products to tackle the massive challenges facing the energy sector. The Platform is a service oriented, cloud-native platform that is being built to provide application teams with self-service capabilities to develop, run and operate their software products. Platform provides services for application infrastructure, data, service lifecycle management, application build and delivery as well as services to operate their software products. The Platform is deployed as a hybrid cloud, encompassing both private cloud and select public clouds.

General Description

The Product Line is responsible for a product portfolio , consisting of an Infrastructure as a Service Product, a managed Kubernetes Service, a resource management service to facilitate scalable management of platform permissions and

a service lifecycle workflow engine enabling. All services together constitute a core part of an on-premise private cloud platform for all business applications , including IT/OT critical applications required for maintaining and operating the grid.

For the whole product portfolio, the product line owns the complete product flow, from product management, architecture, delivery up until Tier 3 operations.

CI/CD and Operational Readiness

Objective: Support on CI/CD pipelines and ensure operational readiness for deployments

Tasks:

• Validation of deployment artifacts from an operations perspective.

• Defining and enforcing quality assurance measures (e.g. required documentation of standard operation procedures,

successful test reports, …) to ensure the high quality of delivered products and services.

• Ensuring rollback strategies and operational monitoring (observability) are in place for production deployments.

Monitoring, Incident, Problem and Change Management

Objective: Ensure operational stability and responsiveness

• Monitoring system health, performance metrics, and service availability across multi-tenant environments.

• Identifying, analyzing, and resolving incidents, minimizing service disruption.

• Triggering root cause analysis and implementation of corrective and preventive actions.

Automation of operations critical standard processes following established software development lifecyles

Objective: Reduce operational toil and improve service reliability

• Address recurring operational issues by automating remedial standard operations processes

• Validate all automated procedures following the established software development lifecycle including staging, testing,

and validation reviews

Security and Compliance Enforcement

Objective: Ensure platform operations adhere to security and compliance standards

Tasks:

• Implementing monitoring and logging strategies to support audit and compliance requirements.

• Performing routine security scans and remediating identified vulnerabilities.

Profile Requirements

Must-have experience

• At least of 3 years of operational experience with self-managed Kubernetes clusters, self-managed services providing

Kubernetes clusters and productive applications or systems in on premise environments

• Deep understanding of networking concepts, including protocols, load balancing, and security.

• Profound knowledge and implementation experience with CI/CD processes, tooling (e.g. GitLab, Jenkins, Tekton,

Argo Workflows, and Argo CD), concepts and associated quality and security assurance for software delivery

• Fundamental understanding of core operations processes (incident management, change management, problem

management, IT Service Management) as well as SRE concepts

• Experience in gathering operational insights from monitoring or observability including SLI/SLA/SLO management

and tracking.

• Hand-on experience in documenting procedures properly and enforcing clear runbooks or playbooks.

• Hands-on experience with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog).

Must-have language skills:

• Proficiency in both speech and writing in English (at least C1).

Preferred experience

• Project experience in software engineering (in Go Lang, C/C++ or Python) with significant experience in building

RESTful services in distributed environments.

Sie suchen in eigener Sache?

Wir freuen uns auf ihre projektbezogene Bewerbung & Unterlagen über unser Bewerbungstool unten .

Unternehmens-Details

company logo

percision services GmbH

IT-Dienstleister

1-10 Mitarbeitende

Köln, Deutschland

Ähnliche Jobs

DevOps Engineer - Kubernetes / Golang / Cloud Native (m/w/d)

Workwise GmbH

Lörrach + 0 weitere

DevOps Engineer - Kubernetes / Golang / Cloud Native (m/w/d)

Lörrach + 0 weitere

Workwise GmbH

Senior DevOps Engineer - Kubernetes / Bare Metal (m/w/d)

Workwise GmbH

Espelkamp + 0 weitere

Senior DevOps Engineer - Kubernetes / Bare Metal (m/w/d)

Espelkamp + 0 weitere

Workwise GmbH

Site Reliability Engineer (m/f/d)

Solactive AG

Berlin + 0 weitere

Site Reliability Engineer (m/f/d)

Berlin + 0 weitere

Solactive AG

Senior DevOps Engineer (w/m/d)

everphone GmbH

Berlin + 0 weitere

Senior DevOps Engineer (w/m/d)

Berlin + 0 weitere

everphone GmbH

Senior SRE / Platform Engineer (m/w/d)

Jobriver HR Service

Berlin + 0 weitere

Senior SRE / Platform Engineer (m/w/d)

Berlin + 0 weitere

Jobriver HR Service

Senior SRE Engineer - Cloud Operations

Jobriver HR Service

Berlin + 0 weitere

Senior SRE Engineer - Cloud Operations

Berlin + 0 weitere

Jobriver HR Service

Senior DevOps / MLOps Engineer (all genders)

GK Software SE

Berlin + 0 weitere

Senior DevOps / MLOps Engineer (all genders)

Berlin + 0 weitere

GK Software SE

Senior DevOps Engineer (alle Geschlechter)

Jobriver HR Service

Berlin + 0 weitere

Senior DevOps Engineer (alle Geschlechter)

Berlin + 0 weitere

Jobriver HR Service

(Senior) DevOps mit Fokus Operations (m/w/d)

Jobriver HR Service

Berlin + 0 weitere

(Senior) DevOps mit Fokus Operations (m/w/d)

Berlin + 0 weitere

Jobriver HR Service