Site Reliability Engineer - Cologne, Deutschland - Giant Swarm

Giant Swarm Cologne, Deutschland

vor 2 Wochen

Ganztags

Beschreibung

The Basics

We are looking for a Site Reliability Engineer (m/f/d). You will be a key member of a tight-knit group of talented Engineers who are responsible for keeping our own and our customer's Kubernetes clusters operational and healthy. You'll also have a key role in the development of the product itself, working together with our Platform Engineers to deliver the greatest Kubernetes service possible. You will be joining our Cloud Integration Team working with Go and Kubernetes on AWS, Azure and GCP.

Giant Swarm is a fast-growing open-source infrastructure management platform used by modern enterprises. Our vision is to empower developers around the world to ship great products. We are a diverse, fully remote (since 2014), and experienced team that is growing and spreading across Europe - with headquarters in Cologne.

Your Job

You maintain, operate, and upgrade our own and our customer's Kubernetes clusters.
You will design, configure, build, and maintain distributed systems, as part of our managed Kubernetes offering.
You will use a wide variety of open-source technologies and tools from across the open-source community, including Kubernetes, Cluster-API, and Flatcar.
You understand how servers and systems work and you tweak their behavior to your needs, from kernel parameters to the infrastructure provider templates.
You will help resolve incidents on our own and our customer's clusters.
You participate in the on-call support schedule.
You are a go-to person in case our developers need advice regarding infrastructure.
We (and the majority of our customers) are currently distributed around Europe (around UTC), and thus your main time zone should be somewhere between +/-2UTC to ensure easy communication.

Requirements

You have deep hands-on knowledge of the inner workings of a Kubernetes cluster.
You must be able to configure all cluster components from the ground up with no automated deployment tools (think Kubernetes the Hard Way).
You have solid practical experience programming in Go.
The ideal candidate also has experience working with Cluster API.
You have worked with one (or more) of the major cloud providers.
You're comfortable debugging systems at all levels, from kernel & networking fundamentals right up to workloads running on Kubernetes.
You're happy troubleshooting a wide variety of issues and you're not afraid to parse thousands of lines of logs in pursuit of an answer.
You have experience with maintaining infrastructure with code and you know the pros and cons of various automation tools
You automate all the things by writing code.

About us

Every new team member changes the team. We love to learn from each other and we are looking for people who know things we don't.

Becoming part of Giant Swarm means that, by extension, you also become part of the Cloud Native community. We actively contribute to upstream projects and our quarterly hackathons will give you space to work on out-of-the-box projects. Occasionally, when we, as a team, want to fully focus on one project, we scratch all meetings and routines for a certain time to better focus during our hive-sprints.
Continuous learning is important to us - we foster this through bi-yearly personal development talks, a budget for training/certifications/coaching as well as regular feedback talks and workshops. Our teams are cross- functional and collaboration is key.
Nothing crazy, but useful Basics: We currently operate on a 32 hour workweek (or 4 day workweek, you decide). We don't count holidays but set a minimum number; You choose your own hard- and software; As a company that has almost, if not more, kids than employees, family-friendliness is crucial to us and paid parental leave is a no-brainer; We pay monthly perks that cover your costs for working remotely; We meet twice a year as an entire company and (if possible) see conferences as an important place to catch up with team members; We aim to be fully transparent (finance, salaries) unless it hurts people and trust you, based on this to make the best decisions

We failed in exactly describing our way to approach important company elements that can be described with 'buzzwords' such as agile mindset, cross-functional teams, self-organization, value of the individual or trust & teamwork. However, we truly care about them, we live them and we constantly iterate on them. Some snippets about how we do this are posted in our blog but by far not all of them.

Important note: We are not hiring job descriptions. We hire humans. :) We welcome applications from everybody, regardless ethnic or national origin, religion, gender identity, sexual orientation or age.

Reliability Engineer

vor 2 Wochen

Westlake Vinnolit GmbH & Co. KG Cologne, Deutschland

Als führender Hersteller für PVC und verlässlicher Partner für Natronlauge bietet Westlake Vinnolit individuelle Lösungen für Kunden aus den unterschiedlichsten Branchen. Westlake Vinnolit beschäftigt rund 1.400 Mitarbeiterinnen und Mitarbeiter an fünf Standorten in Deutschland. ...
Reliability Engineer

vor 3 Wochen

Vinnolit Cologne, Deutschland

Das erwartet Sie bei uns · Verfügbarkeitsanalysen der installierten technischen Aggregate und Ersatzteile mit Unterstützung des Techniker predictive Maintenance · Planung zum Ersatz technischer Apparate und Aggregate innerhalb der zu erwartenden Restlauf-Zeit mit Unterstützung de ...
Site Reliability Engineer

vor 5 Tagen

Computer Futures Köln, Deutschland

Du hast eine Leidenschaft für Softwareentwicklung, bist von modernen Technologien begeistert und suchst nach neuen Herausforderungen? In unserem "Softwareentwicklung" Team arbeitest du gemeinsam mit neugierigen Kolleg:innen, ausgeklügelten Entwickler:innen und erfahrenen Berater: ...
Reliability/ Improvement Engineer

vor 2 Wochen

Covestro Dormagen, Deutschland Ganztags

Unser Funktionsbereich CCO (Chief Commercial Officer) sorgt für einen reibungslosen und sicheren Ablauf an den internationalen Produktionsstandorten. Zudem ist er für die Planung, Errichtung und Weiterentwicklung unserer Produktionsanlagen zuständig. Dabei deckt der Bereich Proze ...
Product Reliability Engineer

vor 3 Wochen

PLARAD - Maschinenfabrik Wagner GmbH & Co. KG Much, Deutschland Ganztags

Wir sind ein erfolgreiches mittelständisches Unternehmen des Maschinenbaus im östlichen Rhein-Sieg-Kreis. Mit den von uns entwickelten, produzierten und vertriebenen Produkten, die in über 50 Industriebranchen weltweit ihre Anwendung finden, gehören wir international zu den führe ...
Product Reliability Engineer

vor 2 Wochen

PLARAD - Maschinenfabrik Wagner GmbH & Co. KG Much, Deutschland

eine erfolgreiche Verbindung Wir sind ein erfolgreiches mittelständisches Unternehmen des Maschinenbaus im östlichen Rhein-Sieg-Kreis. Mit den von uns entwickelten, produzierten und vertriebenen Produkten, die in über 50 Industriebranchen weltweit ihre Anwendung finden, gehören w ...
Site Reliability Engineer

vor 2 Wochen

easybill GmbH Willich, Deutschland CDI

easybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Aktuell haben wir mehr als aktive Kunden und wir wachsen stet ...
DevOps Engineer/Site Reliability Engineer

vor 1 Woche

HUK Autoservice Düsseldorf, Deutschland Ganztags

Das erwartet dich bei uns: · Als Site Reliability Engineer in unserem Technology-Team entwickelst und wartest du hochverfügbare, performante, zuverlässige und einfach zu bedienende Software- und Infrastruktursysteme, damit unsere Software-Ingenieure ihre Anwendungen problemlos au ...
Site Reliability Engineer

vor 2 Wochen

easybill GmbH Willich, Deutschland Employee

easybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Aktuell haben wir mehr als aktive Kunden und wir wachsen stet ...
Reliability engineer, cloud engineer

vor 2 Wochen

Venturi Westfalen, Deutschland

An International IT Consultancy is currently searching for a (Senior) Cloud Engineer to join their growing 'Cloud Data Platform' team. · Their Data & Analytics Business Line is 350+ experts in Germany, and the Cloud Data Platform' team is currently 23+ experts and growing. · Co ...
Senior Site Reliability Engineer

vor 1 Woche

easybill GmbH Willich, Deutschland

Über uns · Einleitung easybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Aktuell haben wir mehr als aktive Kunde ...
Senior Site Reliability Engineer

vor 1 Woche

easybill GmbH Willich, Deutschland

Einleitungeasybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Aktuell haben wir mehr als aktive Kunden und wir wa ...
Remote Site Reliability Engineer

vor 2 Wochen

easybill GmbH Willich, Deutschland

Einleitung easybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Deshalb suchen wir nach einer motivierten Verstärk ...
Remote Site Reliability Engineer

vor 2 Wochen

easybill GmbH Willich, Deutschland Ganztags

easybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Deshalb suchen wir nach einer motivierten Verstärkung für uns ...
Remote Site Reliability Engineer

vor 1 Woche

easybill GmbH Willich, Deutschland

Einleitungeasybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Deshalb suchen wir nach einer motivierten Verstärku ...
Senior Site Reliability Engineer

vor 1 Woche

easybill GmbH Willich, Deutschland

Über unsEinleitung easybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Aktuell haben wir mehr als aktive Kunden u ...
Senior Site Reliability Engineer

vor 1 Woche

easybill GmbH Willich, Deutschland

Einleitungeasybill ist eine cloudbasierte Rechnungssoftware, die sich durch eine einfache Anwendung, umfassende Funktionalität und der vielfältigen Anbindung durch Schnittstellen schon seit mehr als 16 Jahren am Markt behauptet. Aktuell haben wir mehr als aktive Kunden und wir wa ...
Site Reliability Engineering Manager

vor 2 Wochen

Go Tek Cologne, Deutschland

SRE Team Lead (Germany/Remote) | Up to €100,000 | Financial SW Provider · Go Tek are partnered with a German Financial Software leader, that provides Cloud based solutions across areas such as Spend Analytics. · They are currently in the process of reshaping their whole SRE capa ...
Regional Reliability, Maintenance and Engineering Manager

vor 5 Tagen

Amazon TA Köln, Deutschland regularEmployment

We are looking for a seasoned and accomplished Reliability Maintenance Engineering (RME) Manager to join our team at Amazon. In this senior leadership role, you will be responsible for managing a team of RME Area Managers, Engineers and Technicians to ensure the reliability and m ...
Regional Reliability, Maintenance and Engineering Manager

vor 5 Tagen

Amazon TA Cologne, Deutschland

We are looking for a seasoned and accomplished Reliability Maintenance Engineering (RME) Manager to join our team at Amazon. In this senior leadership role, you will be responsible for managing a team of RME Area Managers, Engineers and Technicians to ensure the reliability and m ...

Site Reliability Engineer - Cologne, Deutschland - Giant Swarm

Beschreibung

The Basics

Your Job

Requirements

About us

Reliability Engineer

Reliability Engineer

Site Reliability Engineer

Reliability/ Improvement Engineer

Product Reliability Engineer

Product Reliability Engineer

Site Reliability Engineer

DevOps Engineer/Site Reliability Engineer

Site Reliability Engineer

Reliability engineer, cloud engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Remote Site Reliability Engineer

Remote Site Reliability Engineer

Remote Site Reliability Engineer

Senior Site Reliability Engineer

Senior Site Reliability Engineer

Site Reliability Engineering Manager

Regional Reliability, Maintenance and Engineering Manager

Regional Reliability, Maintenance and Engineering Manager

für Personalvermittler

Informationen