Senior Site Reliability Engineer, IMF

Confirmed live in the last 24 hours

Bloomreach

Compensation

up to $3,000

Slovakia

Remote

Posted March 25, 2026

Job Description

Bloomreach is building the world’s premier agentic platform for personalization.We’re revolutionizing how businesses connect with their customers, building and deploying AI agents to personalize the entire customer journey.

We're taking autonomous search mainstream, making product discovery more intuitive and conversational for customers, and more profitable for businesses.
We’re making conversational shopping a reality, connecting every shopper with tailored guidance and product expertise — available on demand, at every touchpoint in their journey.
We're designing the future of autonomous marketing, taking the work out of workflows, and reclaiming the creative, strategic, and customer-first work marketers were always meant to do.

And we're building all of that on the intelligence of a single AI engine — Loomi AI — so that personalization isn't only autonomous…it's also consistent.From retail to financial services, hospitality to gaming, businesses use Bloomreach to drive higher growth and lasting loyalty. We power personalization for more than 1,400 global brands, including American Eagle, Sonepar, and Pandora.

We are looking for a dedicated DevOps Engineer to join our Analytics team and manage our in-memory database (IMF) and related services. Our system runs on Google Cloud Platform (GCP) and Kubernetes and integrates with Kafka, MongoDB, and other services. Your job will be to keep our databases and services running smoothly, maintain reliable monitoring, and develop tools and automation for new releases, maintenance, and incident management.

The team works remotely in the Central European Time Zone. We are happy to meet you in Brno, Prague (Czechia) or Bratislava (Slovakia), where our headquarters is located. Salary ranges from 4,200 EUR gross/month, depending on your seniority.

Responsibilities

System Administration:

Manage and configure our Kubernetes components to ensure they are highly available, reliable, and perform well.

Incident Management:

Handle incident responses and perform root cause analysis for critical issues.
Participate in a 24/7 on-call rotation, with each duty lasting 1 week. We aim to have 4 engineers in the rotation.

Automation and Tools Development: Create and maintain scripts and tools using Python and Go to automate operations and reduce manual tasks.
Scaling and Resource Planning: