Senior Site Reliability Engineer, IMF
Confirmed live in the last 24 hours
Bloomreach
Compensation
up to $3,000
Job Description
- We're taking autonomous search mainstream, making product discovery more intuitive and conversational for customers, and more profitable for businesses.
- We’re making conversational shopping a reality, connecting every shopper with tailored guidance and product expertise — available on demand, at every touchpoint in their journey.
- We're designing the future of autonomous marketing, taking the work out of workflows, and reclaiming the creative, strategic, and customer-first work marketers were always meant to do.
We are looking for a dedicated DevOps Engineer to join our Analytics team and manage our in-memory database (IMF) and related services. Our system runs on Google Cloud Platform (GCP) and Kubernetes and integrates with Kafka, MongoDB, and other services. Your job will be to keep our databases and services running smoothly, maintain reliable monitoring, and develop tools and automation for new releases, maintenance, and incident management.
The team works remotely in the Central European Time Zone. We are happy to meet you in Brno, Prague (Czechia) or Bratislava (Slovakia), where our headquarters is located. Salary ranges from 4,200 EUR gross/month, depending on your seniority.
Responsibilities
- System Administration:
- Manage and configure our Kubernetes components to ensure they are highly available, reliable, and perform well.
- Incident Management:
- Handle incident responses and perform root cause analysis for critical issues.
- Participate in a 24/7 on-call rotation, with each duty lasting 1 week. We aim to have 4 engineers in the rotation.
- Automation and Tools Development: Create and maintain scripts and tools using Python and Go to automate operations and reduce manual tasks.
- Scaling and Resource Planning:
- Monitor system performance and plan for future scaling.
- Ensure there are enough resources during peak times.
- Monitoring and Logging:
- Set up and maintain systems to monitor and log activities, so issues can be detected and addressed early.
- Backup and Recovery:
- Ensure our database has reliable backups and efficient tools for quick and smooth recovery.
- Collaboration:
- Work closely with other engineers and product managers to ensure successful project delivery.
- Collaborate with L2 support engineers to ensure
Similar Jobs
Yugabyte
Staff Site Reliability Engineer
Feedzai
Site Reliability Engineer
Bloomreach
Senior Site Reliability Engineer - Data Pipeline
Bloomreach
Senior Site Reliability Engineer - Data Pipeline
Bloomreach
Senior Site Reliability Engineer, IMF
JFrog