Bloomreach is the world’s #1 Commerce Experience Cloud, empowering brands to deliver customer journeys so personalized, they feel like magic. It offers a suite of products that drive true personalization and digital commerce growth, including:
Discovery, offering AI-driven search and merchandising Content, offering a headless CMS Engagement, offering a leading CDP and marketing automation solutionsTogether, these solutions combine the power of unified customer and product data with the speed and scale of AI optimization, enabling revenue-driving digital commerce experiences that convert on any channel and every journey. Bloomreach serves over 850 global brands including Albertsons, Bosch, Puma, FC Bayern München, and Marks & Spencer. Bloomreach recently raised $175 million in a Series F funding round, bringing its total valuation to $2.2 billion. The investment was led by Goldman Sachs Asset Management with participation from Bain Capital Ventures and Sixth Street Growth. For more information, visit Bloomreach.com.
We are looking for a dedicated DevOps Engineer to join our Analytics team and help manage and maintain our data platform. Your primary focus will be our in-memory database (IMF), ClickHouse, and the associated services. Our entire system operates on Google Cloud Platform (GCP) and Kubernetes, and it integrates with Kafka, MongoDB, and other services. Your responsibilities will include ensuring the smooth operation of our databases and services, maintaining reliable production monitoring, and developing quality tools and automation for handling new releases, maintenance, and incident management.
The team is working remotely in the Central European Timezone. We are more than happy to meet you in Brno (Czechia) or in Bratislava (Slovakia) where our headquarters is located. Salary ranges from 4000 EUR gross/month based on your seniority and it can get much higher later depending on your performance.
Responsibilities System Administration: Manage and configure our database systems on GCP within Kubernetes for high availability, reliability, and performance. Incident Management: Handle incident responses, perform root cause analysis for critical issues, and participate in a 24/7 on-call rotation. Automation and Tools Development: Create and maintain scripts and tools to automate operations and reduce manual tasks. Scaling and Resource Planning: Monitor system performance, plan for future scaling, and ensure enough resources during peak times. Monitoring and Logging: Set up and maintain monitoring and logging systems to detect and address issues early. Backup and Recovery: Develop and manage strategies for data backup and disaster recovery to ensure business continuity. Collaboration: Work closely with development and operations teams to align operations with overall business goals. Qualifications Experience: proven experience in DevOps or site reliability engineering, preferably with databases on GCP and Kubernetes. Knowledge of CI/CD pipelines and DevOps principles. Skills: Expertise in automation and scripting (e.g., Python, Go, Shell), performance tuning, and managing incidents. Tools: Familiarity with monitoring, logging, and automation tools. Problem-solving: Strong analytical and problem-solving abilities. Communication: Excellent communication and collaboration skills for working with remote teams. Adaptability: Ability to work independently and handle multiple tasks in a fast-paced environment. Our stack GitLab Prometheus, Grafana, InfluxDB, Chronograf IMF (our in-memory database written in C++), ClickHouse, Apache Kafka, MongoDB, and more … Kubernetes (GKE) Google Cloud Platform Python, Go Compensations There's a bonus based on company performance and your salary. You will be entitled to restricted stock units