Back to Search
Overview
Lead / Manager

Lead Data Scientist

Confirmed live in the last 24 hours

Xebia CEE

Xebia CEE

Bulgaria; Poland; Romania
On-site
Posted April 8, 2026

Job Description

 

Hello, let’s meet!

Who We Are

While Xebia is a global tech company, our journey in CEE started with two Polish companies – PGS Software, known for world-class cloud and software solutions, and GetInData, a pioneer in Big Data. Today, we’re a team of 1,000+ experts delivering top-notch work across cloud, data, and software. And we’re just getting started.

What We Do

We work on projects that matter – and that make a difference. From fintech and e-commerce to aviation, logistics, media, and fashion, we help our clients build scalable platforms, data and AI solutions, and cutting-edge applications to shape the future of tech. Our clients include McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, InPost, and many, many more.

We value smart tech, real ownership, and continuous growth. We use modern, open-source stacks, and we’re proud to be trusted partners of Databricks, dbt, Snowflake, Azure, GCP, and AWS. Fun fact: we were the first AWS Premier Partner in Poland!

Beyond Projects

What makes Xebia special? Our community. We support tech communities, organize meetups (Software Talks, Data Tech Talks), and have a culture that actively support your growth via Guilds, Labs, and personal development budgets — for both tech and soft skills. It’s not just a job. It’s a place to grow.

What sets us apart? 

Our mindset. Our vibe. Our people. And while that’s hard to capture in text – come visit us and see for yourself.

 

You will be:

  • designing and developing statistical models for property price adjustments across time, location, quality, and condition,
  • building spatial algorithms (adaptive heatmaps, geographic clustering, polygon-based property search) to capture local market dynamics, 
  • implementing comparable property recommendation with feature engineering across different property types, 
  • developing market analysis pipelines with solid diagnostics: trend fitting, outlier detection, goodness-of-fit metrics, 
  • integrating LLM-based classification services for document and property analysis, 
  • exposing model outputs through production API endpoints and working with frontend engineers on data contracts, 
  • debugging models in production: edge cases, numerical issues, data quality problems.

Your profile:

  • solid statistics background: regression, GAMs, mixed/random effects, link functions, robust estimation, outlier handling,
  • proficiency in Python and the data science stack: NumPy, Pandas, statsmodels, SciPy, scikit-learn, 
  • experience building and maintaining production APIs with FastAPI and Pydantic, 
  • comfortable working with PostgreSQL and SQLAlchemy, 
  • familiar with containerized environments (Docker, Kubernetes, GCP), 
  • able to turn domain requirements into quantitative solutions and communicate trade-offs, 
  • good command of English (spoken and written), 
  • familiarity with basic statistical concepts (e.g., Bayes’ rule, linear regression, maximum likelihood estimation.

Work from the European Union region and a work permit are required.

Nice to have:

  • geospatial data and libraries (GeoPandas, Shapely, H3, GeoAlchemy2),
  • GAM libraries (PyGAM), JAX, or TensorFlow Probability, 
  • task queues and async workflows (Celery, Redis), 
  • observability tooling (OpenTelemetry), 
  • ML pipeline frameworks (Kedro), 
  • data validation and property-based testing (Pandera, Hypothesis, TestContainers), 
  • R integration (rpy2),
  • LLM integrations (Google Gemini or similar),
  • frontend awareness (React, TypeScript),
  • real estate data, valuation methodology, or appraisal workflows.

Recruitment Process:&

reactpythontypescriptgorustawsgcpazurekubernetesdocker