Back to Search
Overview
Mid-Level

Data Engineer - Applied AI

Confirmed live in the last 24 hours

Celonis

Celonis

Bangalore, India
Hybrid
Posted February 27, 2026

Job Description

We're Celonis, the global leader in Process Intelligence technology and one of the world's fastest-growing SaaS firms. We believe there is a massive opportunity to unlock productivity by placing AI, data and intelligence at the core of business processes - and for that, we need your help. Care to join us?

The Team:

You will be joining the Business Apps department. Our mission is to build end-to-end solutions on the Celonis platform, including data models and end-user applications,  to accelerate time to value for our customers and partners. The Catalog team within Business Apps specializes in three aspects: Defining the data ontology of the most common business processes; Building prebuilt transformations for such ontologies for major source systems like SAP, Oracle etc (leveraging the latest AI tools and agentic workflows to accelerate design, transformation development, and model validation); Collaborating with various teams in both the Product and Go-to-market organizations to drive adoption at scale.

The Role:

As a Data Engineer - Applied AI, you will own and focus on primarily two aspects: owning the set of tools and data pipelines that we use to publish pre-built data models to our customers (leveraging AI and automation wherever possible), and innovating AI-driven methodologies to accelerate our own content development, as well as accelerate customer adoption of our content. 

The work you’ll do:

  • Build data models for the defined ontologies and mappings using the object-centric process mining methodologies with performant SQL transformations.
  • Use AI-assisted workflows and agentic tooling to design and implement business objects, process events, and data models in the Celonis platform.
  • Research and design: 
    • ontologies for new business processes, improve and extend capabilities of existing ones.
    • the source system transformations to map them with the defined ontologies.
  • Build and own the set of tools & pipelines we use to publish & distribute Celonis Data Model.
  • Collaborate closely with software engineering teams for requirements, timeline, and dependencies of content distribution.
  • Serve as the primary link for discussions between the Business Apps and Product & Engineering Team.
  • Test and validate the models in development environments and customer environments to gather early feedback.
  • Document the data model governing principles and development 
  • Drive AI adoption to accelerate development and end-user adoption throughout the department

The qualifications you need:

  • You have that rare combination - a strong technical expertise and business acumen. You’ll use this to expand our offering of pre-built data models that drive tangible customer value. .
  • 3+ years of experience in AI-assisted engineering and the development of agentic workflows.
  • Must-have: 
    • Hands-on expertise with SQL and applying AI tools or automation frameworks in data workflows.
    • Expert proficiency in Python, with a strong emphasis on data analysis, numerical simulation, and data pipelines.
  • You can build, test, and deploy AI applications using LLMs and frameworks.
  • You are skilled in designing systems to improve model responses with external knowledge bases.
  • You can implement observability tools to monitor performance, latency, and cost.
  • You possess the ability to optimize prompts to enhance accuracy and reduce hallucinations.
  • You are proficient in data modeling best practices.
  • You have working knowledge of Cloud Platforms including AWS, Azure, or GCP, as well as CI/CD pipelines and Git.
  • You have experience transitioning prototypes into scalable production systems using Docker, FastAPI, and API integration.
  • Experience working in the data field as a Data Engineer, Data Analyst or similar with at least one of the following system types will be a plus:
    • ERP (e.g. SAP ECC or S/4, Oracle EBS or Fusion)
pythongoawsgcpazuredockeraidataproductdesign