We’re looking for a Data Engineer to help build and scale the data foundations that power modern AI and generative AI solutions. This role is focused on designing resilient data pipelines that support advanced analytics, ML, and LLM-driven use cases across a range of data types.

The role:

You’ll work closely with AI, ML, and platform teams to shape how data is collected, processed, and made available for downstream intelligence. The focus is on robust engineering, clean data, and systems that can scale as AI use cases move into production.

What you’ll be doing:
  • Building and maintaining Python-based data pipelines that handle ingestion, transformation, and enrichment of both structured and unstructured data
  • Applying AI-assisted techniques to data preparation, including classification, extraction, and feature creation to support ML and LLM workflows
  • Connecting data pipelines into Azure-based platforms, including data lakes and cloud-native services
  • Ensuring pipelines are reliable and performant through testing, monitoring, and continuous optimisation
  • Partnering with data scientists, AI engineers, and platform teams to support end-to-end AI delivery
What we’re looking for:
  • Solid hands-on experience as a data engineer, with Python as a core language
  • Proven experience delivering data pipelines in production environments at scale
  • Exposure to AI, ML, or generative AI use cases within data platforms
  • Practical experience working with Azure Data Lake and related Azure data services
  • A strong engineering mindset with attention to data quality, system reliability, and performance
  • Comfortable operating in collaborative, cross-functional teams