Data Engineer

Data Engineer (PR/4010748) London, England

Salary:	GBP55000 - GBP80000 per annum + Benefits

Position: Data Engineer (Python/Databricks)

Location: Remote

Salary: up to £80,000 + Bens

Are you passionate about health tech and innovation? Do you want to be at the forefront of transforming clinical research with cutting-edge technology? If so, we have an exciting new role for you!

Join our dynamic and forward-thinking team as a Data Engineer and help us build secure, scalable microservices that operationalise clinical research applications. This is your chance to make a meaningful impact on healthcare while working with some of the most advanced technologies in data engineering.

About Us

We are a pioneering health tech company dedicated to revolutionising clinical research through innovative data solutions. Our cross-functional team, including Frontend Developers, QA Engineers, and DevOps Engineers, collaborates to create high-performance data pipelines and REST APIs that drive AI applications and external data integrations.

Your Role

As a Data Engineer, you will:

Build and Optimise Data Pipelines: Implement high-performance data pipelines for AI applications using Databricks.
Develop REST APIs: Create REST APIs required for seamless external data integrations.
Ensure Data Security: Apply protocols and standards to secure clinical data in-motion and at-rest.
Shape Data Workflows: Use your expertise with Databricks components such as Delta Lake, Unity Catalog, and ML Flow to ensure our data workflows are efficient, secure, and reliable.

Key Responsibilities

Data Engineering with Databricks: Utilize Databricks to design and maintain scalable data infrastructure.
Integration with Azure Data Factory: Leverage Azure Data Factory for orchestrating and automating data movement and transformation.
Python Development: Write clean, efficient code in Python (3.x), using frameworks like FastAPI and Pydantic.
Database Management: Design and manage relational schemas and databases, with a strong focus on SQL and PostgreSQL.
CI/CD and Containerisation: Implement CI/CD pipelines and manage container technologies to support a robust development environment.
Data Modeling and ETL/ELT Processes: Develop and optimize data models, ETL/ELT processes, and data lakes to support data analytics and machine learning.

Requirements

Expertise in Databricks: Proficiency with Databricks components such as Delta Lake, Unity Catalog, and ML Flow.
Azure Data Factory Knowledge: Experience with Azure Data Factory for data orchestration.
Clinical Data Security: Understanding of protocols and standards related to securing clinical data.
Python Proficiency: Strong skills in Python (3.x), FastAPI, Pydantic, and Pytest.
SQL and Relational Databases: Knowledge of SQL, relational schema design, and PostgreSQL.
CI/CD and Containers: Familiarity with CI/CD practices and container technologies.
Data Modeling and ETL/ELT: Experience with data modeling, ETL/ELT processes, and data lakes.

Why Join Us?

Innovative Environment: Be part of a team that is pushing the boundaries of health tech and clinical research.
Career Growth: Opportunities for professional development and career advancement.
Cutting-Edge Technology: Work with the latest tools and platforms in data engineering.
Impactful Work: Contribute to projects that have a real-world impact on healthcare and clinical research.

If you are a versatile Data Engineer with a passion for health tech and innovation, we would love to hear from you. This is a unique opportunity to shape the future of clinical research with your expertise in data engineering.