Data Platform Engineer (Coding & ETL Tooling)
Data Platform Engineer (Coding & ETL Tooling) Position Overview We are seeking a Data Platform Engineer with a strong background in modern coding environments and open-source ETL/ELT technologies. The successful candidate will support the development, orchestration, and automation of data workflows using tools like Python, R, GitLab Runners, Airflow, and dbt. This role also involves managing and optimizing collaborative development environments (GitHub, GitLab) and supporting IDE usage across data science and engineering teams. Key Responsibilities Coding Environment Management
- Support the setup and maintenance of development environments using IDEs such as VSCode, RStudio, Cursor, and Jupyter
- Enable best practices for collaborative coding in languages such as Python, R, and Stata
- Ensure integration between IDEs, data platforms, and source control tools for streamlined workflows
- Assist in optimizing development environments for reproducibility, package management, and dependency tracking
- Administer Git-based version control systems (GitHub, GitLab), including branching strategies, access control, and repo management
- Develop and manage CI/CD pipelines using GitLab Runners and GitHub Actions for data pipelines and analytical code
- Promote code quality through automated testing, linting, and review workflows
- Support onboarding and upskilling of users in Git workflows and coding standards
- Design and implement data transformation pipelines using open-source tools like Apache Airflow, dbt, and VTL (Validation and Transformation Language)
- Maintain orchestration workflows and monitor execution of scheduled jobs
- Optimize task dependencies, retries, and performance within Airflow DAGs and dbt models
- Integrate ETL tools with source systems, metadata layers, and data warehouses
- Build reproducible workflows for data science, statistical analysis, and reporting using templated code bases and configuration-driven pipelines
- Develop modular, reusable components for data ingestion, cleaning, validation, and transformation
- Create infrastructure-as-code templates for deploying ETL tools in cloud or on-prem environments
- Support interoperability and standardization across analytics and data engineering teams
Technical Skills
- 6+ years of experience with data scripting and statistical programming languages (Python, R, Stata)
- Strong proficiency with Git-based workflows and tools (GitLab, GitHub, GitHub Actions)
- Experience configuring and working within IDEs such as VSCode, RStudio, Jupyter, and/or Cursor
- Proven track record implementing and managing open-source ETL/ELT tools (Airflow, dbt, GitLab Runners, VTL)
- Familiarity with data orchestration, testing, and observability for pipelines
- Experience developing CI/CD pipelines for analytical and data engineering use cases
- Knowledge of containerization (Docker) and task execution environments (Kubernetes, GitLab Runners)
- Scripting expertise (Bash, Python, YAML) for configuration, automation, and job orchestration
- Understanding of software engineering best practices (modular design, unit testing, reproducibility)
- Bachelors or Masters degree in Data Engineering, Computer Science, Statistics, or related field
- Experience working in collaborative research or analytics teams with reproducible coding standards
- Knowledge of data validation frameworks (e.g., Great Expectations, VTL), metadata integration, and lineage tracking
- Familiarity with cloud-native infrastructure and deployment (AWS, GCP, Azure)
- Contributions to or experience working with open-source ETL/analytics tooling
Emplois Recommandés
Technicien Helpdesk - Anglais - Paris (75) H/F
Qui sommes-nous ? On est une ESN spécialisée dans le domaine des infrastructures à taille humaine, avec plus de 30 ans d'histoire. Mais ce qui fait notre vraie différence ? On ne se contente p…
First Sales, Business Developer
À propos de Kolverr Kolverr développe un agent IA dédié aux professionnels de l’énergie (installateurs, bureaux d’étude, développeurs) pour simplifier leurs opérations, limiter les erreurs et ac…
After-Sales Service Coordination Assistant
Chaumet seeks an After-Sales Service Coordination Assistant for a six-month internship starting January 2026, based at the Levallois logistics platform. The role involves coordinating jewelry and watc…
Chef de Projet SAP SD H/F H/F
Détail de l'offre Informations générales Entité d'accueil Fondé en 1865 par Jules et Augustine Jaluzot, le Printemps est un des leaders français emblématiques du commerce dans les secteurs d…
Stage Banquier Privé - Gestion privée
- Aide à la rédaction des propositions d'investissement et d'allocations d'actifs - Assistance à la préparation des rendez-vous et à la bonne complétude des dossiers (analyse des dépôts, des comptes…
Stagiaire Chef(fe) de projet sustainability à l'EY ImpACT Lab - Paris - F/H - Dès que possible
En tant que Chef(fe) de projet sustainability de l'EY ImpACT Lab, vous ne serez pas seulement un observateur, mais un(e) acteur clé dans la transformation durable des entreprises. Vous aurez l'opportu…
Data Steward - H/F
L'entreprise L'Inserm est le seul organisme public français entièrement dédié à la recherche biologique, médicale et en santé des populations. Il dispose de laboratoires de recherche sur l'ensembl…
Coordinateur/ coordinatrice/ gestionnaire
Description du poste L'association départementale s'est développée intensivement ces deux dernières années. L'équipe de salariées est actuellement de 4 personnes qui assurent la réalisation de nom…
Assistant(e) marketing et communication en alternance h/f
Missions Création des outils de communication online (bannières, logos, pictogrammes) et offline Création et routage des mailings / newsletters Rédaction de contenus éditoriaux orientés SEO …
Expert Software Engineer (C++)
Who are we? At Finastra, we are a dynamic global provider of open finance software solutions, dedicated to expanding access to financial services. Our innovative applications span Lending, Payments…