Data Engineer - Spark Specialist
- Help customers design, build, and optimize Flows in Dataiku, improving overall project performance and maintainability
- Debug and enhance complex Spark code and data pipelines for better performance and reliability.
- Guide clients in tuning and scaling Spark environments, such as Kubernetes and Databricks, including providing architectural guidance and best practices to enhance performance and reliability.
- Optimize SQL-based data pipelines to ensure efficient and robust data workflows within Dataiku.
- Advise clients on integrating different data pipelines (Spark, SQL, Python) into optimized solutions
- Collaborate with internal teams to resolve technical issues and contribute to the knowledge base.
- Proficiency in writing and debugging PySpark code for large-scale data processing.
- Experience with Parquet, Delta Lake, and columnar file formats.
- Understanding of Spark's interaction with metastores (e.g., Hive, Unity Catalog).
- Deep understanding of resource management: Spark executors, cores, memory, and relevant configurations (e.g., spark.executor.memory, spark.sql.shuffle.partitions).
- Expertise in tuning Spark jobs: partitioning, caching, broadcast joins, and avoiding unnecessary shuffles.
- Familiarity with lakehouse architectures and ACID-compliant data layers (Delta Lake, Iceberg, Hudi).
- Experience working with Databricks, including Databricks Connect and Databricks Workflows.
- Experience automating and scheduling Spark jobs using tools like Apache Airflow or native orchestration tools.
- Proven experience developing, optimizing, and troubleshooting SQL-based data pipelines for efficient ETL and data transformation processes.
- Proficiency in building and managing data transformation workflows in Python, leveraging frameworks such as pandas.
- Familiarity with data modeling concepts and data quality best practices.
- Experience integrating data from a variety of sources, including databases, APIs, and cloud storages.
- Ability to communicate technical concepts effectively to both technical and non-technical stakeholders.
- Initial call with a member of our Technical Recruiting team
- Video call with the Field Engineer Hiring Manager
- Technical Assessment to show your skills (Home Test)
- Debrief of your Tech Assessment with FE Team members
- Final Interview with the VP Field Engineering
Emplois Recommandés
MÉDECIN ANESTHÉSISTE RÉANIMATEUR (F/H)
Marque Rejoignez les 30 000 collaborateurs de l'Appel Médical et bénéficiez de nombreuses missions et emplois les plus adaptés à vos envies et compétences tout en profitant des nombreux services et…
Consultant.e Data Analyst Power BI F/H
Qui sommes-nous? Valoway, pure player Data, IA & Cloud, nous intervenons sur des projets innovants, à travers l’élaboration et la mise en œuvre stratégique de transformations numé…
🏊 Professeur de Natation H/F - Paris- Urgent
Le poste : &##128218; Description du poste Cours particuliers de natation (apprentissage, perfectionnement, confiance dans l'eau). À domicile (piscine privée) ou en piscine publique selon les d…
Lead Consultant Data (H-F)
Description de l'entreprise Arcane est un cabinet de conseil en Data et Marketing Digital à forte croissance qui conçoit des stratégies d'acquisition digitales avancées en exploitant la richesse d…
Médecin Coordonnateur Rééducation - F/H
Le CabRH, cabinet de recrutement et d’approche directe, recrute pour son client un Médecin Coordonnateur Rééducation (H/F). Localisation indiquée Paris 8ᵉ – poste situé en Normandie , facilemen…
Senior Designer, Knitwear
Celine is seeking a Senior Designer for Knitwear in Paris. The role involves contributing to both men's and women's collections, collaborating closely with design and production teams. Candidates shou…
Stage de 6 mois - Réemploi des matériaux du bâtiment - BELLASTOCK SCIC - Paris
Description de l’entreprise Bellastock est une société coopérative qui oeuvre pour la valorisation des lieux et de leurs ressources en proposant des alternatives à l’acte de construire. En effet, La…
Ingénieur Data Science (F/H/X)
Description de l'entreprise Description de l'entreprise EPSILON France est l'entité Datamarketing de Publicis Groupe avec 650 experts data, marketing et technologiques et plus de 40 métiers représ…
Project Manager (SuccessFactors solution) France H/F
ARAGO Consulting is an international leader in the implementation of the digital transformation of the HR function with innovative cloud HR solutions. We support our clients throughout their HRIS pro…
Consultant.e SAP FI CO
Prêt.e à relever des défis ? Intégré.e au pôle Finance (15 personnes), dédié aux activités de consulting SAP de la finance (FI) et du contrôle de gestion (CO), vous serez l’i nterlocuteur…