Software Engineer, Technical Lead, Inference

Mistral AI
Paris
About Mistral

At Mistral AI, we believe in the power of AI to simplify tasks, save time and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life.

We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work.

We are a dynamic, collaborative team passionate about AI and its potential to transform society.

Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited.

Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on

Role summary

As the Technical Lead for the Inference team, you will drive the architecture and optimization of our inference backbone, ensuring high performance, scalability, and efficiency in a dynamic environment. You will lead the acquisition and automation of benchmarks, collaborate with cross-functional teams, and innovate solutions to enhance our AI-powered applications.

What you will do

• Architect and optimize the inference for high-volume, low-latency, and high-availability environments.

• Lead the acquisition and automation of benchmarks at both micro and macro scales.

• Introduce new techniques and tools to improve performance, latency, throughput, and efficiency in our model inference stack.

• Build tools to identify bottlenecks and sources of instability, and design solutions to address them.

• Collaborate with machine learning researchers, engineers, and product managers to bring cutting-edge technologies into production.

• Optimize code and infrastructure to maximize hardware utilization and efficiency.

• Mentor and guide team members, fostering a culture of collaboration, innovation, and continuous learning.

About you

• Extensive experience in C++ and Python, with a strong focus on backend development and performance optimization.

• Deep understanding of modern ML architectures and experience with performance optimization for inference.

• Proven track record with large-scale distributed systems, particularly performance-critical ones.

• Familiarity with PyTorch, TensorRT, CUDA, NCCL.

• Strong grasp of infrastructure, continuous integration, and continuous development principles.

• Ability to lead and mentor team members, driving projects from concept to implementation.

• Results-oriented mindset with a bias towards flexibility and impact.

• Passion for staying ahead of emerging technologies and applying them to Al-driven solutions.

• Humble attitude, eagerness to help colleagues, and a desire to see the team succeed.

Our Culture

We're driven to build a strong company culture and are looking for individuals with solid alignment with the following:

• Reason with rigor

• Are you audacious enough?

• Make our customers succeed

• Ship early and accelerate

• Leave your ego aside

Location & Remote

This role is primarily based at our HQ in Paris, France. We will prioritize candidates who either reside in Paris or are open to relocating. We strongly believe in the value of in-person collaboration to foster strong relationships and seamless communication within our team. Our remote work policy is designed to offer flexibility, enhance work-life balance, and boost productivity. The number of remote workdays is determined by each manager, taking into account individual autonomy and specific circumstances-such as increased flexibility during the summer months. Regardless of the arrangement, we expect all employees to maintain open lines of communication with their teams and be available during core working hours.

In certain specific situations, we will also consider remote candidates based in one of the countries listed in this job posting (currently France, UK, Germany, Netherlands, Spain and Italy). In that case, we ask all new hires to visit our Paris office:

• for the first month of their onboarding (accommodation and travelling covered)

• then at least 3 days per month

What we offer

Competitive salary and equity

Health insurance

Transportation allowance

Sport allowance

Meal vouchers

Private pension plan

Parental : Generous parental leave policy

Visa sponsorship

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Publié le 2025-10-18

Emplois Recommandés

Gifting & Special Projects Intern

Chanel
Paris

Chanel is seeking a Gifting & Special Projects Intern in Paris. This role involves designing a variety of products, managing product development, and collaborating with multiple departments. Candidate…

Voir les Détails
Publié le 2025-09-21

Intellectual Property and Business Law Specialist

Guerlain
Paris

Guerlain, part of the LVMH group, seeks an experienced Intellectual Property and Business Law Specialist in Paris. The role involves providing legal counsel on intellectual property and business law f…

Voir les Détails
Publié le 2025-10-12

Legal Intern in Distribution Law

DFS
Paris

Samaritaine Paris Pont-Neuf seeks a Legal Intern specializing in Distribution Law. This full-time, on-site internship in Paris offers the opportunity to work closely with the legal department, assisti…

Voir les Détails
Publié le 2025-09-21

Global Leadership Learning & Development Intern

LVMH
Paris

LVMH seeks a Global Leadership Learning & Development Intern in Paris to join its international team. The role involves coordinating training logistics, supporting program delivery, analyzing data, an…

Voir les Détails
Publié le 2025-09-27

Directeur clients nationaux F/H

PERSUADERS
Paris

Entreprise « Soyez vous-même, les autres sont déjà pris. » Depuis plus de 20 ans, PERSUADERS RH, s’est construit sur une expertise forte dans le recrutement de profils complexes et a développé un…

Voir les Détails
Publié le 2025-09-15

Comptable fournisseur H/F

LTd
Paris

Le poste : LTD, cabinet de recrutement et agence de travail temporaire, recherche pour le compte de son client, un(e) comptable fournisseurs H/F en intérim. Le poste est basé à Paris. Dans le ca…

Voir les Détails
Publié le 2025-09-26

Avocat M&A/pivate equity min. 3 ans (H/F)

Fed Finance
Paris

Vous gérerez des transactions complexes pour des clients internationaux exigeants, et développerez vos compétences dans un environnement particulièrement compétitif et stimulant. L’équipe est recon…

Voir les Détails
Publié le 2025-09-01

Senior UX Designer - Offer Discovery

Veepee
Paris

Pioneer of online flash sales since 2001 and key player in European e-commerce, Veepee collaborates with over 7,000 brands to offer highly discounted products available for a limited time. Operating …

Voir les Détails
Publié le 2025-10-15

Dermatologue - Paris 8e H/F - Libéral

Emploi Dermatologue Paris 75 | JoberGroup
Paris

Dermatologue - Paris 8e H/F - Libéral Emploi Dermatologue H/F - Paris 8e Nous recrutons un dermatologue H/F afin d’intégrer une maison d’esthétique intégrative située dans le 8e arrondissement de Pa…

Voir les Détails
Publié le 2025-08-28

Gynécologue F/H - 42.9% CP inclus Paris 75008

Joseph Sebban
Paris

La Solution Médicale est une agence spécialisée dans le recrutement de professionnels de santé. Elle accompagne, depuis 2014, les responsables des centres médicaux et dentaires dans leur recherche de…

Voir les Détails
Publié le 2025-07-07