Senior AI Engineer - APM Experiences
- Debug and investigate application performance issues down to the root cause, as both a developer assistant and a fully autonomous agent
- Proactively recommend performance and reliability optimizations to prevent the next incident
- Automatically create intelligent monitors and SLOs for the most important business flows and critical paths
- Shape AI experiences for APM. Design and ship LLM/agentic workflows that analyze traces, metrics, logs, and other telemetry to generate diagnoses, explanations, and guided fixes.
- Own the full loop. Prototype quickly, define success metrics and evals, run experiments, iterate, and ultimately productionize for scale and reliability.
- Build robust agent systems. Develop tools, retrieval and planning strategies, and guardrails; manage prompts/evals; design fallbacks and human-in-the-loop paths.
- Integrate with Datadog's platform. Leverage surfaces like Trace Explorer, Service Catalog, monitors, and workflows to deliver end-to-end value in the APM UI.
- Partner deeply. Collaborate with PM, Design, and partner teams to build cohesive experiences.
- Raise the bar on engineering. Write performant, maintainable backend code, own services in production, and improve reliability for high-throughput, low-latency data systems.
- 4+ years building backend or real-time ML systems; you value simplicity, correctness, and performance
- Proven experience delivering LLM/agent features to production (prompting, tooling, evals, safety/guardrails)
- Comfortable owning user journeys, iterating from prototype → alpha → GA, and measuring impact with clear product metrics
- Solid grasp of the ML lifecycle (task definition, dataset collection, modeling, evaluation, deployment, iteration) and statistics (experiment design, confidence intervals)
- Experience choosing/modeling the right technique for the job (e.g., anomaly detection, ranking/recommendation, NLP), and knowing when a heuristic beats a model
- Fluency with offline/online evals for AI systems; able to build reliable golden sets and automated regression checks
- Experience with microservices performance: tracing, latency breakdowns, concurrency, and resiliency patterns
- Proficient in Go, Java, or Python; strong API/service design; production ops (monitoring, alerting, on-call rotation)
- Hands-on with distributed tracing stacks (OpenTelemetry/Datadog APM), profilers, and logs/metrics pipelines
- Exposure to planning/agent frameworks, tool-use orchestration, RAG, and retrieval/indexing for observability data
- Familiarity with SLO/SLA practices and incident response
- Build tools for software engineers just like yourself, and use the tools we build to accelerate our own development.
- Have a lot of influence on product direction and impact on the business.
- Work with skilled, knowledgeable, and kind teammates who are happy to teach and learn.
- Competitive global benefits.
- Continuous professional development.