Mistral· Solutions· Paris
Applied Scientist / Research Engineer (Internship)
Classified Tasks (16)
Automate 6%Augment 50%Human-Only 44%
Automate (1)
Fully handled by AI agents
Generate data for pre-training and post-training.
operational
Augment (8)
AI assists, human decides
Develop state-of-the-art models across text, image, and speech modalities.
technical
Run pre-training and post-training and deploy state-of-the-art models on large GPU clusters.
technical
Curate data for pre-training and post-training.
operational
Perform model evaluations and optimize model performance to exceed expectations.
analytical
Develop tools and frameworks to facilitate data generation, model training, evaluation, and deployment.
technical
Collaborate with cross-functional teams to implement agent-based and retrieval-augmented generation (RAG) pipelines for complex use cases.
technical
Contribute to a large codebase and navigate it independently with little guidance.
technical
Write clean, readable, high-performance, fault-tolerant Python code.
technical
Human-Only (7)
Requires human judgment
Drive innovative research on AI models and methods.
leadership
Collaborate with clients on complex research projects.
communication
Develop novel methods and research ideas and apply models to diverse use cases and domains.
creative
Work cross-functionally with internal and external science, engineering, and product teams to deliver high-impact AI solutions.
communication
Manage research projects and communications with client research teams.
leadership
Deliver high-impact AI solutions that materially move product or research metrics.
operational
Ship code and features autonomously without managerial direction.
operational
Job description
About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed to meet enterprise needs, whether on-premises or in cloud environments. Our offerings include le Chat, the AI assistant for life and work. We are a dynamic, collaborative team passionate about AI and its potential to transform society. Our diverse workforce thrives in competitive environments and is committed to driving innovation. Our teams are distributed between France, USA, UK, Germany and Singapore. We are creative, low-ego and team-spirited. Join us to be part of a pioneering company shaping the future of AI. Together, we can make a meaningful impact. See more about our culture on https://mistral.ai/careers . Role Summary Mistral AI is seeking Applied Scientists Interns and Research Engineers Interns to drive innovative research and collaborate with clients on complex research projects. You will develop SOTA models across different modalities such as text, image, and speech. By developing novel methods and research ideas you will apply these models across a diverse set of use cases and domains. Working cross-functionally with both external and internal science, engineering, and product teams you will deliver high-impact AI solutions that turn the needle. This position is open for our local offices in Paris and London. What you will do • Run pre-training, post-training and deploy state of the art models on clusters with thousands of GPUs. You don’t panic when you see OOM errors or when NCCL feels like not wanting to talk. • Generate and curate data for pre-training and post-training, working on evaluations and making sure the model’s performance beats expectations. • Develop the necessary tools and frameworks to facilitate data generation, model training, evaluation and deployment. • Collaborate with cross-functional teams to tackle complex use cases using agents and RAG pipelines. • Manage research projects and communications with client research teams. About you • You are fluent in English, and have excellent communication skills. You are at ease explaining complex technical concepts to both technical and non-technical audiences. • You’re an expert with PyTorch or JAX. • You’re not afraid of contributing to a big codebase and can find yourself around independently with little guidance. • You write clean, readable, high-performance, fault-tolerant Python code. • You don’t need roadmaps: you just do. You don’t need a manager: you just ship. • Low-ego, collaborative and eager to learn. • You have a track record of success through personal projects, professional projects or in academia. It would be great if • You are pursuing a PhD / master in a relevant field (e.g., Mathematics, Physics, Machine Learning), but if you’re an exceptional candidate from a different background, you should apply. • We’d love to have you for at least 3 months, ideally 6 months. We prioritise candidates who are about to finalise their studies. • You can bring a variety of research experiences, such as working with agents, multi-modality, robotics, diffusion models, or time-series analysis. • Have contributed to a large codebase used by many (open source or in the industry). • Have a track record of publications in top academic journals or conferences. • Love improving existing code by fixing typing issues, adding tests and improving CI pipelines. • We warmly welcome applicants of every gender, background, and life experience. Benefits 💰 Competitive salary 🥕 Food : Daily lunch vouchers 🥎 Sport : Monthly contribution to a Gympass subscription 🚴 Transportation : Monthly contribution to a mobility pass By applying, you agree to our Applicant Privacy Policy .