Beschreibung

Doctorant en NLP à Inria Paris (équipe ALMAnaCH). Je travaille sur le pretraining et l'adaptation de LLMs.

J'ai publié CamemBERT-bio, un modèle NLP biomédical français avec 100k+ téléchargements sur Hugging Face.

Pendant ma thèse j'ai contribué à GAPeron, une suite de LLMs français open-source (1.5B à 24B paramètres) entraîné sur +128 GPUs AMD/NVIDIA.

Ce que je sais faire:

- Pretraining de LLMs from scratch (DeepSpeed, FSDP, infrastructure distribuée)

- Data curation à grande échelle

- Déploiement de LLMs 7B-70B en production (vLLM, quantization GPTQ/AWQ)

- NER et extraction d'information

- Fine-tuning pour des domaines spécialisés (biomédical, légal, etc..)

Ingénieur diplômé de l'ECE Paris et CentraleSupélec en Intelligence Artificielle.

Dispo soirs et weekends pour des missions courtes.

Publications (2 first-author, 53 citations):

- CamemBERT-bio (LREC-COLING 2024)

- GAPeron LLMs (submitted to ACL 2026)

- Biomed-Enriched (submitted to ACL 2026)

- CamemBERT 2.0 (arXiv 2024)

Google Scholar: scholar.google.com/citations?user=f5hnXrcAAAAJGoogle Scholar: scholar.google.com/citations?user=f5hnXrcAAAAJ

Branchenexpertise

Sprachen

Französisch
Muttersprachlich oder zweisprachig
Englisch
Verhandlungssicher

Arbeitsortpräferenzen

Nur remote

Führt Projekte hauptsächlich remote aus

ViaDialog
R&D Consultant - NLP/NER
März 2025 - Juni 2025 (3 Monate)
Built French text anonymization system using NER for customer data. Designed LLM annotation pipeline (vLLM + constrained decoding) to generate synthetic training data; distilled to production NER model (0.82 F1).
Praxysanté
R&D Consultant - LLM Infrastructure
Juni 2023 - Juni 2024 (1 Jahr)
Deployed open-source LLMs (7B-70B) for healthcare applications with quantization (GPTQ, AWQ) on high-end GPUs. Built production inference pipeline with vLLM and FastAPI.
Inria
PhD student
FORSCHUNG
Juni 2022 - Heute (4 Jahre)
Paris, Frankreich
PhD thesis on LLM pretraining for clinical NLP, supervised by Laurent Romary and Eric de La Clergerie (ALMAnaCH team).

Key achievements:
- Trained a 7B decoder from scratch on 128 GPUs (8.7k GPU-hours), matching clinical SOTA with 2.5x fewer tokens
- Published CamemBERT-bio, a biomedical NLP model with 100k+ downloads on Hugging Face
- Core contributor to GAPeron, a suite of open French LLMs (1.5B-24B parameters)
- Research published at LREC-COLING 2024 and submitted to ACL 2026
NLP LLMs

Gesamte Berufserfahrung von Rian ansehen

Olivier

Monument SAS

Bewertet am 15.2.2022

Très satisfait du déroulement et du résultat de la mission : écoute, réactivité, objectif, délais tenus. Je recommande Rian.

Benutzerkonto gelöscht

Bewertet am 14.11.2018

1) tarif très compétitif pour les petites associations que nous somme 2) j ai eu 2 petite applications pour mes 2 associations qui sont différentes pour le même prix 300€ pour les 2 appli 3) il garantis les bug env 3 mois 4) il donne les sources choses qui était impératif pour nous 5) seul regret impossible de le rencontrer tout se passe à distance je recommande vivement Mr Ryan

Ehemaliger Nutzer und 2 weitere Personen empfehlen Rian

Agatha Frydrych

Backend Java Software Engineer

4.7

(3)

Baptiste Duhen

Fullstack developer

4.6

(4)

Amed Hamou

Senior Lead Developer

(2)

Audrey Champion

Web developer

4.3

(3)

Anmelden, um Profile zu sehen

Master's Degree in Engineering
ECE Paris École d'ingénieur
2022
• Maths (Linear Algebra, Probability, Calculus ...) • C/C++, Java, and C# programming • Algorithms and Data Structures, Graph Theory • Advanced Database and SQL
M2 - Intelligence Artificielle
CentraleSupélec
2022
•Machine Learning fundamentals and Deep Learning •Reinforcement Learning •Computer Vision and NLP •Explainable AI

Ausbildung und Abschlüsse von Rian ansehen

Developing Android Apps - MOOC
Google - Udacity
2017
Java Android
Kotlin for Android Developers - MOOC
Google - Udacity
2018
Kotlin

Data Scientists

Rian T.

PhD LLM Training & Deployment | NLP | AI

Über Rian

Projekt- und Berufserfahrung

Bewertungen

5.0

Qualität

5.0

Termintreue

5.0

Kommunikation

4.5

Olivier

Benutzerkonto gelöscht

Empfehlungen

Diese Freelancer passen auch zu Ihren Kriterien

Ausbildung und Abschlüsse

Zertifizierungen

Fähigkeiten

Kategorien