Guillaume Geoffroy

Data Engineer | Databricks | PySpark | CI/CD Azure

EUR 750/day
Lausanne, CH
3-7 years

Average response time: 1h

About Guillaume

Data Engineer PySpark & Cloud | Scalable Data Pipelines

I help companies design, build, and optimize robust and scalable data pipelines in cloud environments.

Specialized in PySpark, Databricks, and Airflow, I support end-to-end data platform projects from architecture to production deployment.


Core expertise:

ETL/ELT pipelines (PySpark, Databricks, Airflow)
Cloud data platforms (Azure, AWS, GCP)
Data lakehouse architecture (Delta Lake)
CI/CD, Terraform, and DevOps practices
Data quality & pipeline monitoring
Spark performance optimization
GitHub Copilot integration

Experience:
5 years working on large-scale data projects in energy, aerospace, and industrial environments.


Focus:

Reliable, scalable, production-ready data systems with strong engineering standards.

Available remotely or in Switzerland / France.
  • French

    Native or bilingual

  • English

    Business fluent

  • Spanish

    Conversational

On-site work possible
Lausanne (up to 50 km), Geneva (up to 50 km)

Project and professional experience

  • Engie - V.I.E
    Data Engineer
    ENERGY
    June 2025 - Present (1 year)
    Brussels, Belgium

    Designed and managed data pipelines for energy consumption and billing data.


    Developed a Python library to structure ETL workflows, including complex PySpark transformations on time-series data. Worked in VS Code with GitHub Copilot.

    Industrialized and managed Databricks jobs, with scheduling through Apache Airflow and data storage on S3 using Delta Lake format.

    Orchestrated CI/CD with Azure DevOps and IaC deployments with Terraform across Databricks environments (dev, preprod, prod).

    Integrated GitHub Copilot into the development workflow for code generation, refactoring, and pull request review support.

    Built a data quality framework within the library, implementing checks for duplicates, overlaps, and completeness. Used a Docker image for unit and functional testing.

    Performed data analysis and developed dashboards with Databricks.
    Databricks Docker PySpark Azure DevOps Terraform
  • Terra Systema
    CDD Data Scientist
    FOOD INDUSTRY
    May 2024 - July 2024 (2 months)
    Molsheim, France

    Analyzed weather sensor data to anticipate late frost events.

    Led the project autonomously, coordinating with multiple stakeholders.

    Analyzed time-series data from weather sensors and developed solutions on Linux using Python (Pandas, Matplotlib, TensorFlow) and MySQL.

    Designed a proof of concept and built a deep learning model (CNN/LSTM) to estimate dew point at parcel level.
    Python MySQL Deep Learning
  • Cs Group
    CDI Data Engineer
    AEROSPACE
    June 2021 - April 2023 (1 year and 10 months)
    Toulouse, France

    Predicted aircraft failures for Airbus and airline operators.

    Filtered, analyzed, and visualized multi-source aircraft sensor data, including model development and alert monitoring.

    Developed a Python library dedicated to model development, built on complex PySpark transformations.

    Industrialized Big Data models using internal DevOps tools within a continuous integration framework.

    Used the internal CodeWorkbook ETL for model prototyping and validation.
    Python Spark ETL GitHub DevOps

Education and degrees

  • Final-year exchange
    Université Laval
    2020
    Final-year exchange, Specialization in Machine Learning and Advanced Python
  • Engineering degree
    SUPMICROTECH-ENSMM
    2020
    Computer Science

Skills

Categories