Romain Graux

ML Engineer

I'm an ML Engineer passionate about AI Safety and alignment. With experience in both academic research and software engineering, I focus on developing responsible AI systems, and I build software in my free time.

Currently seeking opportunities in AI Safety research and engineering roles. Open to full-time positions and collaborations.

Projects

  • Second Order Jailbreak

    NeurIPS 2023

    Dec 2023

    We authored a paper on Second Order Jailbreak, examining the risks posed by malignant intelligent actors spreading their influence over networks of agents with varying intelligence and motivations. We ran experiments in a multi-agent environment available here. You can browse all the conversations the agents had with each other on our playground website. We presented this work at NeurIPS 2023. The current version of the paper can be found here. We are working on a follow-up paper that will include a more detailed analysis of these risks and of the strategies that can mitigate them.

  • Co-Creator and Vice President

    Safe AI Lausanne Group

    Jan 2023 - Now

    Co-created and lead a volunteer-driven organization focused on AI safety and alignment, organizing talks, round tables, seminars and bootcamps to foster knowledge exchange and raise awareness about AI risks. In this role, I:

    • Led the AI Safety Fundamentals course from BlueDot Impact, implementing the AI Alignment Curriculum.
    • Organized a 2-week AI Safety bootcamp in Sep. 2023 to upskill 20 motivated participants and encourage them to work in the field of AI Safety. Delivered in-depth training on the technical aspects of the Transformer architecture, including mechanistic interpretability (Induction and Indirect Object Identification circuits), and provided lessons on RL, RLHF and jailbreaking of LLMs.

Experience

  • Data Officer

    NCCR Catalysis (EPFL/ETHZ)

    Jan 2023 - Dec 2024

  • Full-stack Software Engineer

    Graux Music

    Jan 2024 - Now

  • Teaching Assistant

    École Polytechnique de Louvain

    Sep 2020 - Jun 2021

    • Algorithms and data structures
    • Discrete math and probability
    • Numerical methods
    • Signals and systems
    • Python

  • Computer Vision Internship

    Aerospacelab

    Jul 2020 - Sep 2020

Education

  • M.S. Data Science Engineering

    École Polytechnique Fédérale de Lausanne

    Sep 2021 - Sep 2022

  • M.S. Data Science Engineering

    École Polytechnique de Louvain

    Sep 2020 - Sep 2022

  • B.Sc. Engineering

    École Polytechnique de Louvain

    Sep 2017 - Sep 2020