Thomas Helfer, Ph.D.

Artificial Intelligence | High Performance Computing | Physics

About Me

Research Fellow at the Institute for Advanced Computational Science at Stony Brook

Machine learning, High-Performance Computation, and Physics

My Story

Hello! I’m Thomas Helfer, and my academic and professional path has been driven by fascination with exploring new topics in science. My journey began at Imperial College London, where I delved into the intense world of theoretical physics during my master’s program. Transitioning from theoretical to computational physics for my Ph.D. was exciting. Having started with very little experience in coding, I was quickly able to upstart and get into the deep end of highly parallelized code in C++, even able to contribute to state-of-art codebases like GRChombo (now GRTL). My Ph.D. was a blend of theoretical concepts and practical simulations, offering me a comprehensive insight into the world of computational physics.

My first postdoc position marked another significant turn in my career. I became increasingly interested by deep learning, particularly superresolution. The remarkable achievements of DLSS in video games demonstrated to me the potential of these methods in scientific research, particularly in black hole simulations. This realization steered me towards deep learning, leading to my current role at the Institute for Advanced Computational Science at Stony Brook University. At Stony Brook, my focus is on leveraging deep learning to enhance simulations, exploring its applications not just in speeding up computational processes but also in broadening the scope of deep learning in scientific research. In addition to this, I have explored self-supervised learning (SSL) models and published a paper on multimodal learning using contrastive learning, further expanding the applications of AI in scientific research. What I love about my research is the opportunity to merge my background in physics with the exciting possibilities of AI, continually learning and applying these innovative techniques to new scientific questions.

Fig: A set of AI-generated galaxies using state-of-art diffusion models. Every single block is a synthetic galaxy, almost indistinguishable from real ones.

Fig: A rendering of a black hole disk using my ray-tracing code (github). The black hole distorts the surrounding light; this distortion is so strong that some light makes multiple turns around the black hole, creating multiple copies of the disk (in red and blue). A similar depiction of a black hole is seen in the movie “Interstellar” by Christopher Nolan. 

My achievements

Over the years, I have made significant contributions to advancing our understanding of numerical relativity, particularly in gravitational wave detection. My research has resulted in over twenty papers in prestigious scientific journals, accumulating more than a thousand citations. I’ve developed new, easily applicable methods to improve initial data for bosonic stars, which are now commonly used, and I created the first cosmic string simulations in general relativity, significantly improving upon Hawking’s theoretical estimates on gravitational wave output energy. I was the first to create head-on black hole simulations accurate enough to detect higher-order modes, as well as the first to apply deep learning to numerical relativity. These contributions have earned me a best thesis award and numerous job offers. I’ve also shared my work through over fifty presentations worldwide. As a mentor, I’ve supervised three Ph.D. students directly, many more in smaller projects, and led international, multi-institution collaborations. Now, I am eager for new challenges.

Selected projects

Maven: A Multimodal Foundation Model for Supernova Science

We use transformer based models to combine different modalities (time-domain, spectral).

Link to paper

Selected for NeurIPS TSALM oral

Link to talk 

In collaboration with Gemma Zhang [Harvard], Alexander T. Gagliano [IAIFI, MIT, Harvard] , Siddharth Mishra-Sharma [Currently Anthropic, work performed at IAIFI, MIT], V. Ashley Villa [Harvard]
Superresolution for Numerical Relativity

Upcoming work applying ML methods to improve PDE methods for general relativity.

Link to paper

Accepted for NeurIPS ML4PS

Link to talk

In collaboration with Thomas Edwards [Currently Meta, work performed at Johns Hopkins], Jessica Dafflon [Valence Labs], Matthew Lyle Olson [Intel Labs], Kaze Wong [Johns Hopkins]
Medical AI

In this work we translated MRI images to CT images using Pix2Pix (cGAN) as well as a diffusion based model with Controlnet

Link to paper

In collaboration with Jessica Dafflon [NIH], Walter Hugo Lopez Pinaya [King's College London]
Non-linearities in highly boosted black hole head-on merger

Published in Physical Review Letters

ArXiv: 2208.07374 

Press release

In this work, we used simulations of highly boosted black holes to study the aftermath and if non-linear effects are needed to describe the process. 

In collaboration with Mark Ho-Yeuk Cheung, Vishal Baibhav, Emanuele Berti, Vitor Cardoso, Gregorio Carullo, Roberto Cotesta, Walter Del Pozzo, Francisco Duque, Estuti Shukla, Kaze W. K. Wong
Full overview over publications: See

Google Scholar

Open source code projects

PyInterpX

A highly performant, GPU compatible package for higher order interpolation in PyTorch

TorchGRTL

A translation of crucial parts of numerical relativity in torch for superresolition

Multimodal learning

A pytorch codebase for multimodal learning in Astrophysics

GRChombo (renamed GRTL)

An AMR based open-source code for numerical relativity simulations.

Boson Star solver

A python solver that produces fully general relativistic solution for many different types of scalar and proca stars.

Geodesic shooter

A C++ codebase to solve to explore spacetimes with geodesics.

Education and work experience

Research Fellow in Deep Learning applied to Physics
2023 -
Institute for Advanced Computational Science, Stony Brook University
Postdoc in Computational Physics
2019 - 2023

with Prof. Emanuele Berti – Groundbreaking work on non-linear physics of black holes (see press-release)

PhD in Computational Physics
2015 - 2020

Under the supervision of Prof. Eugene Lim – Won thesis prize under the best 20 thesis for this year at King’s College 

MSc in Theoretical Physics (QFFF)
2014 - 2015
– Graduated with Distinction
BSc in physics at ETH Zürich
2010 - 2014

Prices and Grants won

King’s Outstanding Thesis Prize 2021
TACC Computational Grant

Access to the Frontera cluster via the Pathways Allocations 

Excellence Scholarships of South Tirol

Leistungs stipendium der Provinz Bozen

Professional development

Deep Learning

IAIFI Summer School (AI in physics)

A MIT, Harvard, Northeastern, and Tufts summer school covering:
- Generative Artificial Intelligence
- Symmetries in Neural Networks
- Trasformers
- Statistical physics of Neural Networks
See more Details .

Ellis Summer School (Generative and probabilistic AI)

A Cambridge University summer school covering a plethora of topics ranging from reinforcement learning in robotics to theory of stochastic differential equations.
See more Details .

DeepLearn International School on Deep Learning

A school on Deep Learning covering various topics of AI applications in science.
See more Details

Certifications

Roughly 20 certifications spanning from deep learning to MLOps, AWS Cloud (SageMaster), Generative Adversarial Networks (GANs) as well as Natural Language Processing (NLP).
See more Details

High Performance Computing

C++ Intermediate

A four day course organised by the high performance centre (HPC) in Stuttgart

C++ Advanced

A four day course organised by the HPC in Stuttgart

Single-node Performance Optimisation

Two day course in Oxford, UK. Topics include: topics: Parallel IO, Derived Datatypes, Basic MPI-IO Calls, HDF5 and NetCDF

Intro to OpenMP and MPI

One day course in Portsmouth, UK

Advanced OpenMP

One day course at Imperial College London, UK

Argonne Training on
Extreme-Scale Computing

Two week prestigious school on HPC from the Argonne National Laboratory