Haize Labs
Employee #1 and Member of Technical Staff. Working on dynamic evaluations for AI safety. e.g. Active Learning for Aligning LLM judges to human evaluations
I am an AI safety researcher primarily interested in evaluations and model
organisms of misalignment;.
Currently, I am employee #1 and Member of Technical Staff at Haize Labs,
working on dynamic safety evals;
My background is in building and productionalizing AI applications and I have an
interdisciplinary PhD in Physics, Statistic, and Data Science from MIT;
I strive to approach life with kindness and mindful reflection, recognizing that everyone has
their own unseen struggles while staying grounded in self-awareness about my path and purpose.
Employee #1 and Member of Technical Staff. Working on dynamic evaluations for AI safety. e.g. Active Learning for Aligning LLM judges to human evaluations
Conceptualized and ran the McKinsey partnership with FDL for a 9-week research sprint to tackle climate change through the world's first accurate real-time induced seismicity forecasting system. Ran a team of four PhD-level interdisciplinary researchers plus ~10 supportes as ML and management lead. By reducing the runtime of SOTA approaches from 22 hours with manual setup to 2 minutes without, our team addressed a main pain point of the seismicity community and fulfilled a central goal of the DoE's multi-institution and -year SMART initiative.
Led the development and implementation of cutting-edge AI solutions for Fortune 500 clients' most critical business challenges. Spearheaded multiple cross-functional teams across organizations to analyze requirements, architect technical solutions, and rapidly deliver production-ready systems with measurable business impact. Successfully delivered machine learning solutions across six different industries, including leading the end-to-end development of an enterprise-grade conversational AI system with robust evaluation frameworks and safety guardrails.
In 2020, when the regime of AI foundations model was just starting, I worked at Amazon Alexa to supparize medical text. After solving the business problem by running the SOTA summarization model, I developed an LLM sampling algorithm that removed pathologies and improved fluency.
Worked on machine learning for particle physics supervised by Mike Williams. At CERN, sifted through 200TB of LHCb data to better understand dark matter, setting world-leading limits on dark photons [163 citations]. Made the following contributions to ML for Science:
Over the years I have taken part in various technical competitions, which test team work and under pressure:
Programming: | Python (pytorch, keras, tensorflow, scikit-learn, numpy), C++ (STL and ROOT) , R, $\LaTeX$, iOS, Linux, AWS |
Statistics: | Hypothesis testing, frequentist confidence intervals, parameter estimation, Bayesian statistics |
Machine Learning: | Linear regression, CART, SVM, Naive Bayes, kNN, CNNs, RNNs, LSTM, GRU, BERT, VAE, GAN, PCA, SGD, ADAM, ensemble algorithms, LASSO and ridge regularization |
Languages: | German Native Speaker, English Fluent, Spanish B1, French A1 |