Boston, MA

Prashank Kadam

Principal Machine Learning Engineer · Cerby

Principal ML Engineer leading foundation model development at Cerby for identity and access management automation. My work spans building and fine-tuning large language models, reinforcement learning, and geometric deep learning. Currently exploring mechanistic interpretability, understanding how LLMs arrive at their decisions. Cross-domain experience across FinTech, Life Sciences, Shipping, and Utility sectors.

Large Language ModelsMechanistic InterpretabilityReinforcement LearningGeometric Deep LearningGraph Neural NetworksMultimodal ML

Google Scholar LinkedIn GitHub CV prashankkadam07 [at] gmail [dot] com

Selected Research

Financial Fraud Detection using Jump-Attentive Graph-based Learning
2024

ICMLA 2024

Graph machine learning paper using Graph Neural Networks with attention-based residual connections to detect fraud in financial transactions. Outperforms state-of-the-art algorithms on benchmark datasets.

GNNAttentionFraud Detection

IEEE Xplore
Enhancing Financial Fraud Detection with Human-in-the-Loop Feedback and Feedback Propagation
2024

ICMLA 2024

Analyzes the impact of human-in-the-loop feedback on financial transactions and proposes a novel algorithm that propagates feedback signal further into the graph.

GNNHuman-in-the-LoopFraud Detection

IEEE Xplore
GPT-Guided Monte Carlo Tree Search for Symbolic Regression in Financial Fraud Detection
2024

CIKM 2024 RAGEnt Workshop

Foundation model trained on Symbolic Regression to generate rule sets for financial fraud detection. Rule sets run 60x faster than other state-of-the-art methods.

LLMMCTSSymbolic Regression

Workshop
Accelerating Neural MCTS Algorithms using Neural Sub-Net Structures
2023

AAMAS 2023 · with Prof. Karl Lieberherr, Ruiyang Xu

Step-MCTS: subnet structures within the complete network, each simulating a tree that provides lookahead for exploration. Optimizes training times of state-of-the-art AlphaZero MPV-MCTS.

RLNeural MCTSAlphaZero

Extended Abstract Full Paper
Optimizations over Neural MCTS for Combinatorial Games
2021

Master's Thesis, Northeastern University · Advisor: Prof. Karl Lieberherr | Reader: Prof. Christopher Amato

Meta MPV-MCTS and Dual MCTS AlphaZero: outperform vanilla AlphaZero on combinatorial games using limited training budgets. Implemented efficient training strategies for asymmetric games.

RLNeural MCTSGame Theory

Thesis
Dual MCTS
2021

arXiv · with Prof. Karl Lieberherr, Ruiyang Xu

Novel algorithm using a substructure within the neural network to achieve balanced accelerated learning. An intermediate policy is used to optimize the final policy that determines agent behavior.

RLNeural MCTS

arXiv
First-Order Problem Solving through Neural MCTS based Reinforcement Learning
2021

arXiv · with Prof. Karl Lieberherr, Ruiyang Xu

Evaluates first-order logic using game-theoretic semantics by dynamically mapping to a two-player game (logic game).

RLLogicGame Theory

arXiv

Experience

Principal Machine Learning Engineer · Cerby
Sep 2025 to Present

Led the design of AI-driven workflow code generation using transformer-based language models and graph machine learning, modeling user interactions, DOM structure, and execution flows as graphs to generate, validate, and evolve CHAOS automations. Architected multimodal ML systems leveraging Scout and Replayer data (DOM, rrweb events, screenshots, videos, and failure traces) to power error classification, failure clustering, and drift detection in production workflows. Built and owned scalable end-to-end ML infrastructure on AWS, including data pipelines, SageMaker training pipelines, model deployment, and monitoring, enabling reliable iteration toward self-healing automations.
Principal Machine Learning Engineer · Vesta Corporation
Jan 2025 to Sep 2025

Led design and deployment of graph-based fraud detection systems using GNNs and custom graph algorithms, including JA-GNN which outperformed state-of-the-art methods on proprietary and public datasets, improving risk attribution by over 20% across multi-block transaction networks. Built distributed graph pipelines with PySpark, DGL, Neo4j, and Redis, including a custom in-memory graph library enabling real-time subgraph extraction, similarity scoring, and risk propagation.
Senior Machine Learning Engineer · Vesta Corporation
Jan 2022 to Jan 2025

Engineered scalable data pipelines for temporal and graph-based feature generation over billions of transactions, enabling efficient training and inference for large-scale fraud models. Delivered end-to-end ML solutions on Azure, containerized with Kubernetes (AKS), integrating graph outputs into downstream risk models and real-time scoring workflows. Collaborated with investigators and cross-functional teams to convert user stories into graph-native features and deployed models using open-source ML frameworks like PyTorch and DGL.
Machine Learning and Optimization Engineer (Intern) · Waters Corporation
Jan 2021 to Jul 2021

Built and optimized end-to-end ML pipelines to reduce liquid chromatograph equilibration time by 30%, employing advanced signal decomposition on time-series sensor data. Integrated AWS Redshift with PySpark-based ETL pipelines and deployed a Streamlit application on EC2, streamlining real-time data ingestion and SME collaboration for model lifecycle management.
Teaching Assistant · Northeastern University
Aug 2020 to Jun 2021

Teaching Assistant for CS4100: Artificial Intelligence (Prof. Christopher Amato) and CS5100: Artificial Intelligence (Prof. Stacy Marcella).
Data Scientist · Maersk Tankers
Mar 2019 to Dec 2019

Developed HPC-ready genetic optimization models for vessel routing, integrating climatic and onboard sensor data to achieve a 14% improvement in fuel efficiency. Implemented a Flask-based web app transitioning vessels from HSFO to VLSFO, containerized and deployed on Kubernetes for real-time, high-performance fuel management. Partnered with operations and technology teams to develop distributed, data-driven solutions using PySpark pipelines, optimizing maritime logistics and reducing operational costs.
Data Analyst · Accenture Solutions Private Limited
Nov 2016 to Mar 2019

Built and deployed predictive pricing models using Spark MLlib on high-volume utility data, enabling real-time rate adjustments for electric and gas clients. Collaborated with data engineers and domain experts to streamline data pipelines and integrate open-source ML frameworks for more efficient model experimentation.

Education

Northeastern University
Jan 2020 - Dec 2021

Master of Science, Data Science (Thesis Track)

Thesis: Optimizations over Neural MCTS for Combinatorial Games (Advisor: Karl Lieberherr, Reader: Chris Amato). Coursework: Algorithms, Supervised/Unsupervised Machine Learning, Information Retrieval, Statistics.

GPA: 3.90
Harvard University
Aug 2020

Cross-Registration from NEU

Coursework: Deep Reinforcement Learning

GPA: 4.00
University of Pune
Aug 2012 - Jul 2016

Bachelor of Engineering, Electronics

Coursework: Object Oriented Programming, Data Structures, Digital Signal Processing

First Class with Distinction

Talks & Service

Invited Talks

Machine Learning Faculty Development Program

Visiting Lecturer, University of Pune

Feb 2025
Introduction to Reinforcement Learning

Visiting Lecturer, University of Pune

Nov 2022
First Order Problem Solving using Neural MCTS based Reinforcement Learning

Multidisciplinary Workshop on Stochastic Modelling and Monte Carlo Tree Search, TU Munich

Oct 2022
Machine Learning Fundamentals

Visiting Lecturer, University of Pune

Apr 2022

Academic Service

Reviewer - ICML 2026
PC Member - NeurIPS 2026
PC Member - SIGKDD 2026 (Feb & Aug cycles)
PC Member - ACM WebConf 2026
PC Member - SIGKDD 2025 (Feb & Aug cycles)
PC Member - SIGKDD 2024 (Feb cycle)
Reviewer - CIKM 2024

Projects

Health Care Entities and Relationship Extraction

Jan 2021 - Apr 2021

Deep Learning, NLP

BioBERT-based models for extracting NERs from EHR data and a market-basket model for Relationship Extraction (RE) between extracted NERs to produce knowledge graphs.

Meta Learning based Training for Deep Reinforcement Learning Agent

Advisor: Prof. Karl Lieberherr

Implemented ideas from Discovering Reinforcement Learning Algorithms (Oh et al.) using the backward LSTM approach. Tested on Nim, Othello, Connect4.

Data Processing and Modeling Tool

Jan 2020 - Apr 2020

Python, Dash, Heroku, Statsmodels, Plotly

Web application that lets the user upload a dataset and perform analysis or modeling using various machine learning algorithms.

StarGAN for Multi-domain Image Translation

Jan 2020 - Feb 2020

Computer Vision, TensorFlow

Translates human faces across domains (expressions, color, gender, age) using the GAN configuration from StarGAN (Choi et al.).

Integrating MPV-MCTS and Metacontroller with Persephone

Advisor: Prof. Karl Lieberherr

Multiple policy-value networks combined with the metacontroller from Deepmind's Agent 57 to achieve balanced learning in AlphaZero for combinatorial gaming problems.

Technical Stack

Languages

PythonRSQLPostgreSQL

ML Methods

Bayesian methodsTree-based learners (XGBoost, RF)GNN, CNN, RNN, LSTM, GAN, BERTDRQN, Neural-MCTS

Frameworks & Libraries

PyTorchKerasScikit-learnStatsmodelsDGL / PyGSpark MLlibPandas / NumPy

Cloud & Infrastructure

AWS (SageMaker, EC2, S3, ElasticSearch, Lambda, Redshift)Azure (ML Studio, Cosmos, AKS)Neo4j, Redis, PostgreSQL

Distributed & HPC

PyTorch DistributedHorovod, Ray, DeepSpeedDask, PySparkDocker, Kubernetes, Slurm, MPIMLFlow, Weights & Biases

Awards & Certifications

Awards

Northeastern University Research Fellowship
Northeastern University Tuition Scholarship
Teaching Assistant, Northeastern University
Accenture Celebrates Excellence Award (Sep 2018)
Accenture Outperformer Award (Feb 2018, Feb 2017)

Prashank Kadam

Selected Research

Financial Fraud Detection using Jump-Attentive Graph-based Learning

Enhancing Financial Fraud Detection with Human-in-the-Loop Feedback and Feedback Propagation

GPT-Guided Monte Carlo Tree Search for Symbolic Regression in Financial Fraud Detection

Accelerating Neural MCTS Algorithms using Neural Sub-Net Structures

Optimizations over Neural MCTS for Combinatorial Games

Dual MCTS

First-Order Problem Solving through Neural MCTS based Reinforcement Learning

Experience

Principal Machine Learning Engineer · Cerby

Principal Machine Learning Engineer · Vesta Corporation

Senior Machine Learning Engineer · Vesta Corporation

Machine Learning and Optimization Engineer (Intern) · Waters Corporation

Teaching Assistant · Northeastern University

Data Scientist · Maersk Tankers

Data Analyst · Accenture Solutions Private Limited

Education

Northeastern University

Harvard University

University of Pune