Prashank Kadam

Boston, MA

Prashank Kadam

Principal Machine Learning Engineer · Cerby

Principal ML Engineer leading foundation model development at Cerby for identity and access management automation. My work spans building and fine-tuning large language models, reinforcement learning, and geometric deep learning. Currently exploring mechanistic interpretability, understanding how LLMs arrive at their decisions. Cross-domain experience across FinTech, Life Sciences, Shipping, and Utility sectors.

Large Language ModelsMechanistic InterpretabilityReinforcement LearningGeometric Deep LearningGraph Neural NetworksMultimodal ML
Google Scholar LinkedIn GitHub CV prashankkadam07 [at] gmail [dot] com
Prashank Kadam

Selected Research

  1. Financial Fraud Detection using Jump-Attentive Graph-based Learning

    2024

    ICMLA 2024

    Graph machine learning paper using Graph Neural Networks with attention-based residual connections to detect fraud in financial transactions. Outperforms state-of-the-art algorithms on benchmark datasets.

    GNNAttentionFraud Detection
  2. Enhancing Financial Fraud Detection with Human-in-the-Loop Feedback and Feedback Propagation

    2024

    ICMLA 2024

    Analyzes the impact of human-in-the-loop feedback on financial transactions and proposes a novel algorithm that propagates feedback signal further into the graph.

    GNNHuman-in-the-LoopFraud Detection
  3. GPT-Guided Monte Carlo Tree Search for Symbolic Regression in Financial Fraud Detection

    2024

    CIKM 2024 RAGEnt Workshop

    Foundation model trained on Symbolic Regression to generate rule sets for financial fraud detection. Rule sets run 60x faster than other state-of-the-art methods.

    LLMMCTSSymbolic Regression
  4. Accelerating Neural MCTS Algorithms using Neural Sub-Net Structures

    2023

    AAMAS 2023 · with Prof. Karl Lieberherr, Ruiyang Xu

    Step-MCTS: subnet structures within the complete network, each simulating a tree that provides lookahead for exploration. Optimizes training times of state-of-the-art AlphaZero MPV-MCTS.

    RLNeural MCTSAlphaZero
  5. Optimizations over Neural MCTS for Combinatorial Games

    2021

    Master's Thesis, Northeastern University · Advisor: Prof. Karl Lieberherr | Reader: Prof. Christopher Amato

    Meta MPV-MCTS and Dual MCTS AlphaZero: outperform vanilla AlphaZero on combinatorial games using limited training budgets. Implemented efficient training strategies for asymmetric games.

    RLNeural MCTSGame Theory
  6. Dual MCTS

    2021

    arXiv · with Prof. Karl Lieberherr, Ruiyang Xu

    Novel algorithm using a substructure within the neural network to achieve balanced accelerated learning. An intermediate policy is used to optimize the final policy that determines agent behavior.

    RLNeural MCTS
  7. First-Order Problem Solving through Neural MCTS based Reinforcement Learning

    2021

    arXiv · with Prof. Karl Lieberherr, Ruiyang Xu

    Evaluates first-order logic using game-theoretic semantics by dynamically mapping to a two-player game (logic game).

    RLLogicGame Theory

Experience

  1. Principal Machine Learning Engineer · Cerby

    Sep 2025 to Present

    Led the design of AI-driven workflow code generation using transformer-based language models and graph machine learning, modeling user interactions, DOM structure, and execution flows as graphs to generate, validate, and evolve CHAOS automations. Architected multimodal ML systems leveraging Scout and Replayer data (DOM, rrweb events, screenshots, videos, and failure traces) to power error classification, failure clustering, and drift detection in production workflows. Built and owned scalable end-to-end ML infrastructure on AWS, including data pipelines, SageMaker training pipelines, model deployment, and monitoring, enabling reliable iteration toward self-healing automations.

  2. Principal Machine Learning Engineer · Vesta Corporation

    Jan 2025 to Sep 2025

    Led design and deployment of graph-based fraud detection systems using GNNs and custom graph algorithms, including JA-GNN which outperformed state-of-the-art methods on proprietary and public datasets, improving risk attribution by over 20% across multi-block transaction networks. Built distributed graph pipelines with PySpark, DGL, Neo4j, and Redis, including a custom in-memory graph library enabling real-time subgraph extraction, similarity scoring, and risk propagation.

  3. Senior Machine Learning Engineer · Vesta Corporation

    Jan 2022 to Jan 2025

    Engineered scalable data pipelines for temporal and graph-based feature generation over billions of transactions, enabling efficient training and inference for large-scale fraud models. Delivered end-to-end ML solutions on Azure, containerized with Kubernetes (AKS), integrating graph outputs into downstream risk models and real-time scoring workflows. Collaborated with investigators and cross-functional teams to convert user stories into graph-native features and deployed models using open-source ML frameworks like PyTorch and DGL.

  4. Machine Learning and Optimization Engineer (Intern) · Waters Corporation

    Jan 2021 to Jul 2021

    Built and optimized end-to-end ML pipelines to reduce liquid chromatograph equilibration time by 30%, employing advanced signal decomposition on time-series sensor data. Integrated AWS Redshift with PySpark-based ETL pipelines and deployed a Streamlit application on EC2, streamlining real-time data ingestion and SME collaboration for model lifecycle management.

  5. Teaching Assistant · Northeastern University

    Aug 2020 to Jun 2021

    Teaching Assistant for CS4100: Artificial Intelligence (Prof. Christopher Amato) and CS5100: Artificial Intelligence (Prof. Stacy Marcella).

  6. Data Scientist · Maersk Tankers

    Mar 2019 to Dec 2019

    Developed HPC-ready genetic optimization models for vessel routing, integrating climatic and onboard sensor data to achieve a 14% improvement in fuel efficiency. Implemented a Flask-based web app transitioning vessels from HSFO to VLSFO, containerized and deployed on Kubernetes for real-time, high-performance fuel management. Partnered with operations and technology teams to develop distributed, data-driven solutions using PySpark pipelines, optimizing maritime logistics and reducing operational costs.

  7. Data Analyst · Accenture Solutions Private Limited

    Nov 2016 to Mar 2019

    Built and deployed predictive pricing models using Spark MLlib on high-volume utility data, enabling real-time rate adjustments for electric and gas clients. Collaborated with data engineers and domain experts to streamline data pipelines and integrate open-source ML frameworks for more efficient model experimentation.

Education

  1. Northeastern University

    Jan 2020 - Dec 2021

    Master of Science, Data Science (Thesis Track)

    Thesis: Optimizations over Neural MCTS for Combinatorial Games (Advisor: Karl Lieberherr, Reader: Chris Amato). Coursework: Algorithms, Supervised/Unsupervised Machine Learning, Information Retrieval, Statistics.

    GPA: 3.90

  2. Harvard University

    Aug 2020

    Cross-Registration from NEU

    Coursework: Deep Reinforcement Learning

    GPA: 4.00

  3. University of Pune

    Aug 2012 - Jul 2016

    Bachelor of Engineering, Electronics

    Coursework: Object Oriented Programming, Data Structures, Digital Signal Processing

    First Class with Distinction

Talks & Service

Invited Talks

Academic Service

Projects

Health Care Entities and Relationship Extraction

Jan 2021 - Apr 2021

Deep Learning, NLP

BioBERT-based models for extracting NERs from EHR data and a market-basket model for Relationship Extraction (RE) between extracted NERs to produce knowledge graphs.

Meta Learning based Training for Deep Reinforcement Learning Agent

Advisor: Prof. Karl Lieberherr

Implemented ideas from Discovering Reinforcement Learning Algorithms (Oh et al.) using the backward LSTM approach. Tested on Nim, Othello, Connect4.

Data Processing and Modeling Tool

Jan 2020 - Apr 2020

Python, Dash, Heroku, Statsmodels, Plotly

Web application that lets the user upload a dataset and perform analysis or modeling using various machine learning algorithms.

StarGAN for Multi-domain Image Translation

Jan 2020 - Feb 2020

Computer Vision, TensorFlow

Translates human faces across domains (expressions, color, gender, age) using the GAN configuration from StarGAN (Choi et al.).

Integrating MPV-MCTS and Metacontroller with Persephone

Advisor: Prof. Karl Lieberherr

Multiple policy-value networks combined with the metacontroller from Deepmind's Agent 57 to achieve balanced learning in AlphaZero for combinatorial gaming problems.

Technical Stack

Languages

PythonRSQLPostgreSQL

ML Methods

Bayesian methodsTree-based learners (XGBoost, RF)GNN, CNN, RNN, LSTM, GAN, BERTDRQN, Neural-MCTS

Frameworks & Libraries

PyTorchKerasScikit-learnStatsmodelsDGL / PyGSpark MLlibPandas / NumPy

Cloud & Infrastructure

AWS (SageMaker, EC2, S3, ElasticSearch, Lambda, Redshift)Azure (ML Studio, Cosmos, AKS)Neo4j, Redis, PostgreSQL

Distributed & HPC

PyTorch DistributedHorovod, Ray, DeepSpeedDask, PySparkDocker, Kubernetes, Slurm, MPIMLFlow, Weights & Biases

Awards & Certifications

Awards

Certifications