Tobasum Mandal

My Story

I grew up in motion. Six cities across India, each one a different language at the dinner table, a different school uniform, a different version of home. By the time I was sixteen, I had attended eight schools. Some people find roots in places. I found mine in the act of leaving them.

Then came UWC Mahindra, a boarding school in the hills of Pune where sixty-plus nationalities collided under one roof. It cracked open the world for me. I sat across from people whose childhoods were shaped by conflicts I had only read about, whose governments had failed them in ways I could barely fathom. And yet, despite growing up worlds apart, we found the same humor in bad cafeteria food. That was the first time I understood that inequity is not a statistic. It is the gap between what you see at home and what you see everywhere else.

I came to Vanderbilt not to escape that gap, but to learn how to close it.

Now, as a BS/MS student in Computer Science, Economics, and Applied Mathematics, I work at the intersection of mathematical theory and real-world systems. At IBM watsonx AI Labs, I build self-directed AI agents for live marketplaces. At Vanderbilt's Center for Transportation, I apply reinforcement learning and agent-based simulation to model Nashville's traffic corridors, optimizing congestion strategies across 10,000+ traffic patterns. In the Advanced Learning Lab, I engineer multimodal ML pipelines to predict student metacognition with 88% accuracy.

My current research explores training neural networks on fractal geometries, investigating how self-similar mathematical structures can inform more efficient network architectures. It sits at the intersection of pure mathematics and deep learning, the kind of problem that demands both rigorous theory and creative experimentation.

I think in math and build in code. Whether it is domain-adaptive pre-training for NLP models, LiDAR-guided autonomous navigation for search-and-rescue drones, or GPU-accelerated deep learning workflows for visual cognition, I gravitate toward problems where computational rigor meets tangible impact. I also play the violin and write, because some of the best ideas arrive outside the terminal.

I see technology as infrastructure. The systems that matter most are the ones built for communities that have historically been excluded from the design process.

Experience

What I'm building right now

Venture Fellow

IBM watsonx AI Labs

Building a self-directed growth agent for a live P2P AI credit marketplace. Automating GTM workflows end-to-end with LLM-driven outreach pipelines, contributing to a 3x increase in marketplace listings within the first month.

Mar 2026 - Present

Software Engineering Intern

Vanderbilt Center for Transportation and Operational Resiliency

Agent-based simulation framework using SimPy and R-tree indexing to model 10,000+ traffic patterns across Nashville's corridors. Applied reinforcement learning for dynamic incentive allocation, improving reward distribution efficiency by 15%. Calibrated behavioral models from 369,831 rideshare trips.

Jan 2026 - Present

Machine Learning Intern

Advanced Learning Lab (ALL), Vanderbilt University

Multimodal ML pipelines using Random Forest, XGBoost, and neural architectures to predict student metacognition, motivation, and math learning on Carnegie Learning's MATHia platform with 88% accuracy. Engineered 50+ multimodal features across linguistic, engagement, and time-series domains.

Aug 2025 - Present

Natural Language Processing Intern

iTELL AI

Domain-Adaptive Pre-Training (DAPT) on ModernBERT, BLEURT, and MPNet, achieving 83% accuracy in open-ended comprehension scoring while improving inference latency by 22%. Built end-to-end LLM-based automation pipelines for PDF-to-JSON conversion, accelerating processing by 60%.

Jan 2025 - Present

Machine Learning Research Intern

Category Lab (CATLAB), Vanderbilt University

Optimized DNN/CNN and CLIP-based vision-language models to study visual cognition and perceptual expertise, achieving 96% accuracy on ECOSET and ImageNet benchmarks. GPU-accelerated deep learning workflows with TensorFlow, PyTorch, and MATLAB.

Jun 2025 - Jul 2025

Projects

Hackathon Winner, AWS Autonomy

Edge-AI Earthquake Search and Rescue System

End-to-end UAV-UGV coordination system for post-earthquake SAR using Edge AI. Real-time thermal+RGB detection on Jetson Xavier NX with LiDAR-guided UGV SLAM/Nav2, enabling GPS-denied autonomous search with ~94% simulated uptime and ~60% lower alert latency over 2G/LoRa.

ROS 2 YOLO AWS IoT Jetson Xavier NX LiDAR MQTT

View on GitHub →

Hackathon Winner, LaborUp

Conversational AI Pipeline for Blue-Collar Hiring

Redis-backed MCP-powered AI voice agent supporting 5K+ concurrent calls with real-time adversarial detection and automated webhook delivery. Built for blue-collar recruitment using Python, FastAPI, Twilio, and OpenAI APIs.

Python FastAPI Redis MCP Twilio OpenAI

Ongoing Research

Neural Networks on Fractal Geometries

Investigating how self-similar mathematical structures and recursive fractal patterns can inform more efficient neural network architectures. Bridging pure mathematics and deep learning for novel training paradigms.

PyTorch Fractal Geometry Deep Learning

Transportation Research

Nashville Traffic Corridor Simulation

Agent-based simulation modeling 10,000+ traffic patterns with R-tree spatial indexing, greedy heuristics, dynamic programming, and metaheuristics. Calibrated from 369,831 rideshare trips to predict carpool adoption and departure time shifts.

SimPy R-tree Reinforcement Learning

Learning Analytics

Student Metacognition Prediction

Multimodal ML pipeline predicting student metacognition at 88% accuracy on Carnegie Learning's MATHia platform. 50+ features across linguistic, engagement, and time-series domains using NLP and advanced imputation methods.

XGBoost NLP Random Forest Time-Series

Computer Vision

Visual Cognition and Perceptual Expertise

DNN/CNN and CLIP-based vision-language models for studying visual cognition. GPU-accelerated workflows with TensorFlow, PyTorch, and MATLAB, integrating MDS and HDBSCAN for high-dimensional data reduction. 96% accuracy on ECOSET and ImageNet.

CLIP TensorFlow PyTorch CUDA