Machine Learning Researcher
Soham Dan
I build reliable language and agentic systems with an emphasis on reasoning, robustness, memory, and efficient learning.
I am a Machine Learning Researcher at Scale AI working on LLM agents. Previously, I was a Senior Research Scientist at Microsoft working on Copilot and a Research Scientist at IBM Research. I completed my Ph.D. in Computer and Information Science at the University of Pennsylvania, where I was advised by Prof. Dan Roth in the Cognitive Computation Group. Before that, I earned my undergraduate degree in Computer Science from IIT Kharagpur, where I was awarded the President of India Gold Medal.

Research areas
Language models that are more dependable
My work spans LLM agents, reasoning and planning, long-context behavior, memory, multilingual modeling, robustness, and alignment. I am interested in systems that perform reliably outside narrow benchmark settings.
Current focus
Practical evaluation and control for agentic systems
I care about the gap between what models can do in demos and what they can do in sustained deployment. That includes benchmarks, training algorithms, and evaluation methods that surface failure modes early.
Background
Industry and research across multiple settings
My research has crossed academia and industry, with work published in venues including ACL, NAACL, EMNLP, ICML, ICLR, CVPR, COLING, and AISTATS.
Selected recent work
Representative papers
Large Language Models can be Strong Self-Detoxifiers
Examines how language models can reduce toxic behavior through their own generation and control mechanisms.
Larimar: Large Language Models with Episodic Memory Control
Studies memory control mechanisms for large language models, with implications for persistent and agentic systems.
Understanding Robust Generalization in Learning Regular Languages
Analyzes how models generalize under robustness constraints, connecting theory and empirical behavior.
Get in touch
Open to research conversations
If your work touches LLM agents, reasoning, evaluation, or dependable machine learning systems, the best starting points are Google Scholar, LinkedIn, and my CV.