Research Publications
GAICo: A Deployed and Extensible Framework for Evaluating Diverse and Multimodal Generative AI Outputs
Proceedings of the Thirty-Eighth Annual Conference on Innovative Applications of Artificial Intelligence (IAAI/AAAI 2026), January 2026
On Identifying Why and When Foundation Models Perform Well on Time-Series Forecasting Using Automated Explanations and Rating
AAAI2025 Fall Symposium on AI Trustworthiness and Risk Assessment for Challenged Contexts (ATRACC), Arlington, VA, USA, Nov 2025
FABLE: A Novel Data-Flow Analysis Benchmark on Procedural Text for Large Language Model Evaluation
Preprint, May 2025
Revisiting LLMs in Planning from Literature Review: a Semi-Automated Analysis Approach and Evolving Categories Representing Shifting Perspectives
Proceedings of the 35th International Conference on Automated Planning and Scheduling (ICAPS), 2025, Melbourne, Australia, Nov 2025
The Case for Developing a Foundation Model for Planning-like Tasks from Scratch
Accepted at Planning and Reinforcement Learning (PRL) Workshop at ICAPS 2024
PLANTS: A Novel Problem and Dataset for Summarization of Planning-Like (PL) Tasks
Preprint, May 2024