Homepage of Jonathan Balloch, Ph.D.

Hi, I'm Jonathan!

I develop ways for interactive AI agents to perceive, understand, and adapt to our ever-changing world

I recently completed my Ph.D. where my research focused primarily on developing methods for enabling AI-enabled agents to respond to unexpected changes in their environment. Using methods like reinforcement learning, large multimodal generative modals, and training in simulation, my work improves the adaptability and robustness of agent performance in the presence of such changes. Previously, I was a graduate student at the Georgia Institute of Technology in Atlanta, GA, earning my PhD in Robotics from the College of Computing. I did my PhD research in the Entertainment Intelligence and Human-Centered AI (EI+HCAI) Labs at Georgia Tech under Dr. Mark Riedl (check out his twitter!). I also collaborated in my Ph.D. with Professors Irfan Essa, Sonia Chernova, Zsolt Kira on other ways of making machine learning and autonomy adaptable to change.

Before my Ph.D. I was a Robotics Engineer at Intelligent Automation, Inc. in Rockville, MD. There I focused on finding autonomous vehicle and smart device solutions under DoD research grants, specializing in creating sliding autonomy systems for teleoperators to interactively teach robots tasks and using computer vision to improve teleoperation of field robots. I earned my Masters Degree in Robotics from the University of Pennsylvania in Philadelphia, PA, and my Bachelors Degree in Physics and Mathematics from Georgetown University in Washington, D.C. For more details, check out my C.V.!

Sometimes I let this website get out of date. For the most up-to-date professional information, check out my LinkedIn.

Highlighted Projects

[NEW] NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty

[Long Oral] AAAI 2022 Spring Symposium on Designing Artificial Intelligence for Open Worlds

Paper
Code

A robust body of reinforcement learning techniques have been developed to solve complex sequential decision making problems. However, these methods assume that train and evaluation tasks come from similarly or identically distributed environments. This assumption does not hold in real life where small novel changes to the environment can make a previously learned policy fail or introduce simpler solutions that might never be found. To that end we explore the concept of {\em novelty}, defined in this work as the sudden change to the mechanics or properties of environment. We provide an ontology of for novelties most relevant to sequential decision making, which distinguishes between novelties that affect objects versus actions, unary properties versus non-unary relations, and the distribution of solutions to a task. We introduce NovGrid, a novelty generation framework built on MiniGrid, acting as a toolkit for rapidly developing and evaluating novelty-adaptation-enabled reinforcement learning techniques. Along with the core NovGrid we provide exemplar novelties aligned with our ontology and instantiate them as novelty templates that can be applied to many MiniGrid-compliant environments. Finally, we present a set of metrics built into our framework for the evaluation of novelty-adaptation-enabled machine-learning techniques, and show characteristics of a baseline RL model using these metrics.

Memory-Efficient Semi-Supervised Continual Learning: The World is its Own Replay Buffer

[Short Oral] 2021 International Joint Conference on Neural Networks (IJCNN)

Paper
Code

Rehearsal is a critical component for class-incremental continual learning, yet it requires a substantial memory budget. Our work investigates whether we can significantly reduce this memory budget by leveraging unlabeled data from an agent's environment in a realistic and challenging continual learning paradigm. Specifically, we explore and formalize a novel semi-supervised continual learning (SSCL) setting, where labeled data is scarce yet non-i.i.d. unlabeled data from the agent's environment is plentiful. Importantly, data distributions in the SSCL setting are realistic and therefore reflect object class correlations between, and among, the labeled and unlabeled data distributions. We show that a strategy built on pseudo-labeling, consistency regularization, Out-of-Distribution (OoD) detection, and knowledge distillation reduces forgetting in this setting. Our approach, DistillMatch, increases performance over the state-of-the-art by no less than 8.7% average task accuracy and up to 54.5% average task accuracy in SSCL CIFAR-100 experiments. Moreover, we demonstrate that DistillMatch can save up to 0.23 stored images per processed unlabeled image compared to the next best method which only saves 0.08. Our results suggest that focusing on realistic correlated distributions is a significantly new perspective, which accentuates the importance of leveraging the world's structure as a continual learning strategy.

Tool Macgyvering: Tool Construction Using Geometric Reasoning

2019 International Conference on Robotics and Automation (ICRA)

Paper

MacGyvering is defined as creating or repairing something in an inventive or improvised way by utilizing objects that are available at hand. In this paper, we explore a subset of Macgyvering problems involving tool construction, i.e., creating tools from parts available in the environment. We formalize the overall problem domain of tool Macgyvering, introducing three levels of complexity for tool construction and substitution problems, and presenting a novel computational framework aimed at solving one level of the tool Macgyvering problem, specifically contributing a novel algorithm for tool construction based on geometric reasoning. We validate our approach by constructing three tools using a 7-DOF robot arm.

Unbiasing Semantic Segmentation For Robot Perception using Synthetic Data

RSS 2017 Workshop on New Frontiers for Deep Learning

Paper
Code

The ability of a robot to reason about the geometry and semantics of its environment is fundamental to interactive robot behaviors, but often challenging due to perception frameworks that are trained on too little data or data not representative of the robots environment. In this work, we investigate the potential gains of using synthetic data to augment the training process of convolutional neural networks designed to enable real-time semantic segmentation for robots with limited real-world training data. We investigate the degree to which a larger amounts of data improves performance when training such a model, the relationship between the way a deep neural network is trained using multiple sources of synthetic segmentation data to pretrain standard segmentation datasets that apply to robotics and autonomous driving, and show that our method outperforms both training from scratch and standard data augmentation practices like pretraining on ImageNet. We show that synthetic data does continue to improve these models in spite of real-time model architectures having many fewer parameters than typical deep neural networks, and therefore hypothetically less representational power. Finally, we show how this approach generalizes to small purpose-built robot vision datasets on data acquired using an HRI robot.

Contact Me

If you have any questions for me please feel free to e-mail me at jon.balloch@gmail.com.

Recent News