Projects
A rather comprehensive list of projects that I have worked on, grouped by area of interest.
Learning, generalization, and domain adaptation
Invariance Principle Meets Information Bottleneck for Out-of-Distribution Generalization We prove that using the "information bottleneck" along with invariance helps address key failures of IRM. We propose an approach that incorporates both of these principles and demonstrate its effectiveness.
Lead: Kartik Ahuja
In Search of Robust Measures of Generalization We look into the experimental evaluation of generalization measures for neural networks. We argue that generalization measures should be evaluated within the framework of distributional robustness and provide methodology and experimental results on a variety of architectures.
Lead: Karolina Dziugaite, Alexandre Drouin (ServiceNow/ElementAI)
A Modern Take on the Bias-Variance Tradeoff in Neural Networks We measure prediction bias and variance in NNs. Both bias and variance decrease as the number of parameters grows. We decompose variance into variance due to sampling and variance due to initialization.
Lead: Brady Neal
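For context, here is one way to write the split via the law of total variance, with S the training sample and θ the initialization (schematic notation I am adding here; see the paper for the exact estimators and conditioning used):

```latex
\operatorname{Var}_{S,\theta}\!\bigl[f_{S,\theta}(x)\bigr]
  \;=\; \underbrace{\mathbb{E}_{S}\Bigl[\operatorname{Var}_{\theta}\bigl(f_{S,\theta}(x)\mid S\bigr)\Bigr]}_{\text{variance due to initialization}}
  \;+\; \underbrace{\operatorname{Var}_{S}\Bigl(\mathbb{E}_{\theta}\bigl[f_{S,\theta}(x)\mid S\bigr]\Bigr)}_{\text{variance due to sampling}}
```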
Generalizing to unseen domains via distribution matching We propose a process that enforces pair-wise domain invariance while training a feature extractor over a diverse set of domains. We show that this process ensures invariance to any distribution that can be expressed as a mixture of the training domains.
Lead: Isabela Albuquerque, João Monteiro (INRS)
In Support of Over-Parametrization in Deep RL There is significant recent evidence in supervised learning that, in the over-parametrized setting, wider networks achieve lower test error. We experiment on four OpenAI Gym tasks and provide evidence that over-parametrization is also beneficial in deep RL.
Lead: Brady Neal
Connections between max margin classifiers and gradient penalties Maximum-margin classifiers can be formulated as Integral Probability Metrics (IPMs) or classifiers with some form of gradient norm penalty. This implies a direct link to a class of Generative adversarial networks (GANs) which penalize a gradient norm.
Lead: Alexia Jolicoeur-Martineau
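To fix ideas, one well-known member of that gradient-penalty class is the WGAN-GP critic objective, which penalizes deviations of the critic's gradient norm from 1 (standard formulation, shown only as an example of the class, not the paper's derivation):

```latex
\max_{D} \; \mathbb{E}_{x \sim p_{\mathrm{data}}}\bigl[D(x)\bigr]
  - \mathbb{E}_{\tilde{x} \sim p_{g}}\bigl[D(\tilde{x})\bigr]
  - \lambda\, \mathbb{E}_{\hat{x}}\Bigl[\bigl(\lVert \nabla_{\hat{x}} D(\hat{x}) \rVert_2 - 1\bigr)^{2}\Bigr]
```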
Differentiable games
Consensus Optimization for Smooth Games: Convergence Analysis under Expected Co-coercivity We introduce expected co-coercivity and provide the first last-iterate convergence guarantees of SGDA and SCO for a class of non-monotone stochastic variational inequality problems.
Lead: Nicolas Loizou
LEAD: Least-Action Dynamics for Min-Max Optimization We leverage tools from physics to introduce LEAD (Least-Action Dynamics), a second-order optimizer for min-max games.
Lead: Reyhane Askari, Amartya Mitra
Stochastic Hamiltonian gradient methods for smooth games We provide the first complete analysis of stochastic Hamiltonian gradient descent with a decreasing step size, a method successful in training GANs. We also present the first stochastic variance-reduced Hamiltonian method.
Lead: Nicolas Loizou
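For readers new to the Hamiltonian viewpoint, here is a minimal full-batch illustration (mine, not code from the paper) on a toy bilinear game: instead of following the gradient field v, one descends H = ½‖v‖². The dimensions, seed, and step size are arbitrary, and the stochastic and variance-reduced aspects that the paper actually analyzes are not reproduced.

```python
import numpy as np

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 3))          # bilinear game f(x, y) = x^T A y
x, y = rng.standard_normal(3), rng.standard_normal(3)

def hamiltonian_grad(x, y):
    # H(x, y) = 0.5 * ||v(x, y)||^2 with v = (A y, -A^T x);
    # for this game its gradient works out to (A A^T x, A^T A y).
    return A @ (A.T @ x), A.T @ (A @ y)

eta = 0.5 / np.linalg.norm(A, 2) ** 2    # step size below 1/L for the quadratic H
for _ in range(2000):
    gx, gy = hamiltonian_grad(x, y)
    x, y = x - eta * gx, y - eta * gy    # descend H instead of following v

v_norm = np.sqrt(np.linalg.norm(A @ y) ** 2 + np.linalg.norm(A.T @ x) ** 2)
print("||v|| at the final iterate:", v_norm)   # decreases toward 0, the equilibrium
```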
Accelerating Smooth Games by Manipulating Spectral Shapes We use matrix iteration theory to characterize acceleration in smooth games. The spectral shape of a family of games is the set containing all eigenvalues of the Jacobians of standard gradient dynamics in the family.
Lead: Waiss Azizian
Linear Lower Bounds and Conditioning of Differentiable Games We approach the question of fundamental iteration complexity for smooth, differentiable games by providing lower bounds to complement the linear (i.e. geometric) upper bounds observed in the literature on a wide class of problems.
Lead: Adam Ibrahim
A Unified Analysis of Gradient Methods for a Whole Spectrum of Games We provide new analyses of the local and global convergence properties of the extragradient method (EG) and tighter rates for optimistic gradient and consensus optimization. Unlike in convex minimization, EG can be much faster than gradient descent.
Lead: Waiss Azizian
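To make the gap concrete, here is a toy comparison (mine, not code from the paper) on the bilinear game f(x, y) = x·y, where plain simultaneous gradient descent-ascent spirals outward while the extragradient update contracts; the step size and iteration count are arbitrary.

```python
import numpy as np

eta, steps = 0.3, 100
z_gd = np.array([1.0, 1.0])    # (x, y) iterate for gradient descent-ascent
z_eg = np.array([1.0, 1.0])    # (x, y) iterate for extragradient

def F(z):
    # Simultaneous gradient field of f(x, y) = x * y: (df/dx, -df/dy) = (y, -x).
    return np.array([z[1], -z[0]])

for _ in range(steps):
    z_gd = z_gd - eta * F(z_gd)    # GDA step: modulus sqrt(1 + eta^2) > 1, so it diverges
    z_half = z_eg - eta * F(z_eg)  # EG extrapolation step
    z_eg = z_eg - eta * F(z_half)  # EG update at the extrapolated point; contracts for eta < 1

print("GDA distance to the equilibrium:", np.linalg.norm(z_gd))
print("EG  distance to the equilibrium:", np.linalg.norm(z_eg))
```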
Negative Momentum for Improved Game Dynamics Alternating updates are more stable than simultaneous updates on simple games. A negative momentum term achieves convergence not only on a difficult toy adversarial problem, but also on the notoriously difficult-to-train saturating GANs.
Lead: Gauthier Gidel, Reyhane Askari-Hemmat
Generative models
Gotta Go Fast When Generating Data with Score-Based Models We borrow tools and ideas from the stochastic differential equation literature to devise a more efficient solver for score-based diffusion generative models. Our approach generates data 2 to 10 times faster than Euler-Maruyama (EM) while achieving better or equal sample quality.
Lead: Alexia Jolicoeur-Martineau
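For context, the Euler-Maruyama (EM) baseline is the simplest fixed-step SDE discretization. The sketch below is mine and shows generic EM on a toy Ornstein-Uhlenbeck process, not the reverse-diffusion SDE or the improved solver from the paper.

```python
import numpy as np

def euler_maruyama(drift, diffusion, x0, t0, t1, n_steps, rng):
    """Fixed-step Euler-Maruyama for dx = drift(x, t) dt + diffusion(t) dW."""
    x, t = np.array(x0, dtype=float), t0
    dt = (t1 - t0) / n_steps
    for _ in range(n_steps):
        dw = np.sqrt(dt) * rng.standard_normal(x.shape)   # Brownian increment
        x = x + drift(x, t) * dt + diffusion(t) * dw
        t += dt
    return x

rng = np.random.default_rng(0)
# Toy Ornstein-Uhlenbeck process dx = -x dt + 0.5 dW, integrated from t = 0 to t = 1.
x1 = euler_maruyama(lambda x, t: -x, lambda t: 0.5,
                    x0=[2.0], t0=0.0, t1=1.0, n_steps=1000, rng=rng)
print(x1)
```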
Adversarial score matching and sampling for image generation We dig into recently proposed deep generative methods based on denoising score matching and annealed Langevin sampling (DSM-ALS). We identify two weaknesses in the existing methodology and address them to provide state-of-the-art generative performance.
Lead: Alexia Jolicoeur-Martineau
Multi-objective training of GANs with multiple discriminators We study GANs with multiple discriminators by framing them as a multi-objective optimization problem. Our results indicate that hypervolume maximization presents a better compromise between sample quality and computational cost than previous methods.
Lead: Isabela Albuquerque, João Monteiro (INRS)
Optimization and numerical analysis
Implicit Regularization with Feedback Alignment We analyze feedback alignment and study incremental learning phenomena for linear networks. Interestingly, certain initializations imply that negligible components are learned before the principal ones, a phenomenon we classify as implicit anti-regularization.
Lead: Manuela Girotti
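For readers unfamiliar with feedback alignment, the sketch below (mine, with arbitrary toy dimensions) contrasts the backprop gradient of the first layer of a two-layer linear network with the feedback-alignment surrogate, which routes the output error through a fixed random matrix instead of the transposed forward weights.

```python
import numpy as np

rng = np.random.default_rng(0)
n, d_in, d_h, d_out = 128, 20, 10, 1
X = rng.standard_normal((n, d_in))
T = X @ rng.standard_normal((d_in, d_out))     # toy linear-regression targets

W1 = 0.1 * rng.standard_normal((d_in, d_h))    # first layer of a linear network
W2 = 0.1 * rng.standard_normal((d_h, d_out))   # second layer
B = rng.standard_normal((d_out, d_h))          # fixed random feedback matrix

H = X @ W1                                     # hidden activations
E = H @ W2 - T                                 # output error

grad_W1_bp = X.T @ (E @ W2.T) / n              # backprop: error routed through W2^T
grad_W1_fa = X.T @ (E @ B) / n                 # feedback alignment: error routed through B

# At a random initialization this cosine is essentially arbitrary; the paper is about
# how the learning dynamics under the FA rule differ from those of backprop.
cos = np.sum(grad_W1_bp * grad_W1_fa) / (
    np.linalg.norm(grad_W1_bp) * np.linalg.norm(grad_W1_fa))
print("cosine between the FA and BP first-layer updates:", cos)
```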
A Study of Condition Numbers for First-Order Optimization Condition numbers are not continuous! (Seriously, this wreaks havoc with tuning.) We perform a comprehensive study of alternative metrics, which we prove to be continuous. Finally, we discuss how our work impacts the theoretical understanding of first-order algorithms and their performance.
Lead: Charles Guille-Escuret, Baptiste Goujaud
Reducing variance in online optimization by transporting past gradients Implicit gradient transport turns past gradients into gradients evaluated at the current iterate. It reduces the variance in online optimization and can be used as a drop-in replacement for the gradient estimate in a number of well-understood methods such as heavy ball or Adam.
Lead: Sebastien Arnold
YellowFin: Self-tuning optimization for deep learning Simple insights on the momentum update yield a very efficient algorithm that performs well across networks and datasets without the need to tune any parameters.
Lead: Jian Zhang
Accelerated stochastic power iteration Exciting recent results on how adding a momentum term to the power iteration yields a numerically stable, accelerated method.
Lead: Peng Xu
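The deterministic core of the method is a power step with a Polyak-style momentum term, x_{t+1} = A x_t - β x_{t-1}. Below is a small sketch (mine) of that deterministic recursion with β chosen near (λ₂/2)²; the stochastic, mini-batch analysis that is the paper's actual contribution is not reproduced.

```python
import numpy as np

rng = np.random.default_rng(0)
# Symmetric PSD test matrix with a known spectrum: top eigenvalue 1.0, the rest below ~0.9.
eigs = np.linspace(0.1, 0.9, 50); eigs[-1] = 1.0
Q, _ = np.linalg.qr(rng.standard_normal((50, 50)))
A = (Q * eigs) @ Q.T                      # A = Q diag(eigs) Q^T

beta = (eigs[-2] / 2) ** 2                # momentum near lambda_2^2 / 4
x_prev = np.zeros(50)
x = rng.standard_normal(50); x /= np.linalg.norm(x)

for _ in range(100):
    x_next = A @ x - beta * x_prev        # power step with a momentum term
    scale = np.linalg.norm(x_next)
    x, x_prev = x_next / scale, x / scale # rescale the whole state to avoid over/underflow

print("alignment with the top eigenvector:", abs(x @ Q[:, -1]))
```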
Asynchrony begets momentum When training large-scale systems asynchronously, you get a momentum surprise. We prove that system dynamics "bleed" into the algorithm, introducing a momentum term even when the algorithm uses none. This theoretical result has significant implications for large-scale optimization systems.
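Schematically, the claim is that the expected trajectory of asynchronous SGD follows a heavy-ball (momentum) recursion even though the algorithm uses no explicit momentum; in the form below, the implicit momentum μ is induced by the delays in the system (my schematic rendering; see the paper for the precise statement and the value of μ):

```latex
\mathbb{E}\bigl[x_{t+1} - x_{t}\bigr]
  \;=\; \mu\,\mathbb{E}\bigl[x_{t} - x_{t-1}\bigr]
  \;-\; \eta\,\mathbb{E}\bigl[\nabla f(x_{t})\bigr]
```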
Parallel SGD: When does averaging help? Averaging as a variance-reducing mechanism. For convex objectives, we show the benefit of frequent averaging depends on the gradient variance envelope. For non-convex objectives, we illustrate that this benefit depends on the presence of multiple optimal points.
Lead: Jian Zhang
Memory Limited, Streaming PCA An algorithm that uses O(kp) memory and computes the k-dimensional spike with quasi-optimal, O(p log p), sample complexity -- the first algorithm of its kind.
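Here is a simplified illustration (mine, not the paper's algorithm verbatim) of the block power method behind this result on spiked-covariance data, keeping only an O(kp) iterate in memory; the block size, noise level, and constants are arbitrary, whereas the paper's choices are what yield the O(p log p) sample-complexity guarantee.

```python
import numpy as np

rng = np.random.default_rng(0)
p, k, block, n_blocks, noise = 100, 1, 1000, 10, 0.3
u = rng.standard_normal(p); u /= np.linalg.norm(u)    # planted spike direction

Q, _ = np.linalg.qr(rng.standard_normal((p, k)))      # the only persistent state: O(kp)
for _ in range(n_blocks):
    S = np.zeros((p, k))
    for _ in range(block):
        x = rng.standard_normal() * u + noise * rng.standard_normal(p)  # one streamed sample
        S += np.outer(x, x @ Q)              # rank-k update; the p x p covariance is never formed
    Q, _ = np.linalg.qr(S / block)           # power step with the block's covariance estimate

print("alignment with the planted spike:", abs(u @ Q[:, 0]))
```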
Deep learning and applications
State-Reification Networks We model the distribution of hidden states over the training data and then project test hidden states onto this distribution. This method helps neural nets generalize better and overcome the challenge of achieving robust generalization with adversarial training.
Lead: Alex Lamb; oral presentation at ICML 2019
Manifold Mixup: Better Representations by Interpolating Hidden States Simple regularizer that encourages neural networks to predict less confidently on interpolations of hidden representations. Manifold mixup improves strong baselines in supervised learning, robustness to single-step adversarial attacks, and test log-likelihood.
Lead: Vikas Verma, Alex Lamb
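The mixing step itself is simple: pick a layer, mix pairs of hidden states and their label targets with a Beta-distributed coefficient, and train on the mixed pair. Below is a framework-agnostic sketch (mine, not the authors' implementation); the α value, batch pairing, and layer choice are placeholders.

```python
import numpy as np

def manifold_mixup_batch(h, y_onehot, alpha=2.0, rng=None):
    """Mix hidden states h (batch, dim) and one-hot labels y with lam ~ Beta(alpha, alpha).

    In manifold mixup this mixing is applied at a randomly chosen layer; the mixed
    hidden states are propagated through the remaining layers and the loss is
    computed against the mixed labels."""
    rng = np.random.default_rng() if rng is None else rng
    lam = rng.beta(alpha, alpha)
    perm = rng.permutation(len(h))                   # pair each example with a shuffled partner
    h_mix = lam * h + (1.0 - lam) * h[perm]
    y_mix = lam * y_onehot + (1.0 - lam) * y_onehot[perm]
    return h_mix, y_mix, lam

# Toy usage: a batch of 4 "hidden states" and labels over 3 classes.
rng = np.random.default_rng(0)
h = rng.standard_normal((4, 8))
y = np.eye(3)[[0, 2, 1, 0]]
h_mix, y_mix, lam = manifold_mixup_batch(h, y, rng=rng)
print(lam, y_mix, sep="\n")
```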
Deep Representations and Adversarial Generation of 3D Point Clouds The first AutoEncoder design suited to 3D point cloud data beats the state of the art in reconstruction accuracy. GANs trained in the AE's latent space generate realistic objects from everyday classes.
Lead: Panos Achlioptas
MCMC methods
Large-scale systems
MLSys: The New Frontier of Machine Learning Systems We propose to foster a new systems machine learning research community at the intersection of the traditional systems and ML communities.
Whitepaper
Deep Learning at 15 Petaflops A 15-petaflop deep learning system for solving scientific pattern classification problems on contemporary HPC architectures. We scale to 10,000 nodes by implementing a hybrid synchronous/asynchronous system and carefully optimizing hyperparameters.
Collaboration with NERSC at Lawrence Berkeley Labs and Intel.
Omnivore: Optimizer for multi-device deep learning A high-performance system prototype combining a number of much-needed algorithmic and software optimizations. Importantly, we identify the degree of asynchronous parallelization as a key factor affecting both hardware and statistical efficiency.
Lead: Stefan Hadjis
FrogWild! - Fast PageRank approximations on Graph Engines Using random walks and a simple modification of the GraphLab engine, we obtain a 7x improvement over the state of the art.
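The estimator behind this kind of approximation is simple: run teleporting random walks and read PageRank off the visit frequencies. The plain-Python sketch below is mine and only conveys that estimator; the paper's contribution is doing this at scale inside the GraphLab engine.

```python
import random
from collections import Counter

def approx_pagerank(adj, n_walks=20000, alpha=0.15, seed=0):
    """Monte Carlo PageRank: normalized visit counts of teleporting random walks."""
    rng = random.Random(seed)
    nodes = list(adj)
    visits = Counter()
    for _ in range(n_walks):
        v = rng.choice(nodes)                       # start at a uniformly random node
        while True:
            visits[v] += 1
            if rng.random() < alpha or not adj[v]:  # teleport (or dangling node): end this walk
                break
            v = rng.choice(adj[v])                  # otherwise follow a random out-edge
    total = sum(visits.values())
    return {v: c / total for v, c in visits.items()}

# Tiny example graph given as adjacency lists.
adj = {0: [1, 2], 1: [2], 2: [0], 3: [2]}
print(approx_pagerank(adj))
```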
Finding Dense Subgraphs via Low-Rank Bilinear Optimization Our method searches a low-dimensional space for provably dense subgraphs of graphs with billions of edges. We provide data dependent guarantees on the quality of the solution that depend on the graph spectrum.
Lead: Dimitris Papailiopoulos
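As a taste of the "search in a low-dimensional span" idea, here is a drastically simplified rank-1 heuristic (mine): take the top eigenvector of the adjacency matrix and keep its k heaviest coordinates as a candidate subgraph. The actual method enumerates candidate supports from a low-rank approximation and comes with spectrum-dependent guarantees, none of which this sketch captures.

```python
import numpy as np

def densest_k_rank1(A, k):
    """Rank-1 heuristic: k vertices with the largest weight in the top eigenvector of A."""
    _, eigvecs = np.linalg.eigh(A)                   # eigenvalues in ascending order
    v = np.abs(eigvecs[:, -1])                       # top eigenvector of the adjacency matrix
    support = np.argsort(v)[-k:]                     # k heaviest coordinates as the candidate set
    density = A[np.ix_(support, support)].sum() / k  # average induced degree
    return support, density

# Small random graph example.
rng = np.random.default_rng(0)
A = (rng.random((30, 30)) < 0.2).astype(float)
A = np.triu(A, 1); A = A + A.T                       # symmetric adjacency, no self-loops
print(densest_k_rank1(A, k=8))
```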