VICAUSE: Simultaneous missing value imputation and causal discovery
- Pablo Morales-Alvarez
- Angus Lamb
- Simon Woodhead
- Simon Peyton Jones
- Miltos Allamanis
- Cheng Zhang
MSR-TR-2021-14
Published at the ICML 2021 Workshop on the Neglected Assumptions in Causal Inference
Missing values constitute an important challenge in real-world machine learning for both prediction and causal discovery tasks. However, only a few causal discovery methods can handle missing data efficiently, while existing imputation methods are agnostic to causality. In this work, we propose VICAUSE, a novel approach that tackles missing value imputation and causal discovery simultaneously and efficiently with deep learning. In particular, we propose a generative model with a structured latent space and a graph neural network-based architecture that scales to a large number of variables. Moreover, our method can discover relationships between groups of variables, which is useful in many real-world applications. VICAUSE shows improved performance compared to popular and recent approaches in both missing value imputation and causal discovery.