General Latent Feature Modeling for Data Exploration Tasks

    Abstract

    This paper introduces a general Bayesian nonparametric latent feature model suitable for automatic exploratory analysis of heterogeneous datasets, where the attributes describing each object can be discrete, continuous, or mixed variables. The proposed model has several important properties. First, it accounts for heterogeneous data while allowing inference in time that is linear in the number of objects and attributes. Second, its Bayesian nonparametric nature allows us to automatically infer the model complexity from the data, i.e., the number of features necessary to capture the latent structure in the data. Third, the latent features in the model are binary-valued variables, which makes the obtained latent features easier to interpret in data exploration tasks.
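
    To make the second and third properties concrete, the sketch below draws a binary feature matrix whose number of columns is not fixed in advance. The abstract does not name the prior it uses; this example assumes an Indian Buffet Process, a standard Bayesian nonparametric prior over binary matrices. The function name `sample_ibp` and the parameters `num_objects` and `alpha` are hypothetical, chosen for illustration. It only samples from a prior and is not the paper's inference algorithm, nor does it model the heterogeneous observations.

```python
import numpy as np


def sample_ibp(num_objects, alpha, rng):
    """Draw a binary feature matrix Z from an Indian Buffet Process prior.

    Assumption: the IBP stands in for the (unnamed) prior in the abstract.
    Rows are objects, columns are latent features, entries are 0 or 1.
    """
    counts = []  # counts[k] = number of objects that own feature k so far
    rows = []
    for i in range(1, num_objects + 1):
        # Object i takes existing feature k with probability counts[k] / i.
        row = [int(rng.random() < m / i) for m in counts]
        for k, z in enumerate(row):
            counts[k] += z
        # Object i also introduces Poisson(alpha / i) brand-new features.
        num_new = int(rng.poisson(alpha / i))
        row.extend([1] * num_new)
        counts.extend([1] * num_new)
        rows.append(row)

    # Pad earlier rows with zeros for features that appeared later.
    Z = np.zeros((num_objects, len(counts)), dtype=int)
    for i, row in enumerate(rows):
        Z[i, :len(row)] = row
    return Z


rng = np.random.default_rng(0)
Z = sample_ibp(num_objects=10, alpha=2.0, rng=rng)
print(Z.shape)  # (10, K), where K is random rather than chosen by the user
print(Z)
```

    Each row of `Z` can be read directly as the set of features an object has, which is the interpretability argument the abstract makes for binary-valued features. In this sketch, a larger `alpha` tends to produce more columns.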

    Authors

    Isabel Valera, Melanie F. Pradier, Zoubin Ghahramani

    Conference

    ICML Workshop on Human Interpretability in Machine Learning

    Full Paper

    ‘General Latent Feature Modeling for Data Exploration Tasks’ (PDF)

    Uber AI

    Zoubin Ghahramani
    Zoubin Ghahramani is Chief Scientist of Uber and a world leader in the field of machine learning, significantly advancing the state of the art in algorithms that can learn from data. He is known in particular for fundamental contributions to probabilistic modeling and Bayesian approaches to machine learning systems and AI. Zoubin also maintains his roles as Professor of Information Engineering at the University of Cambridge and Deputy Director of the Leverhulme Centre for the Future of Intelligence. He was one of the founding directors of the Alan Turing Institute (the UK's national institute for Data Science and AI), and is a Fellow of St John's College, Cambridge, and of the Royal Society.