University of Cambridge > Talks.cam > Machine Learning Reading Group @ CUED > Machine Learning on Tabular Data

Machine Learning on Tabular Data

Download to your calendar using vCal

If you have a question about this talk, please contact Xianda Sun .

Data in tabular form is ubiquitous in industries such as finance, healthcare, and education. Historically, boosted decision trees and multi-layer perceptrons were the models of choice for making predictions on tabular data. In the last couple of years, neural process-based transformers (e.g. TabPFN, TabICL) that are trained on synthetic data have surpassed traditional approaches in terms of speed and accuracy. Well-funded start-ups have recently exploited these advances offering easy to use tools aimed at enterprises. In this talk we will 1) introduce the basic concepts behind tabular machine learning; 2) describe traditional tabular learning approaches including XGBoost, CatBoost, and RealMLP; 3) do deep dives on TabPFN and TabICL; 4) examine the use of LLMs for making tabular predictions; and finally 5) discuss some recent work on casual tabular foundation models.

This talk is part of the Machine Learning Reading Group @ CUED series.

This talk is included in these lists:

Note that ex-directory lists are not shown.

 

Š 2006-2025 Talks.cam, University of Cambridge. Contact Us | Help and Documentation | Privacy and Publicity