How to fine-tune the TabPFN model? #8

Open
setipsh opened this issue Jan 13, 2025 · 2 comments

Comments

setipsh commented Jan 13, 2025

I am using TabPFN for a classification task and would like to fine-tune the pre-trained model to better adapt it to my specific dataset. I have reviewed the documentation and code, but I still have questions about the concrete steps and parameter settings:

- Does fine-tuning TabPFN involve gradient updates?
- How many training epochs are needed?
- Which parameters matter most during fine-tuning?

fif911 (Contributor) commented Feb 2, 2025

Hey, did you find any details? I'm also looking into this now.

LennartPurucker (Collaborator) commented Feb 2, 2025

Heyho, sharing a discussion from Discord (link):

Hi everyone, there are two different things that both can be called fine-tuning:
(1) Fine-tuning TabPFN to do better on one or more datasets in order to generalize better to other, similar datasets. The analogy for LLMs would be fine-tuning GPT/Llama/Mistral to your own data.
(2) Fine-tuning TabPFN to a single dataset, in order to improve its performance (e.g., to tackle large datasets that don't fit into memory). This doesn't have an analogy in LLMs but is specific to tabular data.

For (2), I already created some code, see https://github.com/LennartPurucker/finetune_tabpfn_v2 for more.
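To make case (2) concrete for the original question ("does fine-tuning involve gradient updates?"): yes, it is an ordinary gradient-descent training loop over a loss on your dataset. Below is a minimal stand-in sketch of those mechanics (epochs, learning rate, gradient steps) using a toy logistic-regression model in plain Python. The data, starting weights, learning rate, and epoch count here are all invented for illustration; an actual TabPFN fine-tuning run would update the transformer's weights with PyTorch, e.g. via the finetune_tabpfn_v2 repository above.

```python
# Illustration only: fine-tuning = repeated gradient updates on a loss.
# A tiny logistic-regression "model" stands in for TabPFN so the mechanics
# (epochs, learning rate, gradient steps) are visible without PyTorch.
import math
import random

random.seed(0)

# Toy binary-classification data: label is 1 when x0 + x1 > 1.
X = [[random.random(), random.random()] for _ in range(200)]
y = [1.0 if x[0] + x[1] > 1.0 else 0.0 for x in X]

# "Pre-trained" starting weights (stand-in for a loaded checkpoint).
w = [0.1, -0.1]
b = 0.0

def predict(x):
    z = w[0] * x[0] + w[1] * x[1] + b
    return 1.0 / (1.0 + math.exp(-z))  # sigmoid probability of class 1

lr = 0.5      # learning rate: typically the most important knob to tune
epochs = 50   # number of passes over the fine-tuning data

for _ in range(epochs):
    # Full-batch gradient of the cross-entropy loss.
    gw, gb = [0.0, 0.0], 0.0
    for x, t in zip(X, y):
        err = predict(x) - t
        gw[0] += err * x[0]
        gw[1] += err * x[1]
        gb += err
    n = len(X)
    w[0] -= lr * gw[0] / n  # gradient update on each parameter
    w[1] -= lr * gw[1] / n
    b -= lr * gb / n

accuracy = sum((predict(x) > 0.5) == (t > 0.5) for x, t in zip(X, y)) / len(X)
print(f"accuracy after fine-tuning: {accuracy:.2f}")
```

The same loop shape applies when the model is TabPFN itself: swap the toy model for the transformer, the hand-written gradients for autograd, and full-batch descent for mini-batch Adam; learning rate and number of epochs remain the settings that matter most.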
