
[WIP][New Model] PerpetualBoosting #170

Closed
LennartPurucker wants to merge 2 commits into autogluon:main from LennartPurucker:tabarena_perpetual

Conversation

@LennartPurucker
Collaborator

LennartPurucker commented Jun 25, 2025

This PR adds the PerpetualBooster model (https://github.com/perpetual-ml/perpetual).
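For reference, a minimal sketch of how the model is fit, following the usage example in the perpetual README; the call signature is an assumption based on that README and may differ from the version benchmarked here, and the dataset is purely illustrative.

```python
# Minimal PerpetualBooster usage, adapted from the example in the
# perpetual README (https://github.com/perpetual-ml/perpetual).
# Assumption: fit(X, y, budget=...) matches the README API.
from perpetual import PerpetualBooster
from sklearn.datasets import load_breast_cancer

X, y = load_breast_cancer(return_X_y=True, as_frame=True)

model = PerpetualBooster(objective="LogLoss")  # binary classification loss
model.fit(X, y, budget=1.0)  # budget controls how hard the booster trains

preds = model.predict(X)
```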

Benchmark TabArena-Full Results

[Image: TabArena-Full benchmark results for PerpetualBoosting]

All results: perpetual_boosting.zip
Raw results: https://data.lennart-purucker.com/tabarena/data_PerpetualBoosting.zip

Notes

  • I only evaluated the default config; the integration already has a lot of open TODOs to resolve before we should try HPO.
  • The code has a few problems with memory management, to the point that I could not get one dataset with many categorical features to run at all. I therefore had to impute that dataset (see the LB impute column); a sketch of the workaround follows this list.
  • Another big problem is that one cannot add a callback for early stopping on external validation data.
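For context, the imputation workaround for the categorical-heavy dataset looked roughly like the sketch below. This is not the exact TabArena preprocessing; the sentinel value and the OrdinalEncoder choice are illustrative assumptions.

```python
# Hypothetical sketch of the imputation workaround: fill missing
# categorical values with a sentinel and ordinal-encode, so the booster
# only ever sees numeric columns. Not the exact TabArena code.
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder

def impute_and_encode(X: pd.DataFrame) -> pd.DataFrame:
    X = X.copy()
    cat_cols = X.select_dtypes(include=["object", "category"]).columns
    # Explicit sentinel category for missing values.
    X[cat_cols] = X[cat_cols].astype("object").fillna("__missing__")
    # Ordinal-encode; values unseen at fit time map to -1.
    enc = OrdinalEncoder(handle_unknown="use_encoded_value", unknown_value=-1)
    X[cat_cols] = enc.fit_transform(X[cat_cols])
    return X
```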

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

@LennartPurucker
Collaborator Author

The authors of PerpetualBoosting have closed the integration issue on their side, so it is unlikely that the model will receive the updates and author-verified integration that are required.

Moreover, as the initial results do not seem promising enough, I am not willing to invest more time into fixing the integration problems myself in the near future.

How should we proceed? @Innixma @dholzmueller
Should we still include these results and add some kind of additional tag or column to indicate clearly that this is even more experimental/unverified than our other unverified results?

@Innixma
Collaborator

Innixma commented Nov 29, 2025

@LennartPurucker Up to you. IMO we could keep a list somewhere of models whose integration was never finished, with a link to the GitHub PR/issue? That way we don't need to put the actual results themselves in the codebase. I'd prefer that any results visible in the codebase pass some level of quality bar.

@LennartPurucker
Collaborator Author

I have updated the PR to the current state of TabArena and incorporated the author's feedback from their issue.

Some of the bugs have not been addressed, but maybe it is fine now. I will rerun the benchmark, and then we can merge this model IMO.

@deadsoul44

deadsoul44 commented Jan 17, 2026

Hi,

I am the author of PerpetualBooster. I have a couple of comments:

  1. You can rename PerpetualBoosting to PerpetualBooster.
  2. The budget parameter is not meant to be tuned, but for the purpose of getting the best result you can treat it as a hyperparameter. Try running the benchmark with 0.5, 1.0, 1.5, and 2.0 and take the best result on the validation set (see the sketch after this list).
  3. PerpetualBooster doesn't need a separate set for early stopping. The algorithm stops itself when it doesn't see any performance gain. You can train on all of the data for the final model.
  4. Increase iteration_limit to 10,000.

Let me know if you need any detail about the algorithm.
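A minimal sketch combining points (2)–(4), assuming the README-style Python API; regression with SquaredLoss stands in for whichever task is benchmarked, and iteration_limit as a constructor argument is inferred from this thread rather than verified against the docs:

```python
# Sweep budget on a validation split, then refit on all data. No separate
# early-stopping set is needed: the booster stops on its own once it sees
# no further performance gain (point 3 above).
import pandas as pd
from perpetual import PerpetualBooster
from sklearn.datasets import make_regression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split

X, y = make_regression(n_samples=2000, n_features=20, random_state=0)
X = pd.DataFrame(X, columns=[f"f{i}" for i in range(X.shape[1])])
X_tr, X_val, y_tr, y_val = train_test_split(X, y, random_state=0)

best_budget, best_mse = None, float("inf")
for budget in (0.5, 1.0, 1.5, 2.0):
    # Assumption: iteration_limit is a constructor argument (point 4).
    model = PerpetualBooster(objective="SquaredLoss", iteration_limit=10_000)
    model.fit(X_tr, y_tr, budget=budget)
    mse = mean_squared_error(y_val, model.predict(X_val))
    if mse < best_mse:
        best_budget, best_mse = budget, mse

# Final fit on all of the data with the selected budget.
final = PerpetualBooster(objective="SquaredLoss", iteration_limit=10_000)
final.fit(X, y, budget=best_budget)
```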

@LennartPurucker
Collaborator Author

Heyho @deadsoul44,

Great to see you here!

To clarify: the code already implements (2) and (3).
I will add (4) again, thanks! I will rename the model as well.

@LennartPurucker
Collaborator Author

It seems my CPU compute was put on hold for ~two weeks, but afterward, we will have new and better hardware. Sorry for the delay!

@deadsoul44

v1.1.2 has been released with improved performance and numerical stability. The benchmarked version can be updated.

@LennartPurucker
Collaborator Author

@deadsoul44 Sounds great, I will also check the newest version once I start the benchmarks, thank you!

@deadsoul44

[Image: screenshot of the TabArena leaderboard's dataset-size tabs]

Do you have any plans to add a "Large" option to the All Dataset section? That option could include datasets with more than 1,000,000 samples.

@LennartPurucker
Collaborator Author

LennartPurucker commented Feb 2, 2026

@deadsoul44, yes, very much! The next version of TabArena will include datasets with more than 250k samples.
However, so far we have no datasets larger than 250k, so we can only have a medium tab.

Note that we did not filter out large datasets; our first curation round for TabArena-v0.1 simply did not find any larger datasets that followed our curation rules, sadly.

LennartPurucker closed this by deleting the head repository Feb 14, 2026