making parallel most part of the featurwize #128

reza1615 · 2025-01-29T14:54:24Z

My dataset has 4500 features and for selecting features the initial part before xgboost takes around 1 hour to run. I realized most part of the featurwize is not parallel is it possible to make them parallel?

AutoViML · 2025-01-29T15:12:08Z

Wherever I could use n_jobs=-1 I have used it. Other than that, I have not used multithreading which is now available in XGBoost. This could be something to think about.
Ram

reza1615 · 2025-01-29T16:06:58Z

we should inspect each part of the pipeline regarding to time to see how we can reduce the timing by vectorize or parallelize with Joblib or similar libraries

AutoViML · 2025-01-30T01:19:40Z

yes that is doable but I don't have the time. If you or anyone is interested I can help out.
Ram

AutoViML closed this as completed Feb 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

making parallel most part of the featurwize #128

making parallel most part of the featurwize #128

reza1615 commented Jan 29, 2025

AutoViML commented Jan 29, 2025

reza1615 commented Jan 29, 2025 •

edited

Loading

AutoViML commented Jan 30, 2025

making parallel most part of the featurwize #128

making parallel most part of the featurwize #128

Comments

reza1615 commented Jan 29, 2025

AutoViML commented Jan 29, 2025

reza1615 commented Jan 29, 2025 • edited Loading

AutoViML commented Jan 30, 2025

reza1615 commented Jan 29, 2025 •

edited

Loading