My dataset has 4500 features, and the initial feature-selection step that runs before XGBoost takes around 1 hour. I noticed that most of featurewiz does not run in parallel. Is it possible to parallelize those parts?
Wherever I could use n_jobs=-1, I have used it. Other than that, I have not used the multithreading that is now available in XGBoost. This could be something to think about.
Ram
We should time each part of the pipeline to see where the hour is actually spent, and then reduce it by vectorizing or by parallelizing with Joblib or a similar library.
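As a starting point for that inspection, here is a minimal sketch of timing individual stages and spreading a per-feature computation over workers. Everything in it is an assumption for illustration: `timed`, `column_score`, and `score_columns` are hypothetical names, not featurewiz functions, and the per-column correlation is only a stand-in for whatever the real selection step computes. It uses the standard library's `ThreadPoolExecutor` so that it runs without extra dependencies; threads will not speed up pure-Python arithmetic like this, so treat it as the pattern rather than a benchmark.

```python
import time
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from contextlib import contextmanager

# Accumulated wall-clock seconds per named stage.
timings = defaultdict(float)


@contextmanager
def timed(stage):
    """Add the elapsed time of the enclosed block to timings[stage]."""
    start = time.perf_counter()
    try:
        yield
    finally:
        timings[stage] += time.perf_counter() - start


def column_score(column, target):
    """Hypothetical per-feature score: absolute Pearson correlation with target."""
    n = len(column)
    mean_x = sum(column) / n
    mean_y = sum(target) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(column, target))
    var_x = sum((x - mean_x) ** 2 for x in column)
    var_y = sum((y - mean_y) ** 2 for y in target)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no signal
    return abs(cov) / (var_x * var_y) ** 0.5


def score_columns(columns, target, workers=4):
    """Score every column independently; results keep the input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda col: column_score(col, target), columns))


if __name__ == "__main__":
    # Toy data standing in for the real feature matrix.
    columns = [[1, 2, 3, 4], [4, 3, 2, 1], [1, 1, 1, 1]]
    target = [2, 4, 6, 8]

    with timed("score_columns"):
        scores = score_columns(columns, target)

    print(scores)
    for stage, seconds in timings.items():
        print(f"{stage}: {seconds:.6f}s")
```

Wrapping each stage of the real pipeline in `timed(...)` would show which steps dominate the runtime before anyone rewrites them. Since each column is scored independently, the same loop could be written with Joblib as `Parallel(n_jobs=-1)(delayed(column_score)(col, target) for col in columns)`, which uses worker processes by default and so can help with CPU-bound Python code.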