Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

When operating a custom dataset, will it work if I just set config.toml? #45

Open
limhasic opened this issue Oct 21, 2024 · 2 comments
Open

Comments

@limhasic
Copy link

  1. Isn't it still not applicable to data such as custom datasets (e.g. credit card fraud detection data)?

  2. It seems to be suitable for tabular data, but is it not suitable for transaction data?

@randydkx
Copy link

randydkx commented Nov 3, 2024

@limhasic hard to do that, I'm recently on the synthesis of fraud detection transactions, I found that tab-DDPM proposed here can only handle samll feature sets. My trans data includes about 400 features, the training is hard and the generation is not satisfactory.

@limhasic
Copy link
Author

limhasic commented Nov 4, 2024

Would it be possible with data with about 60 columns? The data is from https://dacon.io/competitions/official/236297/overview/description, and since I have difficulty using this repository, I am using tabddpm from synthecity.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants