-
Notifications
You must be signed in to change notification settings - Fork 18
Home
Welcome to the hcaptcha-model-factory wiki!
This project is about 🏗 hCAPTCHA binary classification model factory.
If this project is hopeful for you, please leave a ⭐star~!
Image recognazation as a most common captcha category was provided by many captcha service like hCaptcha and reCaptcha. But it's can easyly be solved by deep learning. Collect and label data is the only thing you need to do.
Any image recognazation task can be regarded as a binary classification task for now. You just need to decide to "click" or "not click", "true" or "false".
So, this project is as a pluggable module in hcaptcha-challenger, which can quick iteration and update. When a new challenge comes, just train a simple resnet model for it is enough.
This ResNetMini model is only 295KB
for onnx format. But I don't know how big the hCaptcha generation model is, haha!
Make AI great again!
In progressing...
- ResNetMini
- size: 295 KB
- params: 75154 trainable parameters
- structure: conv - bn - relu - conv - bn - conv - bn - relu
Library: Python 3.7+, PyTorch>=1.8.1 [Optional: CUDA>=10.2]
System: Windows/Linux/Mac
(It supports all system which can install PyTorch, but I just test it on Windows. Hope you know, and Welcome a pr!)
Run following command.
git clone https://github.com/beiyuouo/hcaptcha-model-factory.git
cd hcaptcha-model-factory
pip install -r requirements.txt
cd src
When a new task comes, you need to modify the task_name
varible in config.py
. You may need to tune the parameters in training setting
section.
I think you do not need a label tool for this task... Just drag and drop the picture to the corresponding label folder is enough. It's easy, right?
Place your labeled data in data\[task]\origin\[yes|bad]
. It will be divied automatically.
python main.py --split
python main.py --mode train
python main.py --mode val
In progressing...
python main.py --split --mode trainval
Copyright @BJ.YAN