Which bert training program should I use? #46
Answered by Cryolite
timercrack asked this question in Q&A
There are three BERT training programs, called phase1, phase2, and phase3 (on the develop branch). Is there any difference between them? Which one should I use?
Answered by
Cryolite
Aug 2, 2023
The goal of each is different. For example, phase1's goal is to imitate human choices.
In phase1, the model learns to imitate human behavior in the training data. In phase2, it learns to predict the point delta of each round based on the choices made in the training data. The training data contains both human actions and the point delta of each round, and the only difference between phase1 and phase2 is what they calculate as their objective function.
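To make the "same data, different objective" point concrete, here is a minimal sketch in plain Python. It is not the repository's actual training code: the loss choices (cross-entropy for imitation, squared error for the point delta), the function names, and the record layout are all assumptions made for illustration.

```python
import math


def imitation_loss(logits, human_choice):
    """Cross-entropy of the human's chosen action under the model's logits.

    Assumed stand-in for a phase1-style objective (imitating human choices).
    """
    m = max(logits)
    # Numerically stable log-sum-exp over all candidate actions.
    log_z = m + math.log(sum(math.exp(x - m) for x in logits))
    return log_z - logits[human_choice]


def point_delta_loss(predicted_delta, actual_delta):
    """Squared error between predicted and recorded round point delta.

    Assumed stand-in for a phase2-style objective.
    """
    return (predicted_delta - actual_delta) ** 2


# One hypothetical training record: it carries both the human action and
# the point delta of the round, so either objective can be computed from it.
record = {"human_choice": 2, "round_delta": 3900}

phase1_style = imitation_loss([0.1, -1.0, 2.0], record["human_choice"])
phase2_style = point_delta_loss(3000.0, record["round_delta"])
```

Both losses read the same record; only the quantity being minimized changes, which is the distinction described above.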