a few typos and possible fixes #155
Replies: 3 comments
-
For the first 3 issues, please free feel to contribute to the repo. For the general discussion, please see #81 for more detailed discussion. |
Beta Was this translation helpful? Give feedback.
-
You guys fixed the utf-8 code along with refactoring out the loading config script to better handle the file path issues. The typos were still there. I've created a pull request: #192 |
Beta Was this translation helpful? Give feedback.
-
In my experience, even Google Colab couldn't handle |
Beta Was this translation helpful? Give feedback.
-
Hi
I like the project and it's pretty good as it stands.
A few issues that I have noticed and I could submit PR requests to fix a few issues.
If you'd like those fixes in a single PR or multiple PR one for each of the above issues, you can let me know.
A more generalized discussion
I've ran the pre-training stage max_len 1800 batch size 4
the time reported between each epoch was about 1600-2100
the first pass was able to fit in a 4090mobile 16GB
trying to train on stage 2 totally failed so I haven't been able to proceed beyond that stage as of yet.
I might re-train with a google cloud A100 to see if I can get stage 2 to train properly.
here's the log from trying to train stage 2
Beta Was this translation helpful? Give feedback.
All reactions