Skip to content

Actions: zhongkaifu/Seq2SeqSharp

Actions

All workflows

Actions

Loading...
Loading

Showing runs from all workflows
204 workflow runs
204 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Update: Convert token frequency counter type to long for huge data set
.NET #297: Pull request #78 opened by zsogitbe
October 21, 2023 06:49 16m 35s zsogitbe:master
October 21, 2023 06:49 16m 35s
Convert token frequency counter type to long for huge data set
.NET #296: Commit 87b3ad6 pushed by zhongkaifu
October 20, 2023 18:51 16m 40s master
October 20, 2023 18:51 16m 40s
Merge pull request #77 from zsogitbe/master
.NET #295: Commit 3e99859 pushed by zhongkaifu
October 20, 2023 16:49 21m 28s master
October 20, 2023 16:49 21m 28s
Normalizing and simplifying Cuda precompilation logging
.NET #294: Pull request #77 synchronize by zsogitbe
October 20, 2023 11:26 22m 42s zsogitbe:master
October 20, 2023 11:26 22m 42s
Add layernorm for source embeddings in image caption
.NET #290: Commit 53d517b pushed by zhongkaifu
October 18, 2023 15:28 22m 41s master
October 18, 2023 15:28 22m 41s
minor bug fix
.NET #289: Commit 6022102 pushed by zhongkaifu
October 18, 2023 03:50 16m 33s master
October 18, 2023 03:50 16m 33s
minor bug fix
.NET #288: Commit 4588a7e pushed by zhongkaifu
October 18, 2023 03:45 20m 42s master
October 18, 2023 03:45 20m 42s
Bug fix for learning rate factor in Image2Seq
.NET #287: Commit a828018 pushed by zhongkaifu
October 17, 2023 17:34 22m 56s master
October 17, 2023 17:34 22m 56s
Code refactoring for image caption
.NET #286: Commit d69d48c pushed by zhongkaifu
October 11, 2023 18:40 16m 47s master
October 11, 2023 18:40 16m 47s
Bug fix: Pass SaveModelEveryUpdates to the framework
.NET #285: Commit 69e644f pushed by zhongkaifu
October 10, 2023 16:49 23m 3s master
October 10, 2023 16:49 23m 3s
October 10, 2023 14:05 16m 27s
Check weights corrupted while loading/saveing models
.NET #283: Commit bd152bf pushed by zhongkaifu
September 26, 2023 14:52 20m 5s master
September 26, 2023 14:52 20m 5s
Update strategy for corrupted weights
.NET #282: Commit af9fb59 pushed by zhongkaifu
September 25, 2023 17:50 16m 8s master
September 25, 2023 17:50 16m 8s
Check Nan value when loading and saving weights
.NET #281: Commit fffd57a pushed by zhongkaifu
September 25, 2023 02:57 17m 53s master
September 25, 2023 02:57 17m 53s
Fix weights reload issue
.NET #280: Commit 5aa6eb5 pushed by zhongkaifu
September 22, 2023 17:01 16m 23s master
September 22, 2023 17:01 16m 23s
Fix model reloading bug
.NET #279: Commit 1ce1784 pushed by zhongkaifu
September 22, 2023 16:45 1m 54s master
September 22, 2023 16:45 1m 54s
Optimize memory usage when calculating cross entory loss
.NET #278: Commit 8f2af2d pushed by zhongkaifu
September 22, 2023 14:14 16m 38s master
September 22, 2023 14:14 16m 38s
Optimize memory usage for EltMulMulAdd operatior
.NET #277: Commit 71e9372 pushed by zhongkaifu
September 20, 2023 22:14 16m 52s master
September 20, 2023 22:14 16m 52s
Add RMSNorm
.NET #276: Commit caff84b pushed by zhongkaifu
September 16, 2023 20:49 17m 8s master
September 16, 2023 20:49 17m 8s
Make Positional embedding is configurable
.NET #275: Commit e972fa7 pushed by zhongkaifu
September 12, 2023 18:39 20m 22s master
September 12, 2023 18:39 20m 22s
Print out some debug information
.NET #274: Commit a61fd87 pushed by zhongkaifu
September 11, 2023 17:52 20m 0s master
September 11, 2023 17:52 20m 0s
Add inPlace for SiLU activation when it runs forward only.
.NET #273: Commit f1d5a6d pushed by zhongkaifu
September 7, 2023 03:27 16m 51s master
September 7, 2023 03:27 16m 51s
Add LeakyReLU activation function
.NET #272: Commit 89b9b25 pushed by zhongkaifu
September 5, 2023 20:19 20m 45s master
September 5, 2023 20:19 20m 45s
Only keep RoPE for self-attention layer
.NET #271: Commit 54945f1 pushed by zhongkaifu
September 4, 2023 17:54 16m 26s master
September 4, 2023 17:54 16m 26s
Add option for start batch id
.NET #270: Commit 4db69a5 pushed by zhongkaifu
September 4, 2023 03:58 20m 24s master
September 4, 2023 03:58 20m 24s