add extract gpt result script #142

ouyangyu · 2021-08-11T02:44:27Z

Filtered Result `median value`

case	memory (MiB)	lantency (ms)	throuthput(sample/sec)
1n1g_dp1_mp1_pp1_mbs16_gbs16_na1_l24_hs1536_nah24_sl2048	30008	2223.93	7.19
1n1g_dp1_mp1_pp1_mbs1_gbs1_na1_l24_hs2304_nah24_sl2048	30130	286.76	3.49
1n1g_dp1_mp1_pp1_mbs2_gbs2_na1_l24_hs2304_nah24_sl2048	31080	489.99	4.08
1n1g_dp1_mp1_pp1_mbs4_gbs4_na1_l24_hs2304_nah24_sl2048	32984	896.91	4.46
1n4g_dp4_mp1_pp1_mbs1_gbs4_na1_l24_hs2304_nah24_sl2048	33870	305.59	13.09
1n4g_dp4_mp1_pp1_mbs2_gbs8_na1_l24_hs2304_nah24_sl2048	34808	510.87	15.66
1n4g_dp4_mp1_pp1_mbs4_gbs16_na1_l24_hs2304_nah24_sl2048	36724	917.67	17.44
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l16_hs2304_nah16_sl2048	14110	977.59	32.73
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l24_hs1536_nah24_sl2048	13214	1049.49	30.49
1n8g_dp1_mp8_pp1_mbs32_gbs32_na1_l24_hs2304_nah24_sl2048	17744	1516.38	21.10
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l16_hs2304_nah16_sl2048	24748	1949.71	32.83
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l24_hs1536_nah24_sl2048	23504	2063.59	31.01
1n8g_dp1_mp8_pp1_mbs64_gbs64_na1_l24_hs2304_nah24_sl2048	31270	3016.09	21.22
1n8g_dp2_mp4_pp1_mbs16_gbs32_na1_l24_hs1536_nah24_sl2048	12584	806.30	39.69
1n8g_dp2_mp4_pp1_mbs16_gbs32_na1_l24_hs2304_nah24_sl2048	16476	1174.57	27.24
1n8g_dp2_mp4_pp1_mbs32_gbs64_na1_l24_hs2304_nah24_sl2048	25402	2294.77	27.89
1n8g_dp4_mp2_pp1_mbs16_gbs64_na1_l24_hs1536_nah24_sl2048	19880	1331.34	48.07
1n8g_dp4_mp2_pp1_mbs16_gbs64_na1_l24_hs2304_nah24_sl2048	26304	1963.60	32.60
1n8g_dp8_mp1_pp1_mbs16_gbs128_na1_l24_hs1536_nah24_sl2048	32112	2263.92	56.54
1n8g_dp8_mp1_pp1_mbs1_gbs8_na1_l24_hs2304_nah24_sl2048	33870	312.95	25.56
1n8g_dp8_mp1_pp1_mbs2_gbs16_na1_l24_hs2304_nah24_sl2048	34820	518.97	30.83
1n8g_dp8_mp1_pp1_mbs4_gbs32_na1_l24_hs2304_nah24_sl2048	36730	928.78	34.45

ShawnXuan · 2021-08-11T03:09:08Z

OneFlow/LanguageModeling/GPT/extract_gpt_result.py

+from extract_util import extract_result 
+
+
+parser = argparse.ArgumentParser(description="flags for BERT benchmark")


BERT -> GPT

ShawnXuan · 2021-08-11T03:09:26Z

OneFlow/LanguageModeling/GPT/extract_gpt_result.py

+    Training...
+    | step     | micro_batches   | samples         | throughput | latency    | loss       |
+    | -------- | --------------- | --------------- | ---------- | ---------- | ---------- |
+    | 1        | 1               | 32              | 3.65895    | 8.74569    | 11.27187   |


every step?

ShawnXuan · 2021-08-11T03:09:45Z

OneFlow/LanguageModeling/GPT/extract_util.py

+def compute_throughput(result_dict, args):
+    throughput = 0
+    latency = 0
+    for i in range(args.start_iter,args.end_iter):


every step?

add extract gpt result script

e5bd2fc

ShawnXuan reviewed Aug 11, 2021

View reviewed changes

ouyangyu added 2 commits August 11, 2021 11:48

code format

56e14e8

refine

1d13526

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

add extract gpt result script #142

add extract gpt result script #142

ouyangyu commented Aug 11, 2021

ShawnXuan Aug 11, 2021

ShawnXuan Aug 11, 2021

ShawnXuan Aug 11, 2021

		from extract_util import extract_result


		parser = argparse.ArgumentParser(description="flags for BERT benchmark")

add extract gpt result script #142

Are you sure you want to change the base?

add extract gpt result script #142

Conversation

ouyangyu commented Aug 11, 2021

Filtered Result median value

ShawnXuan Aug 11, 2021

Choose a reason for hiding this comment

ShawnXuan Aug 11, 2021

Choose a reason for hiding this comment

ShawnXuan Aug 11, 2021

Choose a reason for hiding this comment

Filtered Result `median value`