First draft of methodology for modeling generative AI systems #221

bokelley · 2024-07-31T12:23:13Z

No description provided.

docs/overview.mdx

MikeFreyberger · 2024-08-26T13:37:10Z

docs/snippets/defaults_genai.mdx

@@ -0,0 +1,12 @@
+```


where is this snippet used?

It's not, but figured we'd want it when we do the model later? Wasn't sure if we'd do an AI variant of calculations.mdx, so really just here as a placeholder.

MikeFreyberger · 2024-08-26T13:45:29Z

docs/training.mdx

+| --------- | -------------- |
+| Total reserved time | 118 days |
+| Reservation start time | January 2022 (?) |
+| GPU hours for final model | 1,082,990 |


Is GPU hours supposed to be the hours where the GPUs were used during both the intermediate and final training?

In this case with a cluster of 384 GPUs, this total GPU hours represents 117.5 (1082990/384/24) cluster days, which is the same as total reserved time.

I would have thought total reserved time in days to be more than the GPU hours for final model.

Right - that's where we backfill the intermediate down below when we normalize. If you have ideas on how to clarify please lmk
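
For reference, the arithmetic in this thread can be reproduced with a short sketch. It takes the 384-GPU cluster size, the 1,082,990 GPU hours and the 118 reserved days from the comments and table above. The helper name `cluster_days` is hypothetical, and the calculation assumes every GPU in the cluster is busy for the whole period.

```python
# Figures quoted in the review thread and the training.mdx table above.
GPU_COUNT = 384
GPU_HOURS_FINAL_MODEL = 1_082_990
TOTAL_RESERVED_DAYS = 118


def cluster_days(gpu_hours: float, gpu_count: int) -> float:
    """Convert GPU-hours into wall-clock days, assuming all GPUs run in parallel."""
    return gpu_hours / gpu_count / 24


final_model_days = cluster_days(GPU_HOURS_FINAL_MODEL, GPU_COUNT)
# Reserved time that the final-model GPU hours do not account for.
remaining_days = TOTAL_RESERVED_DAYS - final_model_days

print(f"{final_model_days:.1f} cluster days for the final model")
print(f"{remaining_days:.1f} reserved days left over")
```

Under these assumptions the final model alone fills almost the entire reservation, which is the mismatch raised in the comment.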

MikeFreyberger · 2024-08-26T13:49:02Z

docs/training.mdx

+
+Embodied emissions = (cluster embodied emissions per hour) x (training time)
+
+Usage emissions = (usage energy per GPU-hour) x (total GPU hours) x (average grid intensity during training)


If, instead of total GPU hours, we had GPU hours broken down by hour, we could potentially use the actual grid mix during each hour. I'm thinking of cases where training could be paused and resumed so that it only runs when marginal grid intensity is low.

Yep agreed - though I was thinking this would be easiest to represent as a lower average grid mix?
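
A minimal sketch of the two options discussed in this thread, assuming the formulas from the diff above. All function names, units (kWh per GPU-hour, gCO2e per kWh) and sample values are hypothetical and chosen only for illustration. It shows that an hour-by-hour calculation gives the same result as the existing formula when the average grid intensity is weighted by GPU hours.

```python
from typing import Sequence


def embodied_emissions(cluster_embodied_per_hour: float, training_hours: float) -> float:
    """Embodied emissions = (cluster embodied emissions per hour) x (training time)."""
    return cluster_embodied_per_hour * training_hours


def usage_emissions(energy_per_gpu_hour: float, total_gpu_hours: float,
                    avg_grid_intensity: float) -> float:
    """Usage emissions = (energy per GPU-hour) x (total GPU hours) x (average grid intensity)."""
    return energy_per_gpu_hour * total_gpu_hours * avg_grid_intensity


def usage_emissions_hourly(energy_per_gpu_hour: float,
                           gpu_hours_by_hour: Sequence[float],
                           intensity_by_hour: Sequence[float]) -> float:
    """Hour-by-hour variant: each hour's GPU hours use that hour's grid intensity."""
    return energy_per_gpu_hour * sum(
        g * i for g, i in zip(gpu_hours_by_hour, intensity_by_hour)
    )


def weighted_grid_intensity(gpu_hours_by_hour: Sequence[float],
                            intensity_by_hour: Sequence[float]) -> float:
    """Average grid intensity weighted by the GPU hours used in each hour."""
    weighted_sum = sum(g * i for g, i in zip(gpu_hours_by_hour, intensity_by_hour))
    return weighted_sum / sum(gpu_hours_by_hour)


# Hypothetical inputs: training is paused during the second (high-intensity) hour.
energy_per_gpu_hour = 0.5                 # kWh per GPU-hour (assumed)
gpu_hours_by_hour = [384, 0, 384, 384]    # GPU hours used in each wall-clock hour
intensity_by_hour = [200, 500, 250, 150]  # gCO2e per kWh in each hour (assumed)

hourly = usage_emissions_hourly(energy_per_gpu_hour, gpu_hours_by_hour, intensity_by_hour)
effective_intensity = weighted_grid_intensity(gpu_hours_by_hour, intensity_by_hour)
averaged = usage_emissions(energy_per_gpu_hour, sum(gpu_hours_by_hour), effective_intensity)
simple_mean_intensity = sum(intensity_by_hour) / len(intensity_by_hour)
embodied = embodied_emissions(cluster_embodied_per_hour=10.0, training_hours=4)
```

With these inputs the weighted intensity comes out below the simple mean of the hourly values, so pausing during high-intensity hours does show up as a lower average grid mix in the existing formula.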

bokelley added 16 commits July 18, 2024 14:22

wip on AI model

8041ef3

split ai to new section

5d2fa97

wip

cec32ec

wip

bafa5c6

first draft of training

a84cc8b

split out cluster and nvidia

d5eba0f

fine tuning updated

8219aea

inference service

6605fd9

fix tests

2805882

add model for token to energy

2b2f6b0

update inference model to include LoRA; update overview to include da…

056d6b8

…ta from BDavy

break out datacenter more clearly in cluster definition

a799db5

update water use estimates for a100

2b313ef

Merge remote-tracking branch 'origin/main' into genai

bd660fe

update cluster link & fix typos

1d2c53a

break out memory usage for more granular calculation

1b1b961

MikeFreyberger reviewed Aug 26, 2024

bokelley and others added 4 commits August 28, 2024 13:17

Merge remote-tracking branch 'origin/main' into genai

593dee6

Split out foundation components, fix various typos

bbc30ca

Update overview.mdx

3535618

LR updating phase description. Testing for update process going forward

Update overview.mdx

75db097

Remove extra word

lratliff3 approved these changes Oct 2, 2024

MikeFreyberger changed the base branch from main to preview October 3, 2024 18:24

bokelley merged commit 6379647 into preview Oct 3, 2024
2 checks passed

bokelley deleted the genai branch October 3, 2024 19:04

bokelley commented Jul 31, 2024

MikeFreyberger Aug 26, 2024

bokelley Aug 29, 2024

MikeFreyberger Aug 26, 2024

bokelley Aug 29, 2024

MikeFreyberger Aug 26, 2024

bokelley Aug 29, 2024

