Great work! However, when using the `write_selected_data.py` script, a mismatch between the number of `--train_file_names` and `--train_files` arguments results in an assertion failure.

Steps to Reproduce

Run the data selection script as described in the README:

Observe the assertion failure:

The script contains the following assertion:

In this example, `--train_file_names` has 4 names (`flan_v2`, `cot`, `dolly`, `oasst1`), while `--train_files` provides only 2 file paths (`dolly_data.jsonl` and `oasst1_data.jsonl`). This mismatch triggers the assertion, causing the script to terminate unexpectedly.
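
One possible way to make this failure easier to diagnose is to replace the bare assertion with an explicit length check that reports both lists. The sketch below is only an illustration, not the script's actual code: the two flag names come from this issue, while `build_parser`, `check_paired_args`, and `main` are hypothetical helpers, and the real script may parse its arguments differently.

```python
import argparse
from typing import Optional, Sequence


def build_parser() -> argparse.ArgumentParser:
    # Hypothetical parser: only the two flag names are taken from the issue.
    parser = argparse.ArgumentParser(description="Sketch of paired-argument validation")
    parser.add_argument("--train_file_names", nargs="+", required=True)
    parser.add_argument("--train_files", nargs="+", required=True)
    return parser


def check_paired_args(names: Sequence[str], files: Sequence[str]) -> None:
    """Raise ValueError with a descriptive message when the two counts differ."""
    if len(names) != len(files):
        raise ValueError(
            f"--train_file_names has {len(names)} entries ({', '.join(names)}) "
            f"but --train_files has {len(files)} ({', '.join(files)}); "
            "pass exactly one file path per name."
        )


def main(argv: Optional[Sequence[str]] = None) -> None:
    # Parse the command line (or an explicit argv list) and validate the pairing
    # before any data is read.
    args = build_parser().parse_args(argv)
    check_paired_args(args.train_file_names, args.train_files)
```

With the 4 names and 2 paths from the example above, `check_paired_args` raises a `ValueError` that lists both sets of values, so the mismatch is visible without reading the source.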