Commit

fixed sql query
soldni committed Jul 5, 2023
1 parent 9965b4c commit 17cc37a
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion pretrain_data/s2/v3-fos/process_corpus/s2orc.sql
@@ -131,7 +131,7 @@ UNLOAD (
ARRAY_AGG(metadata)[1] AS metadata,
ARRAY_AGG(split)[1] AS split,
CAST(id AS INT) % 10 AS part_id
- FROM filtered_corpus
+ FROM filtered_corpus_with_fos
GROUP BY id
)
TO 's3://ai2-llm/pretraining-data/sources/s2/v3-fos/documents/dataset=s2orc'
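For readers less familiar with this query shape, the visible part of the hunk groups rows by `id`, keeps one `metadata` and one `split` value per group, and derives `part_id` as `id % 10`. The following is a minimal Python sketch of that logic only, not the author's pipeline. The function name `dedupe_and_partition` and the sample rows are hypothetical, and it keeps the first row seen per `id`, whereas `ARRAY_AGG` without an `ORDER BY` does not guarantee which value ends up first.

```python
def dedupe_and_partition(rows, num_parts=10):
    """Keep one row per id and attach part_id = int(id) % num_parts.

    Hypothetical helper mirroring GROUP BY id with ARRAY_AGG(...)[1];
    here "first" means first seen in the input order.
    """
    first_by_id = {}
    for row in rows:
        # setdefault stores a row only if this id has not been seen yet
        first_by_id.setdefault(row["id"], row)
    return [
        {
            "id": row["id"],
            "metadata": row["metadata"],
            "split": row["split"],
            # same arithmetic as CAST(id AS INT) % 10 in the query
            "part_id": int(row["id"]) % num_parts,
        }
        for row in first_by_id.values()
    ]


if __name__ == "__main__":
    # Made-up sample rows; the real table is filtered_corpus_with_fos
    sample = [
        {"id": "13", "metadata": {"year": 2020}, "split": "train"},
        {"id": "13", "metadata": {"year": 2021}, "split": "train"},
        {"id": "27", "metadata": {"year": 2019}, "split": "valid"},
    ]
    for record in dedupe_and_partition(sample):
        print(record)
```

With ten possible `part_id` values, each record falls into one of ten buckets keyed by the last digit of its `id`.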