Merge pull request nomic-ai#174 from waybarrios/fixing_data_bug
DatasetDict to dataset object.
zanussbaum authored Apr 3, 2023
2 parents c62312f + 1a45144 commit 846f4cd
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion data.py
@@ -68,7 +68,7 @@ def load_data(config, tokenizer):
         dataset = load_dataset("json", data_files=files, split="train")

     else:
-        dataset = load_dataset(dataset_path)
+        dataset = load_dataset(dataset_path,split='train')

     dataset = dataset.train_test_split(test_size=.05, seed=config["seed"])
