Error2 reports
Fix NonMatchingSplitsSizesError
in Datasets
✅ Solution
NonMatchingSplitsSizesError usually arises when the manually defined split proportions don't align with the actual number of examples discovered during dataset generation. Ensure that the sum of `split` argument values in `datasets.SplitGenerator` equals the true number of examples, or avoid specifying `split` entirely to let the library automatically infer the split sizes. If a dataset is sharded, verify that downloaded shards are complete and not corrupted, causing an incorrect example count.
Related Issues
Real GitHub issues where developers encountered this error:
Timeline
First reported:Mar 26, 2026
Last reported:Mar 26, 2026