-
Notifications
You must be signed in to change notification settings - Fork 30
Milestone
Description
This issue is a general container for matters relating to datasets in general. Discussions on TinyMMLU or IFEval specifically should go in the sub issues for this one.
Current Status
TinyMMLU
- Dataset is converted from
.parquetto.tfrecordvia a utility script. - Dataset loads
.tfrecordand stores data inside samples. - Dataset provides samples by id to driver/backend in proper format.*
- Dataset Processes output from driver/backend.*
- Dataset calculates and provides accuracy using output data on device.
IFEval
- Dataset is converted from
.jsonlto.tfrecordvia a utility script. - Dataset loads
.tfrecordand stores data inside samples. - Dataset provides samples by id to driver/backend in proper format.*
- Dataset Processes output from driver/backend.*
- Dataset calculates and provides accuracy using output data on device.
* This includes tokenization/detokenization using common SentencePiece utility code.
Sub-issues
Metadata
Metadata
Assignees
Labels
No labels