Dataset

A dataset is a structured collection of documents generated from a specific project.

Datasets can be downloaded in either JSON or CSV format. JSON suits applications that need structured data, while CSV suits spreadsheet applications like Excel.

NOTE: JSON data is returned in the JSONL format, meaning each document/row in a dataset is listed on its own line in the file. For very large datasets, this allows the data to be consumed efficiently, one line at a time, without loading all of it into memory at once.
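
As an illustration of the line-by-line consumption described above, here is a minimal Python sketch that streams a downloaded JSONL file one document at a time. The function name `iter_jsonl` and the file name `dataset.jsonl` are hypothetical, chosen only for this example; the fields inside each document depend on your project and are not assumed here.

```python
import json
from typing import Iterator


def iter_jsonl(path: str) -> Iterator[dict]:
    """Yield one parsed document per non-empty line of a JSONL file.

    Only the current line is held in memory, so this works for
    files that are too large to load all at once.
    """
    with open(path, encoding="utf-8") as f:
        for line in f:
            line = line.strip()
            if line:  # skip blank lines, e.g. a trailing newline
                yield json.loads(line)
```

A typical use would be `for doc in iter_jsonl("dataset.jsonl"): ...`, processing each document as it is read. Note that calling `json.load` on the whole file would fail, because a JSONL file is a sequence of JSON values rather than a single JSON value.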