This guide will walk you through the creation, management, and deletion of datasets for fine-tuning of foundational language models.
prompt
(Optional): This column should contain the input text, instruction, or question for the model.answer
(Required): This column must contain the desired output or response from the model.question | answer |
---|---|
What is the capital of France? | The capital of France is Paris. |
Who wrote “To Kill a Mockingbird”? | Harper Lee wrote “To Kill a Mockingbird”. |
Explain the theory of relativity in simple terms. | The theory of relativity, developed by Albert Einstein, describes how gravity is a property of spacetime, and how space and time are linked. |
id
. It is essential to save the id
for each uploaded file, as you will need them in the next step to create your dataset.
id
for your training and validation files, you can create a dataset. A dataset groups these files under a single ID that you’ll use to start a fine-tuning job.
To create a dataset, provide a name, the fileid
for your training file, and optionally, the file id
for your validation file.