AI-Ready Data

Generate training data from your unstructured documents to fine-tune a model.

The AI-Ready Data page lists your data generation jobs and their current status. Each job processes your uploaded documents and generates a training dataset you can use to fine-tune a model.

AI-Ready Data Jobs list view showing job IDs, statuses, and creation dates.

Create an AI-Ready Data job

  1. Navigate to Data Engine > AI-Ready Data.
  2. Click + Create Job.
  3. In the Define Goal and Upload Data Sources step:
    • Enter your fine-tuning goal. Describe the outcome you want to achieve with your fine-tuned model. For example: "I'm developing a chat assistant that must align with the ethical standards and organizational values of my company brand."
    • Upload your data source files. Drag and drop or click Browse. You can upload up to 10 files at a time. Accepted file formats: PDF, Word, Markdown, JSON. Maximum size 100 MB per file.
  4. Click Next to review your setup.
  5. On the Confirm and Start Job page, review your fine-tuning goal and data sources, then click Start Job.

Job status

Each job moves through the following states:

StatusDescription
QueuedJob is in line to start
RunningDocuments are being processed and training data is being generated
CompletedTraining data has been generated and is ready to use
FailedSomething went wrong — review your files and try again

Output

Once a job completes, the generated training dataset is automatically saved to:

  • The AI-Ready Data page
  • Data Engine > Storage

No download is required. When creating a fine-tuning job, you can select the dataset directly from your saved files in SeekrFlow.

Next steps