Expand dataset with synthetic samples

Generate synthetic dataset samples using LLM based on existing data patterns

Path parameters

idstringRequiredformat: "uuid"

Request

This endpoint expects an object.
modelstringRequired>=1 character
The model to use for synthetic data generation
sample_countintegerOptional1-200
Number of synthetic samples to generate
preserve_fieldslist of stringsOptional
Fields to preserve patterns from original data
variation_instructionsstringOptional
Additional instructions for data variation
custom_promptstringOptional

Custom prompt to use for generation instead of auto-generated one

max_completion_tokensintegerOptional>=100
Maximum number of tokens for the LLM response. Required by Anthropic, used as maxOutputTokens for Gemini. If not provided, defaults to 4000 for Anthropic models only.

Response

Generated synthetic samples
generated_sampleslist of objects
List of generated synthetic dataset items
modelstring
Model used for generation
total_generatedinteger
Total number of samples generated
generation_timedatetimeRead-only
Generation timestamp