December 9, 2024:
Databricks Launches Synthetic Data Generation API - Databricks has launched a new API within its Mosaic AI Agent Evaluation tool for creating synthetic datasets. This service facilitates the generation of question-answer pairs for machine learning projects, especially those with large language models. Developers can upload business data and specify outputs, with the API offering customization options.
The process aims to enable faster and more accurate dataset reviews by subject matter experts. Future updates will provide an enhanced interface for dataset reviewers and introduce tools for monitoring dataset changes over time.