Data validation for Customers

Overview

Bright Data’s automated dataset creation platform consists of a verification and approval phase before the dataset delivery. The platform facilitates error handling, validation checks, and customization, ensuring data accuracy and reliability. These validation checks are crucial in saving time, reducing errors in the data, and keeping the data quality at the desired level.

How does it work?

Once the dataset snapshot is ready:

The user will get the dataset with an indication in the platform that all tests passed.

When the user approves the dataset snapshot, they proceed to the delivery phase.

Validation Rules

Uniqueness

Filling Rate

Required fields

Data Stability

Type Verification

Schema and Field Custom Validation

Minimum Records Threshold

Data Size Fluctuation Threshold

Record Completeness Validation

Unique Identity and Duplicates Validation

Main components and functionality

Overall view of dataset tests (All results, Passed, Failed)

Evaluating the validation test results

Once the dataset snapshot validation errors are handled, the user is notified to evaluate and choose whether to:

Approve
Approve temporarily
Reject the snapshot.

Evaluation actions

For each failed validation test, the user has three options:

Set a new threshold
1. Set custom values - If the developer doesn’t reach the default set value, the user can choose a new threshold. The snapshot is returned to the developer once a new threshold is set.
2. Set for X% - Accept the success rate the developer reached and set the threshold to the value the developer managed to extract.
Ignore test (one time only) - Accept the value that the developer extracted just once (the default value will not change for the next dataset snapshot)
Reject - The user doesn’t accept the adjustments to the failed tests; the issues will be returned to the developer to fix. The status will be marked as “Rejected” for additional fixes and will later on be re-send to the customer for approval.

In case all issues are ignored/approved, click on “Deliver dataset” to deliver the snapshot.

A snapshot delivery will be automatically delivered when pending customer approval for 14 days.

In case all/some issues are rejected, click on “send back to the developer” to send it back for additional fixes.

Communications and notifications

Users are notified of status updates through their account in the control panel and via email.

Introduction

Product Guides

Data validation for Customers

Overview

How does it work?

Validation Rules

Main components and functionality

Evaluating the validation test results

Evaluation actions

Communications and notifications

Introduction

Product Guides

​Overview

​How does it work?

​Validation Rules

​Main components and functionality

​Evaluating the validation test results

​Evaluation actions

​Communications and notifications

Overview

How does it work?

Validation Rules

Main components and functionality

Evaluating the validation test results

Evaluation actions

Communications and notifications