TL;DR
Backfill currently requires the data lake to be a trusted source. Validate that the data we ingest is actually correct, so a faulty or malicious source can't silently feed us bad data — removing the trusted-source requirement.
Scope
- Determine how ingested data can be verified as correct without trusting the source, and put that verification in place.
TL;DR
Backfill currently requires the data lake to be a trusted source. Validate that the data we ingest is actually correct, so a faulty or malicious source can't silently feed us bad data — removing the trusted-source requirement.
Scope