Currently the project allows inspection and deidentification of sensitive data in different data sources.
There is a need to reidentify the data for business needs.
Implement a reidentify pipeline that:
- Uses the same encryption configuration as deidentify pipeline
- Supports following sources:
a. BigQuery Table or Query
b. AVRO file
- Supports writing output as BigQuery table or AVRO for CSV file
- Supports emitting only a subset of columns (all columns with subset not provided)
Currently the project allows inspection and deidentification of sensitive data in different data sources.
There is a need to reidentify the data for business needs.
Implement a reidentify pipeline that:
a. BigQuery Table or Query
b. AVRO file