ALTR offers several tools to assist in data discovery and governance: classification with Google DLP, classification with Snowflake, and metadata management with Snowflake Object Tag data. These tools help ALTR users identify sensitive columnar data and group it via Data Tags, so that governance can be applied at scale.
Data Tags are metadata within ALTR that enable users to define groups of columnar data. Data Tags can be generated by performing a Google DLP Classification, a Snowflake Classification, or a Snowflake Object Tag import. When any of these processes is run, ALTR groups columns together and tags them based on the resulting Google DLP classifications or Snowflake Object Tags. Enterprise-level ALTR users can then define Column Access Policies on these tags instead of specifying columns individually.
Note: Even if Column Access Policies are defined by tags, columns must still be individually connected to ALTR in order to be governed. Learn more about connecting columns to ALTR here.
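The relationship between tags, connected columns, and governed columns can be illustrated with a minimal Python sketch. This is not ALTR's API or implementation: the `Catalog` class, its methods, and the tag and column names are all hypothetical, chosen only to show that a tag-based policy effectively applies to the columns that are both tagged and connected.

```python
from dataclasses import dataclass, field


@dataclass
class Catalog:
    """Hypothetical in-memory model of tagged and connected columns."""
    tags: dict[str, set[str]] = field(default_factory=dict)  # tag name -> column names
    connected: set[str] = field(default_factory=set)         # columns connected to ALTR

    def tag(self, tag_name: str, column: str) -> None:
        # Add a column to the group identified by tag_name.
        self.tags.setdefault(tag_name, set()).add(column)

    def governed_columns(self, tag_name: str) -> set[str]:
        # A tag-based policy only takes effect on columns that are also connected.
        return self.tags.get(tag_name, set()) & self.connected


catalog = Catalog()
catalog.tag("EMAIL_ADDRESS", "CUSTOMERS.EMAIL")
catalog.tag("EMAIL_ADDRESS", "LEADS.CONTACT_EMAIL")
catalog.connected.add("CUSTOMERS.EMAIL")  # LEADS.CONTACT_EMAIL is tagged but not connected

print(sorted(catalog.governed_columns("EMAIL_ADDRESS")))  # ['CUSTOMERS.EMAIL']
```

In this sketch, `LEADS.CONTACT_EMAIL` carries the tag but is not governed until it is connected, which mirrors the note above.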
Google DLP Classification enables ALTR users to send a random sample of their data to Google’s DLP service for classification. ALTR selects a random sample from each column in your Snowflake database - up to 256 values per column - and sends that sample to Google DLP for analysis. If a column contains only a limited amount of data (2x the number of columns in a table), it is not classified. Each column is sampled separately to protect the randomness and anonymity of the data. Google’s DLP service returns possible classification results to ALTR, which associates those results with the affected columns as Data Tags. Google DLP results can be accessed in ALTR via the Google DLP Classification Report.
Note: Due to the sensitive nature of client data, ALTR does not perform a GDLP classification unless explicitly requested by a user. Additionally, ALTR does not persist any of the values sampled during a Google DLP classification.
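The per-column sampling described above can be sketched in Python. This is an illustration only, not ALTR's implementation: `sample_columns` and `min_values` are hypothetical names, and treating the "limited amount of data" rule as a simple minimum row count is an assumption. Only the 256-value cap and the independent sampling of each column come from the text.

```python
import random

MAX_SAMPLE = 256  # per-column cap stated in the text


def sample_columns(table, min_values, rng=None):
    """Return a random sample per column; skip columns with too little data.

    table: dict mapping column name -> list of values.
    min_values: hypothetical threshold below which a column is not classified.
    """
    rng = rng or random.Random()
    samples = {}
    for name, values in table.items():
        if len(values) < min_values:
            continue  # too little data: the column is not classified
        # Each column is sampled on its own, so sampled values from
        # different columns do not line up into full rows.
        samples[name] = rng.sample(values, min(MAX_SAMPLE, len(values)))
    return samples


table = {
    "EMAIL": [f"user{i}@example.com" for i in range(1000)],
    "NOTES": ["n/a", "n/a"],
}
# Assumption: threshold taken as 2x the number of columns in the table.
samples = sample_columns(table, min_values=2 * len(table), rng=random.Random(0))
print({name: len(vals) for name, vals in samples.items()})  # {'EMAIL': 256}
```

Because the samples are held only in local variables and never written anywhere, the sketch is also consistent with the note that sampled values are not persisted.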
ALTR integrates with Snowflake’s Object Tagging functionality to import any Object Tags available in Snowflake. Two options are available for importing Snowflake Object Tag data into ALTR: importing any existing Object Tags available in Snowflake, or executing a Snowflake Classification first and then importing all available Object Tag data.
Object Tags are metadata in Snowflake, similar to ALTR’s Data Tags, that are used to associate particular Snowflake objects with each other. You can manually define these tags in Snowflake and assign them to columns, or you can automatically generate them through a Snowflake Data Classification.
Snowflake’s Data Classification tool scans all of the columns in a Snowflake database and attempts to identify what kind of data exists in each column. A classification produces two Snowflake Object Tags: Semantic Categories, which indicate the specific type of data, and Privacy Categories, which indicate the sensitivity of the data. If you trigger a Snowflake Classification and Object Tag Integration for a database in ALTR, ALTR triggers a classification for each column in that database and then stores the resulting Object Tags - as well as all other Object Tags available in Snowflake - as Data Tags in ALTR. Running a Snowflake Object Tag Import without a classification does not trigger a new Snowflake Classification, but it can still import any Object Tags created by previous Snowflake classifications.
Note: When performing a Snowflake Classification and Object Tag Import, your data stays local inside Snowflake. ALTR does not access the individual values present in your Snowflake Database; it only accesses the resulting metadata.
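To make the import step concrete, the following Python sketch groups Object Tag metadata into tag-keyed sets of columns, which is roughly what storing Object Tags "as Data Tags" implies. The row shape `(column, tag_name, tag_value)`, the function `to_data_tags`, and the sample rows (including the manually defined `COST_CENTER` tag) are assumptions for illustration; this is not ALTR's import logic. Note that the input contains only metadata, never column values.

```python
from collections import defaultdict

# Hypothetical Object Tag metadata rows: (column, tag name, tag value).
object_tags = [
    ("CUSTOMERS.EMAIL", "SEMANTIC_CATEGORY", "EMAIL"),
    ("CUSTOMERS.EMAIL", "PRIVACY_CATEGORY", "IDENTIFIER"),
    ("CUSTOMERS.ZIP", "PRIVACY_CATEGORY", "QUASI_IDENTIFIER"),
    ("CUSTOMERS.EMAIL", "COST_CENTER", "SALES"),  # a manually defined tag
]


def to_data_tags(rows):
    """Group columns by (tag name, tag value) so each pair acts as one tag."""
    data_tags = defaultdict(set)
    for column, tag_name, tag_value in rows:
        data_tags[(tag_name, tag_value)].add(column)
    return dict(data_tags)


data_tags = to_data_tags(object_tags)
for key, columns in sorted(data_tags.items()):
    print(key, sorted(columns))
```

Both classification-generated tags and manually defined tags pass through the same grouping, matching the statement that all available Object Tags are imported.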
In ALTR, users can leverage data tags to more easily identify sensitive data and use these identifications to create governance rules at scale.
ALTR enables users to see the results of a Google DLP Classification via the Google DLP Classification Report, which is available in the second tab of the Data Management Page. On the Google DLP Classification Report, users can easily see the most recent Google DLP classification for each database and connect the classified columns with a single click.
Note: ALTR automatically generates a Friendly Name for columns connected through the Google DLP Classification Report, based on the classification and the column name.
Note: ALTR does not yet offer an interface to enable users to visualize the result of Snowflake Classifications. This feature is expected in Q4 2022.
Enterprise ALTR customers can define Column Access Policies on Data Tags, saving the significant time otherwise spent specifying individual columns when creating and updating policies.