Introduction
1. Obfuscate using Recommendation feature
When you click the recommendation "Obfuscate ssn", a step is added to the Preparation Script and the data is obfuscated.
This recommendation capability currently relies on a 'knowledge reference set', which is the default (vanilla) Oracle Analytics list of recommendations. For example, it may not recognize the social security numbers of every country in the world. However, very soon the product will support uploading a custom knowledge reference, which will enable administrators to tune the automatic recommendations that the product delivers.
2. Obfuscate using ObfuscateUpdate Function
There are instances where you may want to obfuscate data that the recommendation engine has not automatically identified for obfuscation. For instance, you may want to obfuscate a birthdate in your dataset. The recommendation engine will not identify this column for obfuscation. To apply obfuscation manually, you can use the ObfuscateUpdate function in data prep.
This function has the signature ObfuscateUpdate(<column>, <parameter>). The parameter can be 'ALL', 'FIRST_5_DIGITS' or 'FIRST_12_DIGITS'.
a. Obfuscate Parameter | ‘ALL’
b. Obfuscate Parameter | ‘FIRST_5_DIGITS’ and ‘FIRST_12_DIGITS’
In addition to the ‘ALL’ parameter, the function also accepts ‘FIRST_5_DIGITS’ and ‘FIRST_12_DIGITS’ as options. These parameters are applied only to the digits within the column, not to text. Let's say we want to obfuscate the first 5 digits of the ccexpires column. If the column value is 12/2015 and you mask the first 5 digits, it changes from 12/2015 to ##/###5: the slash is not a digit, so it is left alone and does not count toward the five.
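To make the digit-only rule concrete, here is a minimal Python sketch that imitates the behaviour described above. It is not how Oracle Analytics implements ObfuscateUpdate; the function name `mask_first_n_digits` is hypothetical, and the `#` mask character is taken from the 12/2015 example.

```python
def mask_first_n_digits(value: str, n: int, mask_char: str = "#") -> str:
    """Imitate FIRST_n_DIGITS: mask the first n digits, leave other characters."""
    masked = []
    remaining = n  # how many digits are still to be masked
    for ch in value:
        if remaining > 0 and ch.isdigit():
            masked.append(mask_char)
            remaining -= 1
        else:
            # Non-digits (and digits beyond the first n) pass through unchanged
            masked.append(ch)
    return "".join(masked)


print(mask_first_n_digits("12/2015", 5))  # ##/###5
```

Because only digits are counted, separators such as "/" keep their position in the masked value.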
3. Using REGEX: More options to Obfuscate data
If you need to obfuscate a specific number of digits (other than 5 or 12), a regular expression is an option. Regular expression (regex) support is available as part of the "find and replace" feature of a column. Regex is a sophisticated feature with many use cases and is not limited to masking. You can read the following blog post (link) to learn more about the regex feature.
Using REGEX | Mask all characters
- In String to replace use “.”
- In New String use “*”
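To try this replacement outside the product, here is a minimal Python sketch. It assumes the find-and-replace regex behaves like Python's `re` module for this simple pattern, which may not hold in general; the function name `mask_all` and the sample values are made up for illustration.

```python
import re


def mask_all(value: str) -> str:
    # "." matches any single character (except a newline), so each
    # character is replaced by one "*" and the original length stays visible.
    return re.sub(r".", "*", value)


for sample in ["Smith", "12/2015"]:
    print(sample, "->", mask_all(sample))
```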
Using REGEX | Mask all characters and show same number of characters in each row
In the above example, the number of characters in each row can be guessed by counting the number of * characters. If you do not want to reveal the length of the string, use the following expression:
- In String to replace use “^.*$”
- In New String use “*****”
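The fixed-length mask can be sketched in Python as follows, again assuming the product's regex engine treats `^.*$` the way Python's `re` module does. The function name `mask_fixed` is hypothetical.

```python
import re


def mask_fixed(value: str, mask: str = "*****") -> str:
    # "^.*$" matches the whole value as one match, so the entire value is
    # replaced by the same mask regardless of its original length.
    return re.sub(r"^.*$", mask, value)


for sample in ["Al", "Smith", "Washington"]:
    print(sample, "->", mask_fixed(sample))
```

Every row ends up with the same five asterisks, so the mask no longer hints at the length of the original value.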
Using REGEX | Mask first n Characters
If you want to mask only the first 4 characters, use the following expression:
- In String to replace use “^.{1,4}”. Replace 4 with any number you want.
- In New String use “****”. Note: If you are replacing 4 characters use ****.
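As a rough illustration, the same pattern can be parameterised in Python. This is a sketch under the assumption that the product's regex engine matches `^.{1,n}` as Python's `re` module does; `mask_first` is a hypothetical helper name.

```python
import re


def mask_first(value: str, n: int = 4) -> str:
    # "^.{1,n}" matches between 1 and n characters at the start of the value;
    # the match is replaced by exactly n asterisks.
    pattern = rf"^.{{1,{n}}}"
    return re.sub(pattern, "*" * n, value)


print(mask_first("4111111111111111"))  # ****111111111111
print(mask_first("ab"))                # ****
```

Note that a value shorter than n characters still receives n asterisks, because the replacement string has a fixed length.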
Using REGEX | Masking last n characters
- In String to replace use “.{1,3}$”. Replace 3 with any number you want.
- In New String use “***”. Note: If you are replacing 3 characters use ***.
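A matching Python sketch for the trailing case, under the same assumption that the product's regex engine behaves like Python's `re` module here; `mask_last` is a hypothetical helper name.

```python
import re


def mask_last(value: str, n: int = 3) -> str:
    # ".{1,n}$" matches between 1 and n characters at the end of the value;
    # the match is replaced by exactly n asterisks.
    pattern = rf".{{1,{n}}}$"
    return re.sub(pattern, "*" * n, value)


print(mask_last("Smith"))  # Sm***
print(mask_last("ab"))     # ***
```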
Summary
The obfuscation feature is highly useful to analysts who want to mask data quickly without resorting to complicated techniques. Oracle Analytics offers several mechanisms to meet this need.