Policy-based Pre-Processing in Hadoop
Yi Cheng, Christian Schaefer
This pre-processor can perform anonymization, filtering, aggregation, encryption, and other modifications to sensitive data before the data is given to an application. By integrating policy-based pre-processing into the MapReduce framework, privacy and security mechanisms can be applied on a per-application basis, which provides greater flexibility and efficiency.
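To make the idea of per-application pre-processing concrete, the following is a minimal sketch in plain Python rather than the Hadoop API. It assumes a hypothetical policy format (a mapping from field name to one of the actions keep, drop, or pseudonymize) and hypothetical application names; none of these names come from the system described here. The sketch only illustrates that each record is transformed according to the requesting application's policy before the map function sees it.

```python
import hashlib

# Hypothetical per-application policies: field name -> action.
# The policy format and the application names are assumptions for illustration.
POLICIES = {
    "billing-report": {"user_id": "pseudonymize", "location": "drop", "amount": "keep"},
    "traffic-stats": {"user_id": "drop", "location": "keep", "amount": "drop"},
}


def pseudonymize(value, salt="demo-salt"):
    """Replace a value with a truncated, salted SHA-256 digest."""
    digest = hashlib.sha256((salt + str(value)).encode("utf-8")).hexdigest()
    return digest[:16]


def preprocess(record, policy):
    """Apply a field-level policy to one record before it reaches the mapper.

    Fields with no policy entry, or with an unrecognized action, are omitted,
    so the sketch denies access by default.
    """
    out = {}
    for field, value in record.items():
        action = policy.get(field, "drop")
        if action == "keep":
            out[field] = value
        elif action == "pseudonymize":
            out[field] = pseudonymize(value)
    return out


def run_map(app_id, records, map_fn):
    """Run map_fn over records, pre-processing each one with the app's policy."""
    policy = POLICIES[app_id]
    results = []
    for record in records:
        results.extend(map_fn(preprocess(record, policy)))
    return results


if __name__ == "__main__":
    data = [{"user_id": "alice", "location": "Stockholm", "amount": 10}]
    # The same input yields different views for different applications.
    print(run_map("traffic-stats", data, lambda r: [r]))
    print(run_map("billing-report", data, lambda r: [r]))
```

Because the policy is looked up per application, the same stored data can be exposed with different levels of detail without maintaining a separately sanitized copy for each consumer.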