Security and Privacy Considerations for the Big Data Lake with Robert Geiger
5:30-6:30pm Pizza and Networking
6:30-8:00pm Talk and Q&A
8:00-8:30pm Wind down
As Big Data moves more into the mainstream, security and privacy issues are coming to the forefront. Large amounts of data, from many sources and, of various content and quality are being combined into a "Data Lake" that may be defined by more than one system of distributed technologies, most commonly centered around HDFS/Hadoop. Protecting this data involves many aspects such as: who can access given pieces of data, protection of data within the cloud and on disk, basic and advanced Personally Identifiable Information (PII) protection, protection of data during transfer in/out, removal of temporary and expired data, and managing the disparate interfaces and capabilities in the Big Data ecosystem. Here we review the issues and some of the technologies being developed within the community and by vendors to secure and manage the data in the emerging data lake.
About the speaker:
Robert Geiger has been an Architect in the PHD team at Pivotal Inc. since April 2013, working in the areas of data security and Hadoop as a service in the cloud. Before joining Pivotal Robert was VP Engineering and a lead developer at Translattice Inc., a start-up offering a masterless geographically distributed PostgreSQL database. Previous to that he was a Sr. Director at Symantec leading an ~80 person team developing the technology acquired from Recourse, where he was a lead developer and director of engineering. While at Recourse Robert worked to develop and productize Recourse's distributed high speed intrusion detection and event correlation technology. Prior to Recourse Robert was at Motorola Solutions Research for 10 years working variously on MPP analysis of electromagnetic fields in circuits, mobile IP, mobile cryptography and mobile e-commerce. He has a Masters in EE from the University of Illinois, Urbana-Champaign, and a BSEE from SUNY.