- Semantic Data & ML DevOps; A Cybersecurity Perspective on Adversarial ML
Talk 1: Semantic Data and ML DevOps : An Industrial IoT case study Speaker: Semih Korkmaz (Datagraph) Abstract: ML model development and application faces challenges addressing understandability, debugging, configuration management, and operational support. This talk explores the data strategies that incorporate semantic data, data provenance, schema mutability and the relationship to effective ML development, deployment and operational practices by providing a reference implementation in an industrial IoT application. Bio: Semih has almost a decade of experience designing and developing machine learning solutions including recommendation engines for Vodafone Mobile TV systems and Industry 4.0 applications such as reinforcement learning models for assembly tasks at Arcelik/Beko. His edge analytics projects on Siemens Systems have become accepted as a ML model deployment strategy at several manufacturing locations. Formerly, Semih was a lecturer at German Research Center for Artificial Intelligence, and a researcher Max Planck Institute. -- Talk 2: Adversarial Machine Learning: A Cybersecurity Perspective Speaker: Amit Kushwaha Abstract: Security and Privacy issues need no introduction. But how exactly is this affecting the field of Machine Learning? This is what this talk will cover. We first expose the attack surface of systems deploying machine learning. We then describe how an attacker may force models to make wrong predictions with very little information about the victim. One such attack can be biometric recognition where fake biometric traits may be exploited to impersonate a legitimate user. We demonstrate that these attacks are practical against existing machine learning as a service platform. Towards the end, we will discuss current research to defend models from such attacks. Bio: Amit Kushwaha is a Python Backend Engineer in the Pricing and Forecasting Team of Zalando. He works on large-scale Optimal Discount Recommendation. He worked earlier as an ML Engineer in Zomato. His interests are in Deep Learning, Recommendation Systems, NLP and Data Engineering. He works with the Tensorflow, Keras, Pyspark, Airflow, Luigi and Pandas. He dreams to pursue AI as an independent researcher in future.
- Augmentation instead of Regularization & Autonomous Driving
Talk 1: Data augmentation instead of explicit regularization Speaker: Alex Hernández-García Abstract: Explicit regularization techniques, such as weight decay and dropout, are the standard and most popular ways of improving the generalization of CNNs. However, these techniques blindly reduce the effective capacity of the model and, importantly, have very sensitive hyper-parameters that require specific fine-tuning. Furthermore, they are used, unquestioned, in combination with other techniques from the "machine learning toolbox", such as SGD, normalization, convolutional layers or data augmentation, which also provide implicit regularization. Little is known about the interactions among these techniques. In this talk, I will present the results of systematically contrasting data augmentation and explicit regularization on different architectures and object recognition data sets. Data augmentation, unlike explicit regularization, does not reduce the capacity of the model and does not require fine-tuning of hyper-parameters. Besides, we have recently shown that models trained with heavier data augmentation learn more similar representations to those measured in the human visual cortex. In sum, I will show how replacing weight decay and dropout by data augmentation can safely free us from the hassle of fine-tuning sensitive hyper-parameters, potentially achieve better performance and learn more biologically plausible representations. Bio: Alex Hernández-García is a last-year PhD candidate at the Institute of Cognitive Science of the University of Osnabrück. After completing his M.Sc. at the University Carlos III of Madrid, Spain, he moved in 2016 to Berlin to start a PhD on biologically-inspired machine learning, with a Marie Sklodowska-Curie ITN grant. Although his main background is on machine learning and computer vision, he has an interdisciplinary profile and interests in other fields such as computational neuroscience as reflected by his internships at the Spinoza Centre for Neuroimaging in Amsterdam and the Cognition and Brain Sciences Unit of the University of Cambridge. His paper "Further advantages of data augmentation on convolutional neural networks" recently won the Best Paper Award at the International Conference on Artificial Neural Networks, ICANN. - Talk 2: Tackling autonomous driving with a single neural network Speaker: Markus Hinsche Abstract: Deep networks can be trained on demonstrations of human driving to learn to follow roads and avoid obstacles. This is possible with a single end-to-end network learning all the parts of the driving at once. I will give a short introduction to autonomous driving stacks and guide you through the implemention of this network which was introduced in the paper "End-to-end Driving via Conditional Imitation Learning". We open-sourced our implementation (https://github.com/merantix/imitation-learning) and wrote a Medium post (https://medium.com/merantix/journey-from-academic-paper-to-industry-usage-cf57fe598f31). Bio: Markus Hinsche is a Software Engineer working on Machine Learning at Merantix and one of the rare breed of people actually from the Berlin area. He is eager to explore new topics every day. To satisfy this hunger for the unknown, Markus worked at various startups after receiving his Master's degree in IT Systems Engineering at Hasso Plattner Institute in Potsdam.