Universal CNN Accelerator Intended for Edge-Based AI Inference


Details
This is the monthly Meetup of the local tinyML Meetup group in Germany.
Registration link to join the event: https://us02web.zoom.us/webinar/register/1716167003406/WN_NeIrLOHCThaKzaOnMOm1LQ
Once registered, you will receive an email with a link and dial-in information for the Zoom teleconference, which you can also add to your calendar.
"Universal CNN Accelerator Intended for Edge-Based AI Inference"
Speaker: Rastislav Struharik, Full Professor at University of Novi Sad
Deep learning models, and particularly Convolutional Neural Networks (CNNs), are currently among the most intensively and widely used machine learning predictive models. CNNs are not a new concept, but after recent breakthrough applications in image processing and speech recognition, they have returned to the focus of academia and industry. Today, different types of CNNs are employed in a wide range of applications, from autonomous driving and medical applications to playing complex games. In many of these application domains, CNNs are now able to exceed human levels of performance.
However, the superior accuracy of CNNs comes at a high cost in computational and storage complexity. State-of-the-art CNNs are described by hundreds of millions of parameters and require billions of computations to classify a single input instance. It is highly likely that future CNNs will be even larger and deeper, will process larger input instances requiring even more computations per instance, and will be used to perform more intricate classification tasks at faster speeds, increasingly in real time and under low-power operating conditions. Because of this, careful selection of an appropriate computing platform for implementing CNN-based applications is of great importance. This becomes even more important if we are to deploy CNNs in edge-based applications.
In this talk we will present the hardware options available for implementing CNN acceleration on an edge device and discuss their strengths and weaknesses. We will also present the IDS DeepOcean core, an FPGA-based CNN hardware accelerator intended for edge-based CNN inference. We will discuss the basic operating principles of the DeepOcean core, present some use cases, and compare its performance to that of several competing solutions.
Rastislav Struharik is a full professor at the Department of Power, Electronics and Telecommunications, Faculty of Technical Sciences, University of Novi Sad, Serbia. He received his PhD in Electronics in 2009, in the area of hardware acceleration of machine learning algorithms. During his academic career he has published more than 35 papers in international journals and conferences, mainly focusing on the hardware acceleration of machine learning algorithms, such as Decision Trees, Support Vector Machines, Artificial Neural Networks, Convolutional Neural Networks, and Ensemble Classifiers, targeting both learning and inference algorithms. For the past three years he has also been working as the chief architect of IDS's own FPGA IP core technology for hardware acceleration of Convolutional Neural Networks, intended for edge AI applications.
We encourage you to register early, since online broadcast capacity may be limited.
Note: tinyML Talks slides and videos will be available on the tinyML website and tinyML YouTube Channel afterwards, for those who missed the live session. Please take a moment and subscribe to the YouTube channel today: https://www.youtube.com/tinyML
