DataTalks #25: Productizing DS w/Kubeflow & Kale + Translating Musical Genres
Details
DataTalks #25: Productizing DS w/ Kubeflow + Translating Music Genres
Our 25th DataTalks meetup is done in cooperation with Team8, and will be held online. We'll talk about migrating Data Science research
to production using Kubernetes, Kubeflow pipelines & Kale, and translation of music genres and voices.
๐ญ๐ผ๐ผ๐บ ๐น๐ถ๐ป๐ธ: https://zoom.us/j/92086149175?pwd=ZzZ1dmlmaERLYlphUC9jVGFjSlU1QT09
๐๐ด๐ฒ๐ป๐ฑ๐ฎ:
๐ต 18:00 - 18:05 - Introduction - Tom Sela, Director of Research at Team8
๐ถ 18 :05 - 18:50 - Migrating Data Science Research to Production Using Kubernetes, Kubeflow Pipelines and Kale - Amit Ripshtos, Senior Software Engineer at noogata
๐ด 18:50 - 19:35 - Translation of Music Genres and Voices - Adam Polyak, AI Research Engineer at Facebook
---------------------
๐ ๐ถ๐ด๐ฟ๐ฎ๐๐ถ๐ป๐ด ๐๐ฎ๐๐ฎ ๐ฆ๐ฐ๐ถ๐ฒ๐ป๐ฐ๐ฒ ๐ฅ๐ฒ๐๐ฒ๐ฎ๐ฟ๐ฐ๐ต ๐๐ผ ๐ฃ๐ฟ๐ผ๐ฑ๐๐ฐ๐๐ถ๐ผ๐ป ๐จ๐๐ถ๐ป๐ด ๐๐๐ฏ๐ฒ๐ฟ๐ป๐ฒ๐๐ฒ๐, ๐๐๐ฏ๐ฒ๐ณ๐น๐ผ๐ ๐ฃ๐ถ๐ฝ๐ฒ๐น๐ถ๐ป๐ฒ๐ ๐ฎ๐ป๐ฑ ๐๐ฎ๐น๐ฒ - ๐๐บ๐ถ๐ ๐ฅ๐ถ๐ฝ๐๐ต๐๐ผ๐, ๐ฆ๐ฒ๐ป๐ถ๐ผ๐ฟ ๐ฆ๐ผ๐ณ๐๐๐ฎ๐ฟ๐ฒ ๐๐ป๐ด๐ถ๐ป๐ฒ๐ฒ๐ฟ ๐ฎ๐ ๐ป๐ผ๐ผ๐ด๐ฎ๐๐ฎ
At noogata, a crucial part of delivering an end-to-end data science / AI based solution involves research and migrating to a finished product. We found out quickly that the researcher's results are not suitable for production use out-of-the-box. In order to 'productize' a solution, a data-processing pipeline needs to be established, taking research code from Jupyter notebooks and arranging it in a processing pipeline we can easily repeat and reproduce.
Our main technologies around our work are Kubernetes (container orchestration infrastructure) and Kubeflow pipelines (Workflow engine that runs natively on Kubernetes), as our data pipelines (Various ETLs, model training) need to scale up and down quite a bit, and we want things to work on any infrastructure.
In this talk, I will describe the process we built for 'productionizing' research projects using Kubernetes, Kubeflow pipelines and tools like Kale. I will explain how we use Kubeflow pipelines in the research phase and how it works for production as well, and I will deep dive into its benefits and disadvantages.
---------------------
๐ง๐ฟ๐ฎ๐ป๐๐น๐ฎ๐๐ถ๐ผ๐ป ๐ผ๐ณ ๐ ๐๐๐ถ๐ฐ ๐๐ฒ๐ป๐ฟ๐ฒ๐ ๐ฎ๐ป๐ฑ ๐ฉ๐ผ๐ถ๐ฐ๐ฒ๐ - ๐๐ฑ๐ฎ๐บ ๐ฃ๐ผ๐น๐๐ฎ๐ธ, ๐๐ ๐ฅ๐ฒ๐๐ฒ๐ฎ๐ฟ๐ฐ๐ต ๐๐ป๐ด๐ถ๐ป๐ฒ๐ฒ๐ฟ ๐ฎ๐ ๐๐ฎ๐ฐ๐ฒ๐ฏ๐ผ๐ผ๐ธ
In this talk, we will present two methods:
i) A method for translating music across musical instruments and styles.
ii) A wav-to-wav method for converting between speakersโ voices, without relying on text.
The first is based on unsupervised training of a multi-domain WaveNet autoencoder that is trained end-to-end on waveforms. Employing a diverse training dataset and large net capacity, allows us to translate also from musical domains that were not seen during training.
The second method is based on an encoder-decoder architecture, where the encoder is pre-trained for the task of Automatic Speech Recognition (ASR), and a multi-speaker waveform decoder trained to reconstruct the original signal. The modularity of our approach, which separates the target voice generation from the Text To Speech (TTS) module, enables the customization of existing TTS services.
---------------------
๐ญ๐ผ๐ผ๐บ ๐น๐ถ๐ป๐ธ: https://zoom.us/j/92086149175?pwd=ZzZ1dmlmaERLYlphUC9jVGFjSlU1QT09
