Skip to content

Details

DataTalks #25: Productizing DS w/ Kubeflow + Translating Music Genres

Our 25th DataTalks meetup is done in cooperation with Team8, and will be held online. We'll talk about migrating Data Science research
to production using Kubernetes, Kubeflow pipelines & Kale, and translation of music genres and voices.

๐—ญ๐—ผ๐—ผ๐—บ ๐—น๐—ถ๐—ป๐—ธ: https://zoom.us/j/92086149175?pwd=ZzZ1dmlmaERLYlphUC9jVGFjSlU1QT09

๐—”๐—ด๐—ฒ๐—ป๐—ฑ๐—ฎ:
๐Ÿ”ต 18:00 - 18:05 - Introduction - Tom Sela, Director of Research at Team8
๐Ÿ”ถ 18 :05 - 18:50 - Migrating Data Science Research to Production Using Kubernetes, Kubeflow Pipelines and Kale - Amit Ripshtos, Senior Software Engineer at noogata
๐Ÿ”ด 18:50 - 19:35 - Translation of Music Genres and Voices - Adam Polyak, AI Research Engineer at Facebook

---------------------

๐— ๐—ถ๐—ด๐—ฟ๐—ฎ๐˜๐—ถ๐—ป๐—ด ๐——๐—ฎ๐˜๐—ฎ ๐—ฆ๐—ฐ๐—ถ๐—ฒ๐—ป๐—ฐ๐—ฒ ๐—ฅ๐—ฒ๐˜€๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต ๐˜๐—ผ ๐—ฃ๐—ฟ๐—ผ๐—ฑ๐˜‚๐—ฐ๐˜๐—ถ๐—ผ๐—ป ๐—จ๐˜€๐—ถ๐—ป๐—ด ๐—ž๐˜‚๐—ฏ๐—ฒ๐—ฟ๐—ป๐—ฒ๐˜๐—ฒ๐˜€, ๐—ž๐˜‚๐—ฏ๐—ฒ๐—ณ๐—น๐—ผ๐˜„ ๐—ฃ๐—ถ๐—ฝ๐—ฒ๐—น๐—ถ๐—ป๐—ฒ๐˜€ ๐—ฎ๐—ป๐—ฑ ๐—ž๐—ฎ๐—น๐—ฒ - ๐—”๐—บ๐—ถ๐˜ ๐—ฅ๐—ถ๐—ฝ๐˜€๐—ต๐˜๐—ผ๐˜€, ๐—ฆ๐—ฒ๐—ป๐—ถ๐—ผ๐—ฟ ๐—ฆ๐—ผ๐—ณ๐˜๐˜„๐—ฎ๐—ฟ๐—ฒ ๐—˜๐—ป๐—ด๐—ถ๐—ป๐—ฒ๐—ฒ๐—ฟ ๐—ฎ๐˜ ๐—ป๐—ผ๐—ผ๐—ด๐—ฎ๐˜๐—ฎ

At noogata, a crucial part of delivering an end-to-end data science / AI based solution involves research and migrating to a finished product. We found out quickly that the researcher's results are not suitable for production use out-of-the-box. In order to 'productize' a solution, a data-processing pipeline needs to be established, taking research code from Jupyter notebooks and arranging it in a processing pipeline we can easily repeat and reproduce.

Our main technologies around our work are Kubernetes (container orchestration infrastructure) and Kubeflow pipelines (Workflow engine that runs natively on Kubernetes), as our data pipelines (Various ETLs, model training) need to scale up and down quite a bit, and we want things to work on any infrastructure.

In this talk, I will describe the process we built for 'productionizing' research projects using Kubernetes, Kubeflow pipelines and tools like Kale. I will explain how we use Kubeflow pipelines in the research phase and how it works for production as well, and I will deep dive into its benefits and disadvantages.

---------------------

๐—ง๐—ฟ๐—ฎ๐—ป๐˜€๐—น๐—ฎ๐˜๐—ถ๐—ผ๐—ป ๐—ผ๐—ณ ๐— ๐˜‚๐˜€๐—ถ๐—ฐ ๐—š๐—ฒ๐—ป๐—ฟ๐—ฒ๐˜€ ๐—ฎ๐—ป๐—ฑ ๐—ฉ๐—ผ๐—ถ๐—ฐ๐—ฒ๐˜€ - ๐—”๐—ฑ๐—ฎ๐—บ ๐—ฃ๐—ผ๐—น๐˜†๐—ฎ๐—ธ, ๐—”๐—œ ๐—ฅ๐—ฒ๐˜€๐—ฒ๐—ฎ๐—ฟ๐—ฐ๐—ต ๐—˜๐—ป๐—ด๐—ถ๐—ป๐—ฒ๐—ฒ๐—ฟ ๐—ฎ๐˜ ๐—™๐—ฎ๐—ฐ๐—ฒ๐—ฏ๐—ผ๐—ผ๐—ธ

In this talk, we will present two methods:
i) A method for translating music across musical instruments and styles.
ii) A wav-to-wav method for converting between speakersโ€™ voices, without relying on text.

The first is based on unsupervised training of a multi-domain WaveNet autoencoder that is trained end-to-end on waveforms. Employing a diverse training dataset and large net capacity, allows us to translate also from musical domains that were not seen during training.

The second method is based on an encoder-decoder architecture, where the encoder is pre-trained for the task of Automatic Speech Recognition (ASR), and a multi-speaker waveform decoder trained to reconstruct the original signal. The modularity of our approach, which separates the target voice generation from the Text To Speech (TTS) module, enables the customization of existing TTS services.

---------------------

๐—ญ๐—ผ๐—ผ๐—บ ๐—น๐—ถ๐—ป๐—ธ: https://zoom.us/j/92086149175?pwd=ZzZ1dmlmaERLYlphUC9jVGFjSlU1QT09

You may also like