• DSF Meetup with KNIME

    Midcity Place

    Join Data Science Festival - London, in partnership with KNIME, on July 8th for an evening on Guided Analytics: developing predictive models on the fly by leveraging automated machine learning and interactive dashboard components in custom Guided Analytics applications tailored to your business needs. Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Event at KNIME on July 8th 2019; the ballot will be drawn on the 4th July 2019. Those randomly selected will then be e-mailed a Universe ticket for the event, with the joining details. If you get an allocated Universe ticket, please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. Tickets are non-transferable. PLEASE NOTE REGISTERING ON MEETUP DOES NOT GUARANTEE YOU ENTRY TO THIS EVENT. Please click here to apply for a ticket: https://www.datasciencefestival.com/event/dsf-meetup-with-knime/ SCHEDULE 6:00pm: Doors open 6:30pm: Paolo Tamagnini 7:15pm: Break 7:45pm: Rob Blanford 8:30pm: Networking 9:00pm: Close Paolo Tamagnini - Data Scientist at KNIME Summary: In recent years, a wealth of tools has appeared that automate the machine learning cycle inside a black box. At KNIME we take a different stance. Automation should not result in black boxes, as modern data science should allow automation and interaction to be combined flexibly into a more transparent solution. Paolo will show how to build a more transparent application of this kind for automated machine learning using KNIME Software. It will have an input user interface to control the settings for data preparation, model training (e.g. using deep learning, random forest, etc.), hyperparameter optimization, and feature engineering. The trained models will be shown at the end in an interactive dashboard that visualizes the results with model interpretability techniques.
Bio: Paolo Tamagnini works as a Data Scientist for KNIME in the evangelism team in Berlin. After graduating with a Master's Degree in Data Science at Sapienza University of Rome, Italy, Paolo gathered research experience at New York University in machine learning interpretability and visual analytics tools. Since working for KNIME, Paolo has run various workshops in the US and Europe and has developed a number of reusable guided analytics applications for automated machine learning and data exploration. Rob Blanford - Data Scientist at Atos Summary: Rob will share his experiences with working with KNIME as a consultant. His talk will address first how KNIME can help large organizations in connecting different kinds of data experts. Furthermore Rob will show a KNIME workflow from an on-going client case study about social media analysis. Finally he will show a prototype of how KNIME can integrate with other software libraries. In fact Rob will show how Blender3D, an open source application for creation and animation of 3D models, can integrate via Python to control a 3D model from a KNIME workflow. Bio: Rob Blanford is a Senior Consultant and Data Scientist, a member of the UK&I Analytics team at Atos. Taking an unusual path into Data Science, he graduated from Durham University with a degree in Psychology and Philosophy, gained experience in industry before completing an MBA at Cranfield University. Now he works across industries with Atos, using KNIME, R and Python to improve people’s access to data, improve the quality of information and create analyses to support expert decisions.

    25
  • DSF Startup Showcase with Secret Escapes

    120 Holborn

    Join Data Science Festival - London in partnership with Secret Escapes. On June 11th, we will be featuring 6 new and upcoming companies at our Start-up Showcase. Come and hear how these new companies use DS to solve real-world problems, the issues their teams have encountered, and the mistakes and successes you should look out for when you are starting your own projects. Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Event at Secret Escapes on June 11th 2019; the ballot will be drawn on the 4th June 2019. Those randomly selected will then be e-mailed a Universe ticket for the event, with the joining details. If you get an allocated Universe ticket, please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. Tickets are non-transferable. PLEASE NOTE REGISTERING ON MEETUP DOES NOT GUARANTEE YOU ENTRY TO THIS EVENT. Please click here to apply for a ticket: https://www.datasciencefestival.com/event/dsf-startup-showcase-with-secret-escapes/ SCHEDULE 6.00pm doors open [masked]:15pm talk - 3 Short Sharp Lightning Talks - Secret Escapes - Depop - Wefarm 7:15-7:45 pm - Refreshments 7:45-8:30pm talk - 3 Short Sharp Lightning Talks - Benevolent AI - OneSub - WinningMinds.ai 8:[masked]pm - Close Talk 1: Ross Gray - Secret Escapes challenges scaling Airflow to running hundreds of dynamically generated DAGs. Talk 2: Clemence J Burnichon - Depop QuickSearch recommendations Talk 3: Rob Stanley - Data Science in Languages I Don’t Understand Talk 4: Jinwoo Leem - Harnessing data to revolutionise drug discovery Talk 5: Jim Morrison - Breaking the Echo Chamber - AI versus Human Stupid. Talk 6: Dimi Masaouti - People Data for Good For full talk summaries, please go to: https://www.datasciencefestival.com/event/dsf-startup-showcase-with-secret-escapes/

    12
  • Data Science Festival Mainstage Day 2019

    Skillsmatter (CodeNode)

    Data Science Festival Mainstage (Ballot ticket only) All tickets have now been allocated to this event. If you have not received your universe ticket via e-mail you have been unlucky this time around. Please note, you will need to show ID on arrival so please have that ready with your ticket. Tickets can be digital or printed. Anyone without a ticket will not be able to enter the venue, doors for entry open at 8:15 AM. Please arrive by 9 AM to guarantee entry. We have a dropout rate so have allocated more tickets than seats. Latecomers may not be guaranteed entry when we reach capacity. Registering on Meetup does not give you access to this event. BALLOT TICKETS ARE NOW OPEN HERE: https://2019.london.datasciencefestival.com/event/data-science-festival-mainstage-day-free/ REGISTERING ON MEETUP DOES NOT GIVE YOU ACCESS TO THIS EVENT PLEASE FOLLOW THE LINK ABOVE. Please read how the ticketing works here: https://datasciencefestival.com/dsf-tickets-101/ Schedule and speaker details will be live on the DSF site in February. Register your interest today and apply for a ballot ticket. Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Mainstage day on Saturday 13th April 2019, the ballot will be drawn on the 5th April 2019. Those randomly selected will then be e-mailed tickets for the event, with the joining details. This free day will feature four-stream rooms with over 500 people attending. A day of lectures with over 40 top speakers. It will also feature our partners in our exhibitor section of the event. An entire day to learn, mingle and be inspired. For the people by the people! Speakers to date, please check www.datasciencefestival.com for up to date speaker information. • What to bring Please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. 
Tickets are non-transferable. • Important to know Doors open at 8:15AM

    2
  • DSF Friday Social- I went to work as an SQL programmer, and left as a hostage

    Join us for a night of Data Science Networking. Private event at Bounce Farringdon featuring an incredible talk by Peter Moore “I went to work as an SQL programmer, and left as a hostage”. There will be a relaxed vibe with drinks, food, music and table tennis. This relaxed evening will offer both the opportunity to wind down with fellow data enthusiasts after a full week of data events, as well as to network with co-attendees and speakers ahead of our Saturday conference. Please click here to apply for a ticket: https://2019.london.datasciencefestival.com/event/dsf-friday-social-and-networking-evening/ SCHEDULE: 6.15pm doors open – First drink sponsored by Data Idols 7:00pm – 8:00pm – Talk – Peter Moore 8:00pm – 10:00pm – Networking, food and drinks Peter Moore: I went to work as an SQL programmer, and left as a hostage Peter Moore was an SQL Server developer of several years’ experience prior to accepting a three-month assignment working on a financial system for the Iraqi government. He returned home two and a half years later, having been held hostage by an Iraqi militia for 946 days. In this talk, he describes the chain of events that led up to his capture, his experiences as a hostage, and his eventual release, with many fascinating stories about programming, IT, travel, and survival incorporated along the way. Peter still works in IT as a consultant and developer, as well as delivering training on Conduct after Capture, SERE, hostage taking, and hostile environments. Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Social & Networking at Bounce, in partnership with Data Idols, on April 12th 2019; the ballot will be drawn by the 1st April 2019. Those randomly selected will then be e-mailed tickets for the event, with the joining details.

    5
  • DSF Day 4 - Data Science and e-commerce hosted by the Trainline

    DSF Day 4 - Solving E-commerce challenges with DS hosted by Trainline All tickets have now been allocated to this event. If you have not received your Universe ticket via e-mail you have been unlucky this time around. Please note, you will need to show ID on arrival so please have that ready with your ticket. Tickets can be digital or printed. Anyone without a ticket will not be able to enter the venue; doors for entry will close at 6:45PM. Registering on Meetup does not give you access to this event. Join us for an evening of Data Science talks focused on solving E-commerce challenges. The evening will consist of two talks, one from our hosts Trainline, the other from Marco Bertetti, Data Scientist at Skyscanner. With applications and websites growing more complex than ever, Marco will discuss important new approaches to surface the right content to the right users as many times as possible. Please click here to apply for a ticket: https://2019.london.datasciencefestival.com/event/dsf-day-4-trainline/ Schedule: 6.00pm - Doors open 6.30pm - Marco Bertetti 7.15pm - Drinks food & networking 7.45pm – Dan Taylor & Sam Taylor 8.30pm - Networking 9.00pm - Close Address: 120 Holborn, London EC1N 2TD Marco Bertetti – Data Scientist at Skyscanner Summary: Contextual multi-armed bandits for widget optimization. Mobile applications and websites are growing more complex than ever, with new graphics, functionalities and widgets being added every day. In this ever-growing space it is important to develop new approaches to surface the right content to the right users as many times as possible. While A/B testing is a widely used and solid technique, it is not always viable when the number of possible choices is very large, as hundreds or thousands of tests would be required to find the best option for each situation. This talk will first provide an introduction to the multi-armed bandit problem. Then, a practical comparison between bandits and classic A/B testing will be shown.
The talk closes with a practical bandit implementation at Skyscanner. Dan Taylor - Data Engineering Manager Sam Taylor - Lead Machine Learning Engineer Summary: Enabling Real Time Data Science in E-Commerce. The architecture of modern E-commerce companies typically revolves around micro-services; this often causes a challenge for Data Scientists who need to pull data from disparate sources to carry out their day to day work. This pushes the typical 80% of time spent on data exploration closer to 100%, resulting in less time spent solving the customer problem. At Trainline, we have solved this using event sourcing and streaming technology. We will talk about how this is practically carried out from a data engineering to data product development perspective. We will then explore advancing this technology, giving the ability to put realtime data products into production and the hands of our customers. Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Event at Trainline on April 11th 2019; the ballot will be drawn on the 1st April 2019. Those randomly selected will then be e-mailed tickets for the event, with the joining details. If you get an allocated ticket, please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. Tickets are non-transferable. The Data Science Festival is the first of its kind as the only community-led, free to attend Data Science Festival in the UK.
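For readers new to the multi-armed bandit problem mentioned in Marco Bertetti's summary above, here is a minimal sketch of the basic idea. It is not the Skyscanner implementation and it is not contextual: it uses plain epsilon-greedy, a common textbook strategy, swapped in for simplicity. The function name `run_epsilon_greedy`, the three widgets and their click rates are all made up for illustration.

```python
import random


def run_epsilon_greedy(click_rates, n_rounds=5000, epsilon=0.1, seed=0):
    """Simulate showing one of several widgets per round (hypothetical setup)."""
    rng = random.Random(seed)
    n_arms = len(click_rates)
    shows = [0] * n_arms   # how often each widget was shown
    clicks = [0] * n_arms  # how often each widget was clicked

    for _ in range(n_rounds):
        if 0 in shows or rng.random() < epsilon:
            # Explore: pick a widget at random.
            arm = rng.randrange(n_arms)
        else:
            # Exploit: pick the widget with the best observed click rate.
            arm = max(range(n_arms), key=lambda a: clicks[a] / shows[a])
        shows[arm] += 1
        # Simulated user: clicks with the widget's true (hidden) probability.
        if rng.random() < click_rates[arm]:
            clicks[arm] += 1
    return shows, clicks


# Made-up click-through rates, unknown to the algorithm.
true_rates = [0.05, 0.10, 0.30]
shows, clicks = run_epsilon_greedy(true_rates)
for arm, (s, c) in enumerate(zip(shows, clicks)):
    print(f"widget {arm}: shown {s} times, clicked {c} times")
```

Unlike a fixed A/B split, most of the traffic here drifts towards the best-performing widget while the experiment is still running. A contextual bandit would additionally condition the choice on user or session features.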

    3
  • DSF Day 3 – Data Science Lightning Talks at Facebook

    1 Rathbone Square

    DSF Day 3 – Data Science Lightning Talks at Facebook All tickets have now been allocated to this event. If you have not received your universe ticket via e-mail you have been unlucky this time around. Please note, you will need to show ID on arrival so please have that ready with your ticket. Tickets can be digital or printed. Anyone without a ticket will not be able to enter the venue, doors for entry will close at 6:45PM. Registering on Meetup does not give you access to this event. Join Data Science Festival London, in partnership with Facebook this April. The evening will consist of 4 lightning talks and a panel discussion. Schedule: 6.15pm – Doors open 6:45pm- Lightning talks (4 x speakers: 10mins each) 7:30pm – Panel chat hosted by David Loughlan – Changing approaches and how DS is influencing the media industry. 8.15pm – Drinks, food and networking 9.00pm – Close Speaker 1: Hervé Schnegg – Lead Data Scientist – The Telegraph Speaker 2: Miriam Redi – Research Scientist – Wikimedia Foundation Speaker 3: Adi Masas – Data Scientist – Facebook Speaker 4: Evrim Altintas -Data Scientist at Facebook Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Event at Facebook on April 10th 2019, the ballot will be drawn on the 1st April 2019. Those randomly selected will then be e-mailed tickets for the event, with the joining details. If you get an allocated ticket, please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. Tickets are non-transferable. The Data Science Festival is the first of its kind as the only community-led, free to attend Data Science Festival in the UK.

    14
  • DSF Day 2 – Data Science to Production hosted by Zoopla

    DSF Day 2 – Data Science to Production hosted by Zoopla All tickets have now been allocated to this event. If you have not received your universe ticket via e-mail you have been unlucky this time around. Please note, you will need to show ID on arrival so please have that ready with your ticket. Tickets can be digital or printed. Anyone without a ticket will not be able to enter the venue, doors for entry will close at 6:45PM. Registering on Meetup does not give you access to this event. Join us at Zoopla for Day 2 of the Data Science Festival, where we will have an evening focused around taking data science projects to production. Our speakers, Jan and Miroslav, will dig into the real world challenges of implementing successful data science initiatives. With Gartner estimating that 85% of data projects fail, this is one not to be missed!! Please click here to apply for a ticket: https://2019.london.datasciencefestival.com/event/dsf-day-2-zoopla/ Schedule: 6.00pm - Doors open 6.30pm- Jan Teichmann 7.15pm - Drinks food & networking 7.45pm – Miroslav Batchkarov 8.30pm - Networking 9.00pm - Close Jan Teichmann - Senior Data Scientist at Zoopla Summary: Solving the real challenge of Data Science -- Productionisation -- with proven solutions straight from the front lines. Making data science a success is really hard with up to 85% of projects and initiatives around big data and data science failing according to Gartner. The reasons are complex but often misunderstood. What is so different about data science that it needs new approaches? A survey in 2016 concluded that 80% of data science is preparing and cleaning data (the infamous 80/20 rule). That survey of data scientists caught on and developed into the widely recognised problem statement for data science. Unfortunately! Because Productionisation of models is the TOUGHEST problem in data science. 
This talk will: - introduce the unique data science requirements and - introduce the Rendezvous Architecture, a proven solution to integrate data science and enterprise requirements in a harmonious way at scale. Miroslav Batchkarov - CTO & Co-founder at Teebly Summary: Gold standard data: lessons from the trenches. The first stage in a data science project is often to collect training data. However, getting a good data set is surprisingly tricky and takes longer than one expects. This talk describes our experiences in labelling gold-standard data and the lessons we learnt the hard way. We will present three case studies from natural language processing and discuss the challenges we encountered. Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Event at Zoopla on April 9th 2019; the ballot will be drawn on the 1st April 2019. Those randomly selected will then be e-mailed tickets for the event, with the joining details. If you get an allocated ticket, please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. Tickets are non-transferable. PLEASE NOTE REGISTERING ON MEETUP DOES NOT GUARANTEE YOU ENTRY TO THIS EVENT.

    15
  • DSF Day 1 - Data Science at Scale hosted by King

    178 Wardour St

    DSF Day 1 - Data Science at Scale hosted by King All tickets have now been allocated to this event. If you have not received your universe ticket via e-mail you have been unlucky this time around. Please note, you will need to show ID on arrival so please have that ready with your ticket. Tickets can be digital or printed. Anyone without a ticket will not be able to enter the venue, doors for entry will close at 6:45PM. Registering on Meetup does not give you access to this event. Join us for an evening of tech talks focused on Data Science at scale. Featuring two King speakers discussing real-world data science problems from the game analytics space. The evening will also include Rafah El-Khatib who joins us from the advanced analytics on machine learning applications department at ING. Rafah will discuss selecting predictive features to input into a model when you are dealing with data at scale. Please click here to apply for a ticket: https://2019.london.datasciencefestival.com/event/dsf-day-1-king/ Schedule: 6.00pm - Doors open 6.30pm- Rupali Singhal & Piergiorgio Calzi 7.15pm - Drinks food & networking 7.45pm – Rafah El-Khatib 8.30pm - Networking 9.00pm - Close Rafah El-Khatib - Data Scientist at ING Summary: Feature Selection Best Practices - LOFO and a Survey of Key Feature Importance Packages. Selecting predictive features to input into a model is key to ensuring that the input data is not noisy and is time-effective in cases where the original number of features or dataset are large. In this talk, I will present a survey of key feature importance packages and explain their strengths and weaknesses, and I will present an in-house open-source feature importance package called LOFO (leave-one-feature-out) and its fast approximation (FLOFO, or Fast LOFO). 
The LOFO importance calculates the importance of a set of features based on a metric of choice, for a model of choice, by iteratively removing each feature from the set, and evaluating the performance of the model, cross-validated, based on the chosen metric. Piergiorgio Calzi - Data Scientist Summary: Lesser known tips and tricks for Pandas. Any Python user performing data analysis has used Pandas at a certain point. A handful of functions can get you by and allow you to achieve surprisingly good results throughout the data preparation stage. However, and perhaps less often, we face more challenging data preparation tasks. This talk will highlight some of the less common pandas features. Not exactly a 101 but maybe a 102. Rupali Singhal - Marketing Analyst Summary: Data Rich Marketing at King Due to the popularity of Data Science Festival events, we are now allocating event tickets via a random ballot. Registering here enters you into the ticket ballot for the Data Science Festival Event at King on April 8th 2019, the ballot will be drawn on the 1st April 2019. Those randomly selected will then be e-mailed tickets for the event, with the joining details. If you get an allocated ticket, please bring a copy of your paper ticket or your ticket on your phone to the event to check in with your QR code. Tickets are non-transferable.
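The leave-one-feature-out procedure described in Rafah El-Khatib's summary above can be sketched in a few lines. This is a from-scratch illustration, not the API of the LOFO package itself: the nearest-centroid classifier, the accuracy metric, the fold scheme and the synthetic "signal"/"noise" data are all assumptions chosen to keep the example self-contained.

```python
import random
from statistics import mean


def fit_centroids(X, y):
    """Toy stand-in for the 'model of choice': one centroid per class."""
    centroids = {}
    for label in set(y):
        rows = [x for x, t in zip(X, y) if t == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, x):
    """Return the label of the closest centroid (squared Euclidean distance)."""
    def sq_dist(label):
        return sum((a - b) ** 2 for a, b in zip(x, centroids[label]))
    return min(centroids, key=sq_dist)


def cv_accuracy(X, y, n_folds=5):
    """Cross-validated accuracy, standing in for the 'metric of choice'."""
    scores = []
    for fold in range(n_folds):
        test_idx = set(range(fold, len(X), n_folds))
        X_train = [x for i, x in enumerate(X) if i not in test_idx]
        y_train = [t for i, t in enumerate(y) if i not in test_idx]
        model = fit_centroids(X_train, y_train)
        hits = sum(predict(model, X[i]) == y[i] for i in test_idx)
        scores.append(hits / len(test_idx))
    return mean(scores)


def lofo_importance(X, y, feature_names):
    """Importance of a feature = drop in CV score when that feature is removed."""
    baseline = cv_accuracy(X, y)
    importances = {}
    for j, name in enumerate(feature_names):
        X_reduced = [[v for k, v in enumerate(x) if k != j] for x in X]
        importances[name] = baseline - cv_accuracy(X_reduced, y)
    return importances


# Synthetic data: 'signal' separates the two classes, 'noise' does not.
rng = random.Random(0)
y = [i % 2 for i in range(100)]
X = [[10 * label + rng.gauss(0, 1), rng.gauss(0, 1)] for label in y]

importances = lofo_importance(X, y, ["signal", "noise"])
for name, value in importances.items():
    print(f"{name}: {value:+.3f}")
```

Removing the informative feature should cost a large share of the accuracy, while removing the noise feature should change the score very little. The model is retrained once per feature and fold, which is the cost the summary's fast approximation is meant to reduce.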

    12
  • DSF Meetup with Trainline

    120 Holborn

    Join Data Science Festival London, in partnership with Trainline this March. The evening will consist of talks on how Trainline use anomaly detection techniques to automatically detect fluctuations in demand, and on the role of the Data Science team in a commercial e-commerce business. Those randomly selected and approved will then be e-mailed tickets for the event. If you do not receive an approval e-mail from us by the 11th of March you have been unsuccessful in getting a ticket for this event. PLEASE NOTE REGISTERING ON MEETUP DOES NOT GUARANTEE YOU ENTRY TO THIS EVENT. Please click here to apply for a ticket: https://www.datasciencefestival.com/event/dsf-meetup-with-trainline/ SCHEDULE: 6:00pm: Guests arrive 6:30pm: Louisa Johns & Tim Williams 7:15pm - Break & Refreshments 7:45pm - Miriam Redi 8:30pm - Networking 9:00pm - Close Address: 120 Holborn, London EC1N 2TD Louisa Johns - Data Scientist at Trainline Bio: Louisa is a data scientist at Trainline. She has developed an in-depth knowledge of the UK rail network on her current project and is particularly interested in the on-time performance of trains on the network. Tim Williams - Data Scientist at Trainline Bio: Tim is a data scientist at Trainline and developed our anomaly detection system alongside other members of the team. In his spare time he loves a good game of “guess the anomaly”. Summary: Have you ever been stuck on a train with a crowd of football supporters when you weren’t one of them? Missed out on those cheap train tickets because they sold out so fast? In this talk we will review how we used anomaly detection techniques to automatically detect these fluctuations in demand. We will also discuss the role of the Data Science team in a commercial e-commerce business and how we determine how we work and what we work on.
Miriam Redi - Research Scientist at the Wikimedia Foundation and Visiting Research Fellow at King's College London Title: The Science of (Visual) Knowledge Equity Abstract: In this talk, we will see how computer vision and machine learning can support knowledge equity and help to break down the social, political, and technical barriers preventing people from accessing free knowledge. We will look at technologies designed to bridge content and verifiability gaps in Wikipedia. We will see how multimedia retrieval techniques can be used to break language barriers by visually enriching Wikimedia projects, and we will learn how science can promote knowledge equity in online and offline communities beyond Wikimedia spaces. Bio: Miriam Redi is a Research Scientist at the Wikimedia Foundation and Visiting Research Fellow at King's College London. Formerly, she worked as a Research Scientist at Yahoo Labs in Barcelona and Nokia Bell Labs in Cambridge. She received her PhD from EURECOM, Sophia Antipolis. She conducts research in social multimedia computing, working on fair, interpretable, multimodal machine learning solutions to improve knowledge equity.

    8
  • DSF Meetup with Zopa

    Zopa

    Join Data Science Festival London, in partnership with Zopa this February. The evening will consist of talks on building an Analytical Data Warehouse with Amazon Web Services and Evaluating feature importance in machine learning models using Shapley values. Those randomly selected and approved will then be e-mailed tickets for the event. If you do not receive an approval e-mail from us by the 22nd of February 2019 you have been unsuccessful in getting a ticket for this event. PLEASE NOTE REGISTERING ON MEETUP DOES NOT GUARANTEE YOU ENTRY TO THIS EVENT. Please click here to apply for a ticket: https://www.datasciencefestival.com/blog/event/live/2018/dsf-meetup-with-zopa/ SCHEDULE: 6:00pm: Guests arrive 6:30pm: Tadas Krisciunas 7:15pm - Break & Refreshments 7:45pm - Ross Young 8:30pm - Networking 9:00pm - Close Tadas Krisciunas - Data Scientist at Zopa Bio: Tadas is a data scientist at Zopa who is passionate about using machine learning and technology to bring more transparency and efficiency to retail finance. Prior to Zopa, he was an early employee of the fintech start-up Oodle Finance. He holds a master’s degree in Mathematics & Philosophy from the University of Oxford. Summary: The “interpretability-accuracy trade-off” is often discussed: deep learning, gradient boosted trees and other powerful machine learning methods can capture complex relationships in the data, but lack transparency and interpretability when compared to more traditional methods. In this talk, I’ll briefly review a few of the most popular techniques to measure feature importance in black-box models, with a highlight on a novel class of methods stemming from the game-theoretic concept of Shapley values. Ross Young - Data Engineer at Zopa Bio: Ross is a data engineer developing the analytical data warehouse at Zopa. He has a PhD from the University of Edinburgh in Experimental Particle Physics, a field that first spurred his interest in big data.
Prior to Zopa he worked at BAE Systems as a data analyst. Summary: An analytical data warehouse is a system that gathers data from a wide range of sources and consolidates it to inform decision-making in large organisations. In recent years, companies have been moving from traditional, on-premise architectures to cloud-based platforms such as Amazon Web Services. This can bring a number of advantages in cost, scalability, and performance. In this talk, I will introduce some architectural ideas that can be leveraged by an enterprise on AWS to build and monitor their entire, end-to-end ETL cycle.
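As background to the Shapley values mentioned in Tadas Krisciunas's summary above, the sketch below computes exact Shapley values for a tiny cooperative game by averaging each player's marginal contribution over all orderings. It is only an illustration of the game-theoretic definition, not the method presented in the talk. The feature names and the value function `toy_value` are invented, and practical feature-importance tools approximate this calculation because the exact version grows factorially with the number of features.

```python
from itertools import permutations


def shapley_values(players, value):
    """Exact Shapley values: average marginal contribution over all orderings."""
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        for p in order:
            with_p = coalition | {p}
            totals[p] += value(with_p) - value(coalition)
            coalition = with_p
    return {p: total / len(orderings) for p, total in totals.items()}


def toy_value(coalition):
    """Hypothetical 'model score' when only these features are available."""
    score = 0
    if "income" in coalition:
        score += 10
    if "age" in coalition:
        score += 4
    if "income" in coalition and "age" in coalition:
        score += 6  # interaction: the two features are worth more together
    return score  # 'postcode' contributes nothing in this toy game


features = ["income", "age", "postcode"]
phi = shapley_values(features, toy_value)
print(phi)  # {'income': 13.0, 'age': 7.0, 'postcode': 0.0}
```

The interaction bonus of 6 is split equally between the two features that create it, and the values add up to the score of the full feature set (20), which is the efficiency property that makes Shapley values attractive for attribution.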

    17