• Data Science meetup Crunch edition
    This is a free event, co-organised with the Budapest Big Data Meetup group: https://bit.ly/2RVLDgu We are going to start at 6pm, a bit earlier than usual, simply because we have more speakers for this special event. // GOOD TO KNOW If you are also interested in Crunch Conference, use BPDATASCIENCE coupon code to get $50 off at https://crunchconf.com/ // SCHEDULE 6:00pm - Doors open 6:30pm - Talks 8:00pm - Pizza and beers // ABSTRACTS Selling Seats and Pushing Pints with Data ——————— In this session we discuss how we use data to increase customer experience and incomes in our live entertainment business at AEG Europe. Going through two applications first we look at how we set up a fully automated machine learning process to target fans with the most relevant ticket offers using their purchasing and browsing history. In the second half of the talk we take a look at how we use data to improve our venue operations and increase food and beverage income. By taking digital records of fans' taste and interests we can predict their thirst and appetite prior to the event and serve them better on their big night out. by *Akos IZSO* Data Analytics Manager at AEG Europe Ákos Izsó is a Data Analytics Manager at AEG Europe. He works primarily on predictive models that help the live entertainment company to plan operations and increase sales and customer satisfaction. He earned a Master’s degree in Economics and Business Administration at the Corvinus University of Budapest and analysed customer data ever since then in the financial, telecommunications and entertainment sectors. Apache Gobblin: Your Swiss army knife for all data integration use-cases ————————— In this talk, we will introduce Apache Gobblin and its architecture which is used at LinkedIn for multiple data integration use-cases such as ingestion of 300+ billion Kafka events and data management of 250 PBs on HDFS. Then, we will explore the challenges of an evolving data ecosystem like multiple execution environments (single box, bare metal, hadoop or cloud), dynamic data routing, global throttling and how we can leverage Apache Gobblin to address them. by *Abhishek TIWARI* Staff Software Engineer at LinkedIn, Apache Gobblin PPMC / Committer Abhishek Tiwari is a Committer and PPMC member of Apache Gobblin (incubating). He is the Tech Lead for Data Integration Infrastructure at LinkedIn. Before joining LinkedIn, he had worked on building Amazon CloudSearch service at AWS, platform for Watson supercomputer at Nuance, Hadoop infrastructure at Yahoo, and web architecture for several million monthly users at AOL. The Mother of all Query Languages: SQL in Modern Times ——————————————————— Did you know the purely relational dogma of SQL was already abandoned in 1999? The last SQL standard that was limited to the relational idea was SQL-92. From SQL:1999 onwards, the SQL language was extended with non-relational operations and non-relational data structures. As much as this move was discussed at that time, it took decades until database vendors caught up with this idiomatic change. Many SQL users haven’t heard of it until today. The year 2018 finally marks the turning point. With the release of MySQL 8.0 all major SQL dialects finally support the most important non-relational concepts of SQL. This talk provides the big picture on the evolution of the SQL standard. You will see that SQL has changed as much as our requirements have changed over the past decades. by *Markus WINAND* Markus Winand is an independent author, trainer and consultant on all things SQL. His book “SQL Performance Explained” has just been translated into a fifth language. It can be purchased as a print edition, and it’s also available to read for free on his use-the-index-luke.com website. Markus is currently working on his second book centered on the idea that things have changed a lot since SQL-92. It is likewise available free of charge online (modern-sql.com).

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

  • Craft Conference Data Meetup
    We are getting some of the best data-specialists from Craft Conference to give a few talks here at our meetups right before they speak at the conference! This is a joint event with the Budapest Big Data Meetup group https://www.meetup.com/Big-Data-Meetup-Budapest/ --------------------------------------------------------------------------------------------- Building Event Driven Systems with Apache Kafka & Kafka Streams --------------------------------------------------------------------------------------------- Event Driven Systems come in many shapes and sizes from tiny functions that dip into an event stream right through to heavy, stateful streaming services. This talk examines the fundamentals of distributed logs and stateful stream processing, contrasting the event driven approach with REST/RPC-based methods. Then we’ll walk through a number of patterns for putting such systems together, evolving a typical monolith into a rich, globally deployed, event-driven ecosystem. *Ben Stopford* Ben is a technologist working in the Office of the CTO at Confluent Inc (the company behind Apache Kafka) where he’s worked on a wide range of projects, from implementing the latest version of Kafka’s replication protocol through to writing a book on Event Driven Systems. Before Confluent Ben led the design and build of a company-wide data service in a large financial services firm. His earlier career spanned projects at Thoughtworks and UK-based enterprise companies. He is the author of the book “Designing Event Driven Systems”, O’Reilly, 2018. Find out more at http://benstopford.com. ----------------------------------------------------------------------------------------------------- Talk Title Will be announced soon. ----------------------------------------------------------------------------------------------------- Abstract will be announced soon. * Chris Fregly * Chris Fregly is Founder and Research Engineer at PipelineAI, a Real-Time Machine Learning and Artificial Intelligence Startup based in San Francisco. He is also an Apache Spark Contributor, a Netflix Open Source Committer, founder of the Global Advanced Spark and TensorFlow Meetup, author of the O’Reilly Training and Video Series titled, "High Performance TensorFlow in Production." Previously, Chris was a Distributed Systems Engineer at Netflix, a Data Solutions Engineer at Databricks, and a Founding Member and Principal Engineer at the IBM Spark Technology Center in San Francisco. More talks will be announced soon.

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

    3 comments
  • Budapest meets Vienna
    Our friends from the Vienna Data Science Group and from Viennese Data Science companies are coming over for an evening. They are giving us a brief overview of what's going on in Vienna and we will take this time to investigate cooperation possibilities between the two cities. ------------------------------------------------------------------------------------------------------ GDPR - A technical view on GDPR ------------------------------------------------------------------------------------------------------ GDPR will be operative in end of May and effects heavily the BigData Landscape. During this talk Bernhard will provide some pitfalls and share his point of view how to address them. Bernhard Ortner, VDSG member - opinions are my own. https://www.linkedin.com/in/bortner/ Big Data enthusiast with a strong background on engineering in different areas such as finance, energy and governance. He was responsible for maintaining operational data lakes and developing proof-of-concepts for big data technologies. In his leisure time he enjoys various athletic activities like running, bicycling or sailing. Furthermore, he appreciates spending time with his family and friends. ------------------------------------------------------------------------------------------------------ What's going on regarding Data Science in Vienna? ------------------------------------------------------------------------------------------------------ Vienna is normally perceived like a conservative city, nonetheless, there is a very active and interesting community focus on Data Science. During this talk, we would explain the Data ecosystem in Vienna and opportunities for collaboration between Budapest and Vienna. Wolfgang Weidinger (Wulf), Chairman VDSG and Data Scientist freelance https://www.linkedin.com/in/wolfgang-weidinger/ Wulf: Data Science and Agile enthusiast, Entre-/Intrapreneur and avid climber. Maria Ines Plaza (Mari) VDSG Collaborations coordinator. https://www.linkedin.com/in/maria-ines-plaza-schwarck-9825962/ Mari is a professional with strong background in Business that loves data. She enjoys innovating in processes and business models through an effective use of the data. ------------------------------------------------------------------------------------------------------ Machine Learning and Medical Imaging in the Era of Precision Medicine ------------------------------------------------------------------------------------------------------ The talk will discuss the current state of machine learning in the context of clinical decision support and research. It will give examples highlighting challenges and possible roles of the interlinked advance in medical imaging and machine learning such as the identification of markers and the prediction of disease course and treatment response. Georg Langs, Medical University of Vienna - Lab Director, contextflow - Chief Scientist https://www.cir.meduniwien.ac.at/team/langs/

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

    3 comments
  • Data Science meetup April
    Dear Members! On our April meetup, we feature not one, not two, but THREE presentations, all from different companies. Automated text analysis and insight generation - Peter Szekeres (Founder, CEO at Neticle) ------------------------ Peter Szekeres, CEO of the leading Hungarian NLP company, Neticle; will talk about how they use NLP to analyze social media. They are working on a model which focuses on automated insight generation to save time for end users. Having a good and precise data and text analytics, queries and charts are not enough in many use-cases. Description engines and autodiagnosis combined with NLG make better data based solutions. Linkedin: https://www.linkedin.com/in/peter-szekeres-7b136541/ BigQuery and Airflow in concert - Balazs Mate (Data engineer, ingatlan.com) ------------------------ ingatlan.com is the biggest real-estate site of Hungary. They work with a lot of valuable data on how users use their website. It's not an ordinary challenge to analyze and prepare the data they have efficiently. In this talk, Balazs will give an overview of what they do and go into details about how they use Google BigQuery and Apache Airflow together to analyze their data. Linkedin: https://www.linkedin.com/in/balazsmate/ Agenda ================== 18:15 Doors open, pizza 18:45 - 19:50 Presentations! 20:00 Beers at apacuka! See you on 3rd April! ----------------------------------- EDIT: The presentation from Imre Szucs and Norbert Liki (Telenor) was cancelled due to illness. -----------------------------------

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

    13 comments
  • ML at Whitepages
    We are going to hear from Gyorgy Mora from Whitepages about ML for fraud detection. --About the Talk/Presenter -- Whitepages is a global leader in digital identity verification providing services for businesses and consumers. Gyuri is leading the machine learning efforts beyond the Identity Score product which helps merchants and other customers to fight fraud. Schedule: 6:00-6:30 Warmup, pizza, beers and chit-chat :) 6:45-7:30 Talks + Q&A 8:00 - Beers in Apacuka We hope to see you soon at Prezi House of Ideas!

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

  • Data Christmas 2017
    Join us for the last meetup of the year, where we look back at the most significant and interesting events of 2017. This meetup is jointly organized by the data meetups in Budapest. Current participants (in alphabetical order): BI Meetup (https://www.meetup.com/Budapest-BI-Meetup/events/245459960/), Big Data Meetup (https://www.meetup.com/Big-Data-Meetup-Budapest/events/245444973/), Data Science Meetup, PyData meetup (https://www.meetup.com/PyData-Budapest/events/245460002/), R meetup (https://www.meetup.com/Budapest-Users-of-R-Network/events/245454475/), R-Ladies (https://www.meetup.com/R-Ladies-Budapest/events/245460382/) Schedule: 18:30 Doors open, pizza 19:00 The evolution of the Data Community in Hungary - Máté Gulyás 19:30 6 facts you should know about R-Ladies -Windhager-Pokol Eszter 19:45 Here we R - Gergely Daróczi 20:10 Machine learning trends in 2017 - Szilárd Pafka 20:30 Afterparty! This is an English speaking event. Due to the parallelly organized, distributed nature of the meetup, RSVPs will be closed when the combined number of RSVPs from all participating meetups hits the limit.

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

    1 comment
  • CrunchConf warmup meetup & party
    This is a free event. Hi, let's get together for a Meetup and go out for a few beers after! First speaker: Anna Mandli Anna is an applied mathematician from Eötvös Loránd University, Budapest. Anna works at Bosch in a technology development team on data mining topics, while she continues her Ph.D. studies at the Hungarian Institute for Computer Science and Control (SZTAKI). Abstract: Data Mining in the Electronic Manufacturing Bosch as a leading automotive electronics supplier puts emphasis on Industry 4.0 applications. This presentation will outline the results of a data mining project, which aimed to reduce the fall out rate of an electronic part manufactured by Bosch. Analysis of the archive manufacturing data helped to reveal factors contributing to defects and to select the most informative diagrams to be visualized in an application. During this work we faced many challenges: the data is unbalanced, inhomogeneous and the measurements are often noisy. Moreover, data mining modeling was used for scrap rate prediction. A data-driven system was designed which can warn the machine operators if a critical increase in failure rate is expected. _____________________________________________ Second speaker: Sean Kross Sean formerly worked in the Johns Hopkins Data Science Lab where he and his colleagues developed The Data Science Specialization on Coursera.org. Sean is the author of Mastering Software Development in R, Developing Data Products, and The Unix Workbench. You can find him on Twitter @seankross. Abstract: My colleges and I saw the demand for data scientists ballooning and we decided to do something about it. In this talk, I will explain how the Johns Hopkins Data Science Lab leveraged the latest statistical, computational, and open source methods in order to create over a million new data scientists. _________________________________________________ Third speaker: Justin Bozonier Justin is the author of Test-Driven Machine Learning (published by Packt) and Lead Data Scientist in GrubHub's Financial Planning & Analytics group. The founding data scientist of GrubHub's split testing efforts, his team runs the company's experiment analysis platform, develops experiments and models to tune larger business operations, and data mines experiments and operational data to look for new business opportunities and value existing programs. Abstract: The Test-Driven Company Imagine if you came into work tomorrow and knew how much money your new product improvement made for your company. Imagine if you could come up with some crazy idea and prove it worked through rigorous experimentation. It's rare to work someplace that is that experiment-driven but we've achieved that at GrubHub. We'll review how experimentation started at GrubHub, how it grew to where it is today, and some lessons learned. By the time we're done, you'll wonder why anyone would do it any other way.

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

  • ML at Prezi's Research Lab and Data Products at Skyscanner
    ==== Live Stream: http://ustre.am/1e1aq Hi, We are having two promising talks this time: Finally, we got someone to speak from Prezi's secret research laboratory! ;) Agoston Nagy from Prezi’s Delight Research Lab will talk about visualizing data and finding similarity patterns in images with the help of Machine Learning. https://www.linkedin.com/in/agostonnagy Skyscanner data team shares their experiences on data products, too! Julianna Gobolos-Szabo from Skyscanner will present their use case on how to use data science to help travelers find the best deals on a global scale. https://www.linkedin.com/in/gszjulianna/ Schedule: 6:30pm: Pizza and mingling 7:30pm: Talks start After talks: Beers in Apacuka. See you next Thursday!

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

    4 comments
  • Python Pandas and Log Validation in Practice
    Hi, We are having our next meetup on 19 July with two pretty exciting talks about Python Pandas and Log Validation. Schedule: 6:30-7:00 Warmup, pizza and chit-chat :) 7:00-8:30 Talks 8:30 - Beers in Apacuka We hope to see you soon at Prezi House of Ideas! Automatic logvalidation at Prezi Tamás Németh, Data Infrastructure Tech Lead, Prezi https://www.linkedin.com/in/nemetht/ Do you know what is the worst enemy of a data engineer? If somebody changes a log line in a non-backward compatible way and so breaks ETL jobs.This is quite frequent if you have 100 engineers who do code changes every day a couple of times. In my talk, I will show a system what we build at Prezi to take care of these issues automatically with streaming log validation and how we redesigned our logshipping infrastructure. Tamás has more than 10 years prior experience as software engineer in various fields like PKI and investment banking. Now at Prezi as the tech lead of data engineering team, he makes sure the data infrastructure rocks, it is reliable and a joy to work with. Data "What, Why, and How" to "You, Here, and Now” Manel Rhaiem, Data Engineer at Secret Sauce Partners https://www.linkedin.com/in/manel-rhaiem-4923104b/ This quote could explain everything “Data is the new oil? No: Data is the new soil.” – David McCandless. Why and How we can use data in our lives and what is important about it, also how Python could provide flexible tools to let you be a good data engineering and cook a good plate for the data analyst persons, also data analyst persons could use python to analyze and extract some insight. I believe in that and I am working on it now as a Data Engineer and I would love to share it with you. Manel Rhaiem, a woman in tech, from Tunisia, started her funny journey of learning with Open Source communities. Helping always in organizations stuff, facilitating sessions and trying to improve more her Skills in Talks, Mozilla Representative since 2013 and Tech speaker. Main Interest in speaking as a Woman in Tech about Data, Data analyst, how to manage data and the importance of data, "Still Learning and sharing those learnings”

    Prezi House of Ideas

    Nagymezo 54-56, · Budapest

    8 comments
  • Hacknovation 2017
    24-hour long tech startup Smart City hackathon. Gather your team and use any sensors, devices and APIs to build the next innovative idea to make people's life easier. ************ The tasks: ************ - Develop a prototype with your team under 24 hours in the theme of Smart City! (e.g. you could create a machine learning system that controls the street lights, or an AR app that can be used on people's smart devices to read data from the wall of city buildings, etc.) - Create a business model, marketing strategy and plan the future goals of your project! - Pitch your wok in front of the professional jury! - Grab the prizes and continue to work on your project even with the help of our partners and be the next startup unicorns! ******************* The registration: ******************* Registration ends on July 4, 11:59 PM! For more information and team creationg go to https://hacknovation2017.devpost.com/ page. ********* Venue: ********* SZTE JGYPK Felnőttképzési intézet 6723 Szeged, Szilléri sgrt. 12. ********** Contact: ********** The event is organized by GYEEK Club Association and INPUT Program. Contact us at [masked] if you have any questions! ------------------------------------------------------------------------ *Hungarian version*Magyar változat* 24 órás IT startup hackathon Smart City témában. Gyűjtsd össze a csapatodat és használj bármilyen szenzort, eszközt, API-t és építsd meg a következő innovatív ötletet, amellyel könnyebbé teszed az emberek életét. ************** A feladatok: ************** - Hozz létre csapatoddal egy prototípust 24 óra alatt Smart City témakörben! (pl. megalkothattok egy forgalomirányító lámpákat vezérlő gépi tanuló rendszert, vagy egy kiterjesztett valóság alkalmazást, mely segítségével az emberek okos eszközeikkel adatokat olvashatnak le az épületek falairól) - Állítsatok össze egy üzleti modellt, marketing stratégiát és tervezzétek meg a projekt jövőbeli céljait! - Mutassátok be munkátokat egy pitch formájában a szakmai zsűri előtt! - Vigyétek el a nyereményeket, illetve dolgozzatok tovább projekteteken, akár partnereink segítségével, és legyetek ti a következő startup unikornisok! ***************** A regisztráció: ***************** A regisztráció július 4-én, 23:59-kor zárul! További információkért és a csapat létrehozásához látogass el a https://hacknovation2017.devpost.com/ oldalra. *********** Helyszín: *********** SZTE JGYPK Felnőttképzési intézet 6723 Szeged, Szilléri sgrt. 12. ************ Kapcsolat: ************ A program a GYEEK Club Egyesület és az INPUT Program szervezésében valósul meg. Kérdéseitekkel keressetek minket az [masked] címen!

    SZTE JGYPK Felnőttképzési Intézet

    Szilléri sgt. 12., · Szeged