- Why R? Webinar platypus - image segmentation & object detection made easy with R
- speaker: Michal Maj https://www.linkedin.com/in/michal-maj116/ - webinars http://whyr.pl/webinars/ - donate http://whyr.pl/donate/ - join Why R? Slack http://whyr.pl/slack/ - join Meetup http://tiny.cc/WarsawRUG - format: 45 min talk + 15 min for Q&A - comments: ask YouTube live chat - stream https://youtu.be/ONfqqSWwU9E # Description With the release of the R Keras package (https://keras.rstudio.com/) (by JJ Allaire and Francois Chollet) at the end of 2017 and the beginning of 2018, the topic of artificial neural networks, and especially deep learning in R, became red-hot within the R community. 'platypus' is an R package that allows you to build advanced deep learning models like YOLO3 and U-Net for object detection and image segmentation tasks. Thanks to built-in data generators and ready-to-go architectures, you can implement a solution in a few lines of code. # Bio I’m a data scientist with over 5 years of experience in the field, an R/Shiny developer and a freelancer interested in deep/machine learning and statistics. I love challenges and I’m always ready to help solve data science problems. In my day-to-day work I am responsible for designing, testing and developing advanced deep and machine learning models in R and Python. I’m a big R language enthusiast and a co-organizer of R Enthusiasts meetups in Gdańsk (https://www.meetup.com/Trojmiejska-Grupa-Entuzjastow-R/). Currently I'm trying to become a computer vision expert.
- Gradient Boosting Machines - From Zero to Hero - X-Europe Webinars
Gradient Boosting Machines (GBM): from Zero to Hero (with R and Python code) REGISTER HERE: https://gradient-boosting-zero-to-hero.carrd.co/ This talk will get you started with gradient boosting machines (GBM), a very popular machine learning technique providing state-of-the-art accuracy on numerous business prediction problems. After a quick intro to machine learning and the GBM algorithm, I will show how easy it is to train and then use GBMs in real-life business applications using some of the most popular open source implementations (xgboost, lightgbm and h2o). We’ll do all this in both R and Python with only a few lines of code, and this talk will be accessible to a wide audience (with limited prior knowledge of machine learning). Finally, in the last part of the talk I will provide plenty of references that can get you to the next level. GBMs are a powerful technique to have in your machine learning toolbox because, despite all the latest hype about deep learning (neural nets) and “AI”, GBMs usually outperform neural networks on the structured/tabular data most often encountered in business applications. OUR SPEAKER Szilard Pafka, PhD Chief Scientist, Epoch (USA) Szilard studied Physics in the 90s and obtained a PhD by using statistical methods to analyze the risk of financial portfolios. He worked in finance, then more than a decade ago moved to become the Chief Scientist of a tech company in Santa Monica, California doing everything data (analysis, modeling, data visualization, machine learning, data infrastructure etc). He is the founder/organizer of several meetups in the Los Angeles area (R, data science etc) and the data science community website datascience.la.
He is the author of a well-known machine learning benchmark on github (1000+ stars), a frequent speaker at conferences (keynote/invited at KDD, R-finance, Crunch, eRum and contributed at useR!, PAW, EARL, H2O World, Data Science Pop-up, Dataworks Summit etc.), and he has developed and taught graduate data science and machine learning courses as a visiting professor at two universities (UCLA in California and CEU in Europe). LinkedIn: https://www.linkedin.com/in/szilard Twitter: https://twitter.com/DataScienceLA Github: https://github.com/szilard ABOUT THE X-EUROPE DATA SCIENCE WEBINARS This is a joint online event of Vienna Data Science Group, Frankfurt Data Science, Budapest Data Science Meetup, BCN Analytics, Budapest.AI, Barcelona Data Science and Machine Learning Meetup, Budapest Deep Learning Reading Seminar and Warsaw R Users Group. SUPPORTERS Thank you to Informance GmbH | www.informance.at, contextflow | www.contextflow.com, uni software plus GmbH | www.unisoftwareplus.com, FreeDOM Data | www.freedom-data.com and Gradient Zero | www.gradient0.com for supporting this.
- Why R? Webinar 017 - Suhem Parack - Introduction to Twitter data analysis in R
- speaker: Suhem Parack, Sr. Developer Advocate, Academic Research at Twitter https://twitter.com/suhemparack - donate http://whyr.pl/donate/ - join Why R? Slack http://whyr.pl/slack/ - join Meetup http://tiny.cc/WarsawRUG - format: 45 min talk + 15 min for Q&A - comments: ask YouTube live chat - stream: https://youtu.be/0sJVKxHodJo Abstract: The Twitter API provides access to a high volume of Tweets that developers and researchers use to study the public conversation as it is happening in real time. In this talk, learn how you can get started with accessing and analyzing Twitter data using R. Bio: Suhem Parack is a Sr. Developer Advocate at Twitter and focuses on helping the academic research community succeed on Twitter's developer platform. Follow or reach out to him on Twitter @suhemparack with questions about Twitter data
- The strong ML hypothesis by Antonio Piccolboni
Date:[masked]pm UTC+2 Title: The strong ML hypothesis Registration: https://ml-hypothesis.carrd.co/ SPEAKER: Antonio Piccolboni https://piccolboni.info/ TALK: The strong machine learning hypothesis can be roughly stated as: "test set generalization is all that matters" with the corollary that "scaling is all we need". These ideas are propelling much of the recent work and excitement in machine learning. I will list four reasons why the hypothesis is not likely to be true and two why scaling is not promising even if the hypothesis were true. BIO: Antonio Piccolboni is a Data Scientist with 30+ years of experience, including a decade in bioinformatics. He started working on neural networks as a student during the second "AI winter" and has been a keen observer and at times a user of the technology ever since. His main claim to fame is the most widely adopted R package for Hadoop, rmr2, which has been downloaded more than 1M times. He recently switched his focus to the climate crisis.
- Why R? 2020 Conference - Remote!
[REGISTRATION STARTS JUNE 1st] - whyr.pl/2020/register/ We will stream the conference at youtube.com/WhyRFoundation The Why R? 2020 Conference is the fourth meeting of R enthusiasts. It will be held remotely, 24-27 September 2020. As the meeting is held in English, we are happy to invite R users from all over the globe! The Why R? 2020 Conference is an international event for users of the R statistical software. It is a remarkable occasion to gain knowledge about R, enhance data analysis and processing skills, and meet other members of the European R community. TIMELINE [masked] - Hackathon[masked] - Workshops[masked] - Lectures Day[masked] - Lectures Day 2
- Workshops: Innovation Box (IP Box) in Poland + Legal basics for data scientists
# Register - 2020.whyr.pl/register/ # About Leśniewski, Borkiewicz & Partners Leśniewski, Borkiewicz & Partners is a modern, boutique law firm with a business approach. LB&P works mainly with clients from the IT, TMT and e-commerce sectors, both on defined projects and as a comprehensive, ongoing service. The main areas of LB&P specialization are negotiating commercial contracts, mergers, acquisitions and restructuring (M&A), personal data protection, legal support in the field of designing mobile applications, image and personal rights protection, cybersecurity, compliance and taxes. For more details please check the website lbplegal.com or the LB&P LinkedIn account https://www.linkedin.com/company/lbplegal/ # Workshop 1 Time: 14:00-15:30 Register: 2020.whyr.pl/register/ Title: Innovation Box (IP Box) in Poland – how to use preferential 5% income tax rate Seats: 50 Tutors: Natalia Wojciechowska-Chałupińska (r.pr. / attorney) ; Grzegorz Leśniewski (adw. / attorney) Description: Data scientists, programmers and other professionals in Poland may lower their income tax to a 5% rate on qualified income derived from qualified IP rights. This applies both to natural persons and to companies. The IP Box is also available to Polish non-residents who receive income from qualified IP rights through their permanent establishment located in Poland. These workshops will explain which income the lowered tax rate applies to, how to apply it and how to prepare the necessary documentation. These workshops will be held in Polish (unless English-speaking participants sign up by Wednesday), whereas the presentation will be in English. # Workshop 2 Time: 15:30-17:00 Register: 2020.whyr.pl/register/ Title: Legal basics for data scientists Seats: 50 Tutors: Urszula Ilnicka - Karaban (r.pr. / attorney) ; Grzegorz Leśniewski (adw. / attorney) Description: There are a number of potential ways to conduct business activities in Poland.
These workshops will explain: 1. Which legal forms are best for the business activities of data scientists 2. What liability you may incur 3. How and on what legal basis to employ your staff 4. How to secure intellectual property rights, company secrets and know-how 5. How to deal with personal data protection if you analyze personal data 6. Getting investors – practical comments These workshops will be held in English.
- Text Mining Hackathon
Website - 2020.whyr.pl/hackathon/ Slack - whyr.pl/slack/ Theme We just announced the theme for the Hackathon - Text Mining. You can read more details about the Hackathon on the website 2020.whyr.pl/hackathon/ Submit Teams If you find the topic interesting, please use this form whyr.pl/2020/hackathon/register/ to submit your team. We would like participants to gather in teams of 4 or 5. If you don't yet have a team, please use our whyr.pl/slack/ to find teammates. Timespan We start the hackathon with a Keynote Talk by Julia Silge at 5:30pm UTC[masked]. We finish the hackathon with a Sponsored Talk by McKinsey at 5:30pm UTC[masked]. This means there are 24 hours for solving the challenges published at the beginning of the event! You can find more scheduled talks on our YouTube channel youtube.com/WhyRFoundation We will be adding content over the coming days. Warmly, Why R? Organizing Committee
- Why R? Webinar 016 - Colin Gillespie - Me, Myself and my Rprofile
This webinar promotes Why R? 2020 conference - http://2020.whyr.pl that happens[masked] My Rprofile is optimal (for me). My RStudio set-up is optimal (for me). My desktop set-up is optimal (for me). This talk will demonstrate the myriad of options available to R users, such as customising your R prompt, selecting the best font, and optimising your RStudio pane layout. At the end of the talk, you too can have the optimal, but bespoke, R set-up. - speaker: Colin Gillespie https://www.jumpingrivers.com/ - donate http://whyr.pl/donate/ - join Why R? Slack http://whyr.pl/slack/ - join Meetup http://tiny.cc/WarsawRUG - format: 45 min talk + 15 min for Q&A - comments: ask YouTube live chat - stream: https://youtu.be/2rWi2KVcyjg
- Can Telematics Data Identify Risky Drivers?
Register - https://telematics-data.carrd.co/ This is a joint online event of Vienna Data Science Group, Frankfurt Data Science, Budapest Data Science Meetup, BCN Analytics, Budapest.AI, Barcelona Data Science and Machine Learning Meetup, Budapest Deep Learning Reading Seminar and Warsaw R Users Group. TALK Many insurance companies collect telematics data about drivers’ exposure to traffic (distance driven and type of road) and their driving behavior (excess speed, aggressiveness, operating hours). Actuaries, who are experts in risk analysis for insurers, use this information to improve the insurance rate-making process. Personalized driving risk indicators can also promote driving safety. The starting point is data collected from inertial measurement units installed in vehicles, combined with the corresponding records of accidents. We will address the most relevant current challenges in driving risk measurement. Illustrations with several real data sets provided by insurance companies will answer the following questions: How should pay-per-mile insurance schemes be designed? How can near-miss telematics be used to identify risky drivers? What is the power of risk analytics, percentile charts, and reference curves to monitor drivers? We will emphasize the enormous potential of this field to design better insurance products and improve safety. We will perhaps be able to guess why Tesla has been urgently looking for actuaries in the last few weeks. SPEAKER Montserrat Guillen has a Ph.D. in Economics, an MSc in Mathematical Statistics, and an MA in Data Analysis. She is a Professor at the University of Barcelona and Director of Riskcenter – Research Group on Risk in Insurance and Finance. Montserrat is a specialist in Actuarial Science and Insurance. Top-ten cited author (ResearchGate). Optimistic; likes understanding data and mitigating risks. Loves having friends from around the world.
- Why R? Webinar 015 - Sydeaka Watson - Data Science for Social Justice
This webinar promotes Why R? 2020 conference - http://2020.whyr.pl that happens[masked] - speaker: Sydeaka Watson https://www.sydeaka.com/ - donate http://whyr.pl/donate/ - join Why R? Slack http://whyr.pl/slack/ - join Meetup http://tiny.cc/WarsawRUG - format: 45 min talk + 15 min for Q&A - comments: ask YouTube live chat - stream: https://youtu.be/hul0cOBgqPQ # Abstract: Social justice movements are leveraging data science in their efforts to promote positive change in our communities. In this talk, we will explore a few examples of successful, effective, ongoing partnerships between data scientists and community stakeholders to promote social justice. The discussion will include an overview of the tools/technologies (including R!) that were useful in those initiatives. We will also learn how we could lend our technical skills to the fight for social justice by either aligning with existing initiatives or kickstarting new projects. # Bio: Dr. Sydeaka Watson is a native of New Orleans, Louisiana and currently lives in Dallas, Texas. Sydeaka earned a Ph.D. in Statistics from Baylor University and has several years of teaching experience. As a Research Assistant Professor in The University of Chicago Biostatistics Laboratory, she consulted with over 110 biomedical research teams in The University of Chicago Medical Center, specializing in statistical analysis and experimental design for clinical research studies. In both of her Senior Data Scientist roles in the AT&T Chief Data Office and at Elicit Insights, she used predictive analytics and visual tools to implement custom solutions for customer behavior analytics and customer management. Now, as Founder and Owner of Korelasi Data Insights, LLC, Sydeaka develops custom deep learning solutions to enable extraction of insights from complex data sources, particularly in the areas of computer vision (image recognition) and text analytics. Dr. 
Watson is passionate about issues related to social justice advocacy and diversity & inclusion in STEM careers. In previous years, Dr. Watson served as Chair of the American Statistical Association (ASA) Committee on Minorities in Statistics and was Co-Organizer of the Dallas Blacks in Technology Meetup. Currently, Sydeaka serves as Organizer of the Dallas Chapter of R-Ladies Global. She also volunteers for Data 4 Black Lives, Citizens Committee for New York City, the R Consortium for Diversity & Inclusion, and in the Dallas chapters of Girls Who Code and Black Girls Code.