What we're about

Bringing data enthusiasts together to foster the exchange of ideas and the intellectual growth of the data community. Our series of meetups showcase the work of talented data professionals across industries, for you to get insider tips and tricks to turn data into actionable insights (read more here (https://blog.dataiku.com/announcing-dataconnect-meetups-nyc)).
All types of data professionals, scientists, engineers, analysts, and students welcome!

Want to collaborate?

We partner with companies, schools, and individuals to organize meetups and showcase the work of inspiring data professionals to a broad audience.
Get in touch to contribute! (https://pages.dataiku.com/collaborate-with-dataiku-to-uplift-data-science-community)

About Dataiku:

Dataiku develops the unique advanced analytics software solution that enables companies to build and deliver their own data products more efficiently. Its collaborative, team-based user interface works for all profiles, from data scientists to beginner analysts, and the unified framework allows for both development and deployment of data projects. Learn more at www.dataiku.com (http://www.dataiku.com/)

Connect with us on social media:

• Follow us on Twitter (http://www.twitter.com/dataiku)
• Connect with us on LinkedIn (https://www.linkedin.com/company-beta/2770554/)
• Like our Facebook page (https://www.facebook.com/dataiku/)

Upcoming events (5)

[VIRTUAL MEETUP] Data Driven Business Growth & Cost Reduction w/ Airbnb

Thank you for your interest this Dataiku NY Meetup! The health & safety of our attendees and speakers is our primary concern. While this currently proves to be a tricky time for public gatherings, Dataiku is still committed to providing great tech content and facilitating discussions in the data science space. As such, we’ve decided to pivot towards online webinars via our partner platform, Brighttalk. The entire Meetup in the same format will be held virtually, which allows for a live presentation and Q&A session after. IMPORTANT: In order to gain entry, you must RSVP through this BrightTalk link: https://www.brighttalk.com/webcast/17108/411587?utm_source=Dataiku&utm_medium=brighttalk&utm_campaign=411587 Tentative Schedule: 7:00pm: Intro 7:05pm: How Data Can Propel Business Category Growth & Drive Cost Reduction with Airbnb by Cassie Cao 7:45pm: Q&A Talk Abstract: At Airbnb, Cassie helped launch new business categories such as Airbnb for work and the Luxury segment. Currently, she is leading a Data Science team to drive Customer Service cost reduction. From new business category growth to cost reduction, data can be the powerful driver. Cassie will share her key learnings with you about different data science tactics to address these two problems on the opposite side. Speaker bios: Cassie Cao is a Data Science Manager at Airbnb. She is deeply passionate about tackling challenging business problems using data. She has built the Airbnb Luxury Data Science team from scratch and recently transited into creating a new DS team focusing on cost reduction. Cassie has a master’s degree in Statistics from Harvard, and was a research associate at HBS.

Strategy and Marketing Attribution Models w/ Squarespace

Thank you for your interest this Dataiku NY Meetup! The health & safety of our attendees and speakers is our primary concern. While this currently proves to be a tricky time for public gatherings, Dataiku is still committed to providing great tech content and facilitating discussions in the data science space. As such, we’ve decided to pivot towards online webinars via our partner platform, Brighttalk. The entire Meetup in the same format will be held virtually, which allows for a live presentation and Q&A session after. IMPORTANT: In order to gain entry, you must RSVP through this BrightTalk link: https://www.brighttalk.com/webcast/17108/411600?utm_source=Dataiku&utm_medium=brighttalk&utm_campaign=411600 Tentative Schedule: 2:00pm: Intro 2:05pm: Strategy and Marketing Attribution Models with Squarespace by Braden Purcell, Omar Abboud, and Nate Lawson from Squarespace 2:45pm: Q&A Talk Abstract: Squarespace’s Data Science team helps the company make better strategic decisions using data. The Data Science and Marketing teams collaborate closely to ensure that they quantify the impact of their marketing decisions and use data to optimize spend allocation. In the fireside chat, they will be asked to give a broad overview of marketing data science at Squarespace. They will also be asked to provide examples as to how they have improved the quality of their checkout survey that serves as a key input to their marketing attribution model. To wrap up, they will hang out for a Q&A session with members of both data science and marketing teams to answer questions about how they work together. Speaker bios: Braden Purcell is a data scientist on the Marketing Data Science team at Squarespace. He helps the team develop tools and data products to optimize marketing spend and forecast performance. Before that he was a postdoctoral scientist at the NYU Center for Neural Science where he used experiments and computational modeling to understand how the brain makes decisions. He has a PhD from Vanderbilt University studying cognitive neuroscience. Omar Abboud is a data scientist on the Marketing Data Science team at Squarespace. Over the course of his time with the company, he has worked with various media teams to design experiments, develop measurement tools, and bring clarity to the Squarespace acquisition funnel through the lens of data science. Omar also works closely with Squarespace’s International Strategy team, using statistics and data analysis to inform the company’s strategic direction for international expansion. He holds an M.S. in Computational Science from Harvard University. Nate Lawson is a Data and Operations Lead within the Media & Acquisition team at Squarespace. He provides strategy, guidance, and oversight to marketing channel leads as to how best to implement, track, and report on media initiatives. He also works closely with the Data Science team in developing these strategies. Prior to this role, Nate managed Paid Social performance at The New York Times and General Assembly. He attended Purdue University in Indiana with a focus on technical writing.

[VIRTUAL MEETUP] Reducing AI Bias & Optimizing Data Labeling Frameworks w/ Appen

Thank you for your interest this Dataiku NY Meetup! The health & safety of our attendees and speakers is our primary concern. While this currently proves to be a tricky time for public gatherings, Dataiku is still committed to providing great tech content and facilitating discussions in the data science space. As such, we’ve decided to pivot towards online webinars via our partner platform, Brighttalk. The entire Meetup in the same format will be held virtually, which allows for a live presentation and Q&A session after. IMPORTANT: In order to gain entry, you must RSVP through this BrightTalk link: https://www.brighttalk.com/webcast/17108/412085?utm_source=Dataiku&utm_medium=brighttalk&utm_campaign=412085 Tentative Schedule: 2:00pm: Intro 2:05pm: Reducing AI Bias and Optimizing Data Labeling Frameworks w/ Appen by Monchu Chen 2:45pm: Q&A Talk Abstract: Bias in machine learning has become a significant concern as AI technology spreads to more application domains. While some bias is a consequence of limits in design and tooling, bias in the training data itself is much more common. Skewed training data often promotes AI models that reveal discrimination and amplify human prejudices. In this talk, we present a framework, developed at Appen, to minimize bias. This framework operates by routing data labeling tasks to the right labelers to avoid introducing bias. It also optimizes the process by determining a proper distribution of labelers for a given task. Our speaker, Monchu Chen, will review some use cases where this framework has been applied, and discuss results that show how the optimization process minimizes skew in the training data. Chen will also discuss extending this approach to other use cases and review the implications of this work. Speaker bios: Monchu Chen has worked in human-computer interaction for more than two decades. He has helped corporations and startups apply user insights to product innovation in multiple application domains. Monchu now focuses on the human aspects of AI as the Priciple Data Scientist for Appen's ML team, building models and systems to improve annotation quality, efficiency, and reducing AI bias. Dr. Chen holds a PhD from Carnegie Mellon University. He previously held a tenured faculty position at Carnegie Mellon Portugal and is the author of more than 70 publications and patents.

Data Science Disruption Across Industries ft. Hulu, Nordstrom, & Atomwise

Thanks for your interest this Dataiku NY Meetup! The health & safety of our attendees & speakers is our primary concern. While this currently proves to be a tricky time for public gatherings, Dataiku is still committed to providing great tech content & facilitating discussions in the data science space. As such, we’ve decided to pivot towards online webinars via our partner platform, Brighttalk. IMPORTANT: In order to gain entry, you must RSVP HERE: https://www.brighttalk.com/webcast/17108/413296?utm_source=Dataiku&utm_medium=brighttalk&utm_campaign=413296 Tentative Schedule: 2:00pm: Intro 2:05pm: Data Science Disruption Across Industries ft. Nordstrom, Hulu, & Atomwise 2:45pm: Q&A Talk Abstract: The impact of data science has been felt across a range of industries including retail, pharmaceuticals, & media. As the retail sector strives to stay technologically significant, data science has emerged as a lifeguard that can be leveraged to predict customer attitudes, visualize customer behavior, & implement knowledgeable decisions. In this fireside chat, we will cover how investments in data science by behalf of the retail industry has heightened the capabilities of retailers beyond just data collection & analysis. The pharma industry has also emerged as an industry where data science is increasing its application. We will touch on applications of data science within pharmaceuticals ranging from identifying suitable candidates for trails based on their physiological chemical structure, medical history, & more. The media & entertainment industry have also ventured into a digitally driven space & the amount of customer data available has exponentially increased. The bulk of the data work today within media & entertainment is dedicated to audience understanding & answering questions such “What are people reading, listening to, & watching?” We will discuss how modern applications of data science in the media & entertainment industry establish new rules & demand extra creative thinking from industry holders. Speaker bios: Skander Hannachi is a data scientist working for Nordstrom's demand forecasting & replenishment team. His current interests are time series models & optimization algorithms applied to merchandising & supply chain, & the use of NLP in business analytics. Prior to that, he worked in consulting roles, implementing retail forecasting & planning solutions for various retailers. He holds a Ph.D in computational intelligence & has done academic research on neural networks & fuzzy logic, both from a theoretical point of view, as well as applied to fisheries & environmental sciences. He also occasionally blogs about Data Science related topics on Medium, which you can read at medium.com/@skanderhannachi . Jared Thompson, PhD is a data scientist & software engineer with over a decade of experience in the pharmaceutical space. He is currently working with the wonderful team at Atomwise, where his primary efforts are towards the prediction of the biological activity of a drug molecule from its X-ray crystal structure. He has professional interests in control theory & optimization, information theory, agent-based models & intelligent systems, particularly interpretable & self-teaching neural architectures. Moe Lotfy is a Senior Product Data Scientist at Hulu leading the Subscriber Acquisition Data Analytics & Experimentation function where he combines statistical techniques with predictive modeling to uncover business insights & inform strategic product decisions. Prior to Hulu, he worked at KPMG where he led efforts to build marketing-focused ML models for several clients, as well as, at Tesla where he built predictive models to guide autopilot hardware’s thermal control strategy. Moe received his PhD in Mechanical Engineering at UCLA with a focus in nuclear fusion experimentation & numerical analysis. His passions include the state of AI, VR, shipping high-impact products, & all things data/experimentation.

Past events (73)

Photos (155)