- Intro to Elasticsearch
Agenda: Door Opens: 5:45 pm Pizza and Networking: 5:45 - 6:15 pm Presentation: 6:15 - 7:00 pm Wrap up: 7:00 - 7:15 pm Please collect your name tag from the front door. Front door will be closed after 6:15 pm. Please arrive before then. Topic: "An overview of Elasticsearch and its inner workings" Speaker: Naomi Sprague has been an associate Data Technologies Engineer at Digital river for about a year and is an aspiring data scientist. Along with learning about Elasticsearch, Apache Kafka, and Cassandra during the day, she's currently pursuing a M.S. in Data Science at St. Thomas University in the evenings with a projected graduation of Spring 2019.
- Big Data Meetup - Analyzing Real-Time Streaming IoT Data
Agenda: Door Opens: 5:45 pm Pizza and Networking: 5:45 - 6:15 pm First Presentation: 6:15 - 7:00 pm Wrap up: 7:00 - 7:15 pm Please collect your name tag from the front door. Front door will be closed after 6:15 pm. Please arrive before then. Topic: "Analyzing Real-Time Streaming IoT Data" Speakers: Fru Nde Description: In this session, we would demonstrate real-time streaming and machine learning capabilities using Talend platform. In this interactive session, Talend will be setup to receive accelerometer sensor data from mobile phones in real-time, push the data into a message queue, and perform machine learning to classify the data for analysis. On the processing side, a REST endpoint is created using Talend to which the sensor data is sent. The sensor data is parsed and pushed into a message queue (Kafka). Once the data is in the message queue, a Talend Big Data Streaming job reads the messages off the queue using a sliding window, passes the data through a machine learning model, and prepares the data for visualization. The most remarkable piece of this exercise is the fact that no hand coding is required. Everything from creating a REST service to acquire data, to the Spark Streaming job which implements a machine learning model, will be designed using a graphical user environment. Bio: Fru Nde (https://www.linkedin.com/in/frulouis) is a Solutions Engineer for Talend (www.talend.com). In this role, Fru provides product and industry expertise to customers; advising them on how to leverage Talend’s scalable, future-proof solutions and other software products to satisfy their critical business requirements. Fru has spent over 8 years in the data space, speaking and evangelizing about data at forums such as; MinneAnalytics, MicroStrategy World and more. Fru has a bachelor's degree in computer science and a masters degree in Enterprise Architecture.
- Big Data monthly meet up
• What we'll do Agenda : Spark in nutshell · What is big data? · Distributed file system needed · Processing big data stored · What is Spark? · How is Spark different from Map Reduce? · Spark Deployment model · Multiple language support · Demo Speaker: Anil kumar Gupta is working as a Director of IT at leading health insurance company. Anil has 19+ years of IT experience. He has lead and executed multiple projects in different technologies like BigData, Hadoop, MapReduce, Spark, PEGA, BPM technology, Java/J2EE, different databases, and many more. His current work involves making Mesos platform operation for application team. Twitter : @anilkgg (https://twitter.com/anilkgg) LinkedIn: https://www.linkedin.com/in/anilkumar-gupta-07a9371 More details will appear soon. • What to bring Additional Guests are welcome ! • Important to know Pizza and Drinks will be served
- Machine Learning, Deep Learning and Application security with Big Data
Agenda: Door Opens : 5:45 pm Pizza and Networking : 5:45 - 6:15 pm First Presentation : 6:15 - 7:00 pm Second presentation : 7:00-8:00 pm Wrap up : 8:00 - 8:15 pm Please collect your name tag from the front door. Front door will be closed after 6:15 pm.Please arrive before then. Topic 1: "Machine Learning and Deep Learning in Big Data Platform" Speakers : Bosky Mathew and Bradley Hoskins Description : This presentation will provide technical design and development insights to run Machine Learning and Deep Learning work load in a Big Data environment. The presentation will demonstrate how to set up the Jupyter notebook for the Data Science community by integrating JupyterHub, Mesos, ScikitLearn, Tensorflow, Spark and MapR Hadoop and how to securely access data from Datalake. Docker minimizes the complex integration challenges involving security and compute isolation which are essential for any Data Science community. No prior knowledge of these technologies is required in order to understand this presentation. Bio: Bosky Mathew is the director of big data platform and architecture at United Health Group. She played a key role in delivering UHG’s enterprise big data platform called BDPaaS and is responsible for defining and establishing enterprise architecture and direction for big data programs. Bosky is working with United Health Group for past 9 years and provided architecture leadership for many strategic programs such as Optum cloud and Tricare program. She started her career as Java developer and later moved into the role of application and platform architect for cloud and big data applications. Bradley Hoskins is a Big Data Architect at United Health Group and a member of the Big Data Platform Team which supports a large secure multi-tenant distributed computing / storage platform. Responsible for evaluation, testing, documentation and integration of new and existing open source technologies with the Big Data Platform. He graduated in 2014 from Iowa State University with a MIS degree, Go Cyclones! and started at UHG right out of college in the Technology Development Program. Topic 2: To Observe and Protect: Application Security and Big Data Speaker: John Bauer Application Security is a lot more than Security Development Lifecycle(SDL), Dynamic Application Security Testing (DAST), Static Application Security Testing (SAST), Runtime Application Self-Protection (RASP) or Web Application Firewalls (WAF). Network and application visibility is required to identify applications and assets which require one or more of these controls. On every network there are external and internal application attack surfaces and they should all be scanned, observed, and protected whether it’s home grown, out of the box or embedded systems, Internet of Things (IoT). These data points can be collected with active scanning, application and web log collection and passive observing with Splunk Stream
- The Challenges of Big Data Backups
Summary: Dr. Prasenjit Sarkar, will provide a technical perspective on the challenges of backup and recovery for next-generation distributed noSQL databases and big data file systems. Prasenjit will discuss the evolution of micro-services architectures, the strengths and weaknesses of native tools, and he will share a blueprint for next-generation backup and recovery. Speaker: Dr. Prasenjit Sarkar is Co-founder & CTO of Datos IO, where he is responsible for architecture, technical strategy and advanced development initiatives. With a focus on big data, distributed systems, and data management, Prasenjit has defined and led industry disruptions, delivered storage and systems products, and led product development organizations to deliver next-generation solutions at IBM Corporation. Agenda: 5:45 - Gate Opens 5:45 - 6:15 - Networking and Pizza 6:15 - 7:00 - Presentation by Dr. Sarkar 7:00 - 7:30 - Question and wrap up Please enter through the main entrance and take your name tag. Please make sure to arrive before 6:15. Gate will be closed after 6:15 and you cannot enter after that time. Pizza and drinks will be served at the venue. Looking forward to see you !
- Sensors, Spark and Kafka: Applied Machine Learning
Please Join us at the Twin Cities Big Data Meetup as we bring you a deep discussion on the real-world technologies that underlie IoT (Internet of Things). Working with real-time, streaming data from mobile phone sensors, and using tools that you can use yourself, we walk through the process of building a machine learning solution with Spark and Kafka to collect and analyze user activity. If you are using Kafka, Spark, or any real-time data technologies, or even if you are just trying to get a better understanding of them, this event is for you. Bio: Norbert Krupa is an experienced sales engineering professional, defining technical solutions that meet business and technical requirements of existing and prospective customers. In his current role, Norbert works hands-on with diverse organizations, demonstrating how to harness the power of data. Norbert holds an MS in Computer Science from Northeastern Illinois University. Here is the link to the deck: https://talend365.sharepoint.com/departments/mar... (http://meet.meetup.com/wf/click?upn=pEEcc35imY7Cq0tG1vyTtxRtaA-2FCDlxCxaRC26HNzfnxW5SzmIv2GKMppEALQn0BKhYvJcHAOPTCJa-2FPMkRS5E2YwtywLHQpEpBZHbB-2FmyuOVLEHfYekpxpExZWnFIEf7itiLe8guMNmHm58-2FElNF1XA48QTj6S900yvtAs-2BM5UAzNeR1-2F1ymX-2Bqzhvri-2FTrx2K0Tn-2BSYfQXRa2rvs7Z-2B-2BkyO1HuDHhqnMr8qHzBJHk-3D_Ocn7HtknIK8x9ikEBU9E2flmLZT2CxKCQyJTwXsY6ELroV-2Bf-2BNny6rLQ23QwHlaXVFpGsUWZOinFpiKoZBKty5T9zoc2dNVfi9IljfbpsMn9viDPePCzYr368XENCWGJgRbhsEd8nK14LzdG-2FlEwbLSxkhh64B71d5NZrIFFwoNnmD3-2FdNrpnpxx0MutjKblVHLBL6BDtNLCC1rHJlWWwbBLywbdD4Sine-2FyKVIAIpw-3D) Here is a video of Norbert's past presentation: https://youtu.be/9otVSVtcF0g (http://meet.meetup.com/wf/click?upn=pEEcc35imY7Cq0tG1vyTt-2BenqPd7ckPElOK0jGjK-2FYeb8JN1nWY4wDbxX4s51XKj_Ocn7HtknIK8x9ikEBU9E2flmLZT2CxKCQyJTwXsY6ELroV-2Bf-2BNny6rLQ23QwHlaXVFpGsUWZOinFpiKoZBKtyxsURxB9ZCroHJeVH2yAYTEoc7zv0Y2-2BJUcB1L0uOhuLw8PHzL9O1AyuzpElCHbbmNU4rDazLG7sm5O0zOY86JGeAHOg-2BFMYZheVLRurKLNEo-2BxeX0YLWbF0AFSigclE0yAKHvxAeZGco0bQy1WyEsc-3D) Agenda: 5:45 Door opens 5:45 - 6:30 Signup and food 6:30 - 7:30 - presentation 7:30 -8:00 - wrap up Please arrive through front door and signup at the reception. Somebody will escort you to the cafeteria. Door will be closed after 6:30 pm and you cannot enter venue after 6:30 pm. Pizza and Drinks will be served.
- Big Data Security and Other Diabolically Opposed Malaprops
Summary: We often find ourselves embroiled in a multi-flanked battle between security killjoys, dev-ops radicals, prophets of doom (auditors) and heroic data scientists. Amidst the chaos there lies a kernel of sanity that can guide the noble objectives of Big Data handlers, allowing them to deliver the data via safe paths, into safe hands and to safe refuge. In this short presentation, the beleaguered heroes will gain a core list of signposts to follow to ensure that confidentiality and compliance play nicely with availability and integrity. Your presenter, Brent Lassi, is a 17-year Information Security veteran, with a background in development and a passion, if not a talent, for data science. He promises not to lay it on as thickly as this blurb. Agenda: 5:45 - Gate Opens 5:45 - 6:15 - Networking and Pizza 6:15 - 7:00 - Presentation by Brent 7:00 - 7:30 - Question and wrap up Please enter through the main entrance and take your name tag. Please make sure to arrive before 6:15. Gate will be closed after 6:15 and you cannot enter after that time. Pizza and drinks will be served at the venue. Looking forward to see you !
- 2017 Big Data Trends
Speaker: Nagaraja Nayak , VP of Big Data at UHG In this presentation, Nagaraja will be talking about latest trend of Big Data technologies. Speaker's Bio: Nagaraja is a IT Leader, working in United Health Group, focused on data. He has been working in data space for several years in retail and healthcare industries. Please enter through the front door upstairs and collect the name tag. Someone will escort you to the cafeteria where the presentation happens. Agenda: 5:45 - Gate Opens 5:45 - 6:15 - Networking and Pizza 6:15 - 7:00 Presentation & Q/A 7:15- Wrap up Please note that gate will be close by 6:15. If you are running late, please send us text through meet up site and some one will let you in. We look forward to see you soon ! - The organizing commitee Sanjib Basak & Jim Paster
- Future of Analytics
Speech 1: Analytics driven by AI Year 2017 is believed to the year of AI powered applications and analytics. Currently, there is a race going on among the tech giants like Google, Microsoft, Facebook, Amazon, Salesforce etc. to dominate the market of AI. Microsoft CEO Satya Nadella recently commented, "We have no global growth. So we actually need technological breakthrough, we need AI". Let's evaluate some of the platforms that are available in the fields of AI powered app development. We will look at some of the conversational chat bots that will change our life in near future. Let's also evaluate "TRUE" machine learning capabilities of some of the applications that claim as "AI powered bot". We will evaluate some of platforms like Slack, Azure, Blumix (from IBM) that are providing APIs to develop those ML and AI bots. Speaker: Sanjib Basak is working as director of data science at Digital River. He is working on ML, AI and passionate about those things. He plans to share some of his findings in this session. Speech 2. Title: Rise of Crowd Sourced Analytics & Skill Volunteerism Topic: Times are changing and analytics changes with them, in this talk hear about the rise of crowd sourced analytics and skill driven volunteerism initiatives, impact and how to get involved both locally and globally. Speaker: John Hogue is a Senior Data Scientist at General Mills and founder of Social Data Science Agenda: Gate Opens: 5:45 pm Presentation 1: 6:15 - 6:45 pm Presentation 2: 6:45: 7:15 pm Wrap up by 7:30 pm Please enter through Front desk of Digital River. You will need to sign up at the desk and somebody will be there to escort you at the venue. Front desk will close by 6:15.If you are running late, please text me at[masked] and I will come and get you at the main entrance. Pizza and drinks will be served. Looking forward to see you tomorrow
- Distributed Data technologies in Big Data
Meet up presentation will be on what Distributed Data Processing principles are and how those principles help Big Data Technologies achieve massive scalability and always-on Availability features Presenter's Bio: SP Naidu is Director of Distributed Data Technologies at Digital River. He has extensive work experience in Enterprise Architecture and Data Processing. SP did MBA in Finance Strategy and from Carlson School of Business, UOM. SP also has done MS in Biomedical Engineering from IIT, Madras and B.Tech in Electronics Engineering from KITS, Ramtek Agenda: 5:45 - Door opens and sign-up starts. Front door will be closed by 6:15. After that you cannot enter into the building 6:15 - 7:15 - Presentation & QA 7:15 - 7:30 - Wrap up Pizza and drinks will be served!