• Profiling Your Data

    TEKSystems

    Have you ever started a warehouse or ETL project and realized that the data wasn’t as “clean” as you were told? If only you had profiled your data before you started then you wouldn’t have to rework design elements, change code or redesign your database. In this session we will talk about what data profiling is, why you should do it and how you can do it with tools that are already included in the SQL Server BI stack. Angela Henry is a DBA/BI Developer, living in High Point, NC and loves what she does. She's worked with all versions of SQL Server and worn all the hats that come with dealing with SQL Server throughout the years: developer, administrator, data architect and BI developer. She has her MSCE: Business Intelligence and is a Data Platform MVP. She volunteers with PASS in various capacities, ranging from Program Committee to cleanup at local SQL Saturday events. She is also the chapter leader for the PASS Triad SQL User Group in Greensboro, NC. In her spare time you can probably find her in the pool, she's an avid US Masters swimmer, Coach and Instructor.

    5
  • Scaling OLTP Workloads with Containers by Chris Adkin

    When pushing the database engine to its limits there are "Usual suspects" in terms of the bottlenecks that the database engine will always hit. In particular there is one "Anti-pattern" that always plagues any system that involves the processing of OLTP like workloads. That is synchronization between threads around structures and primitives that ensure ACID compliance. This session will not only explore this, but cover an innovative approach to solving this conundrum which involves containers and sharding. Chris has been working on the SQL Server since 2000 in various roles including DBA, developer and architect, he has been a regular speaker at various events in Europe for the last five years or so, including SQL Bits. Chris currently works at Pure Storage as the SQL Server Solutions Architect for EMEA.

    4
  • Getting Started With Apache Spark

    TEKSystems

    As companies work to gain insight from ever-increasing amounts of data, data platform practitioners need tools which can scale along with the data. Early big data solutions in the Hadoop ecosystem assumed that data sizes overwhelmed available memory, emphasizing heavy disk usage to coordinate work between nodes. As the cost of memory decreases and the amount of memory available per server increases, we see a shift in the makeup of big data systems, emphasizing heavy memory usage instead of disk. Apache Spark, which focuses on memory-intensive operations, has taken advantage of this hardware shift to become the dominant solution for problems requiring distributed data. In this talk, we will take an introductory look at Apache Spark. We will review where it fits in the Hadoop ecosystem, cover how to get started and some of the basic functional programming concepts needed to understand Spark, and see examples of how we can use Spark to solve issues like calculating PageRank and analyzing large data sets. Kevin Feasel is a Data Platform MVP and Engineering Manager of the Predictive Analytics team at ChannelAdvisor, where he specializes in T-SQL and R development, fighting with Kafka, and pulling rabbits out of hats on demand. He is the lead contributor to Curated SQL (https://curatedsql.com), a contributing author to Tribal SQL (http://www.tribalsql.com), and one of the contributors behind We Speak Linux (https://wespeaklinux.com). A resident of Durham, North Carolina, he can be found cycling the trails along the triangle whenever the weather's nice enough.

    3
  • Open Query Store (Web-Only)

    ChannelAdvisor

    **NOTE: This will be a webinar-only meetup. We will NOT host it at ChannelAdvisor tonight.** When SQL Server 2016 was released, it offered a fantastic new feature with the Query Store. Long term, statistics based, query tuning became a reality. But what about the thousands of servers that aren't upgrading to SQL 2016 or newer? The open source project Open Query Store is designed to fulfill that need. This session will give a short introduction to the Query Store feature in SQL 2016 and then dive into the Open Query Store (OQS) solution. Enrico and William (the co-creators of the OQS project) will explain the design of OQS and demonstrate the features. You will leave this session with an understanding of the features of Query Store and Open Query Store, and a desire to implement OQS in your systems when you return to the office. William Durkin is Co-Founder and Data Platform Architect at Data Masterminds. He uses his decade of experience with SQL Server to help multinational corporations achieve their data management goals. Born in the UK and now based in Germany, William has worked as a Database Developer and DBA on projects spanning 5 continents.

    1
  • SQL Saturday Raleigh 2019 - Planning Meeting

    Needs a location

    This is an ONLINE ONLY planning meeting for SQL Saturday Raleigh 2019. If you want to help plan the event, please try to make this meeting (though if you can't make the event and want to help with planning, we're happy to have you along for the ride). Agenda: - Quick 2018 review - Finding a date (or set of acceptable dates) - Finding a venue - What should we do differently this year? - Planning committees -- sponsors, speakers, events, marketing Depending on the number of attendees, we will either do this through Google Hangouts (up to 10 attendees) or we'll work out a different service if more than 10 attendees are expected.

    3
  • Using Open Source Products to Collect Performance Metrics

    Ever had a manager standing over your shoulder, wanting to know why an instance is running slow or if it can handle additional workload? What information would you use to answer these questions? If only you knew what performance metrics to collect and had them for your existing instances to answer these questions. In this session, we will discuss sp_whoisactive and Query Store. Then we will be combining three open source tools – Telegraf, InfluxDB, and Grafana – into an inexpensive system that collects performance metrics you can use to troubleshoot issues and answer important questions about your SQL Server instances, including your Linux SQL Server instances. We will learn what metrics to collect, how to use the tools to collect performance metrics and then we’ll put it all together in an interactive dashboard for easy visualization. Attendees will see how easy it is to get good performance data and visualize in an interactive way and combine with other tools to troubleshoot issues. For an example, we will combine this solution with Query Store and/or sp_whoisactive to find a problem that occurred on a system and caused me to get my nap interrupted on a Saturday. Tracy Boggiano am a Database Administrator for FM:Systeams. She has spent over 20 years in IT and has been using SQL Server since 1999 and is currently certified as a MCSE Data Platform. She covers all aspects of administration and deal heavily with performance tuning and disaster recovery. I a co-organizer of a Special Interest Group (SIG) dedicated to advanced database administration in our local user group TriPass. She also tinkered with databases in middle/high school to keep her sports card collection organized. She volunteers as a Guardian ad Litem in North Carolina advocating for abused and neglected foster children in court.

    6
  • Advanced DBA: Columnstore Indexes: What are we waiting for?

    Every time you see a Columnstore Index getting involved in the execution plan, do you realise that there are whole execution plans behind those Index Scans ? Did you ever ask yourself, what are those strange and weird HT_* waits stand for ? Why do we wait for seconds/minutes for something like HTBUILD while it seems that nothing happens ? Why do we have a ROWGROUP_VERSION wait on one server, while the other allows queries to run faster ? This session focuses on answering those question – to help you understand the reasons and the conditions behind every single available wait for the Columnstore Indexes and the Batch Execution Mode. Niko Neugebauer is a Data Platform Consultant. A SQL Server MVP with over 20 years of experience in IT, he is passionate about the Microsoft Data Platform and community. Founder of the Portuguese SQL Server User Group and the main organizer of the first SQLSaturday event outside of North America (#78 Portugal), Niko speaks regularly at events such as PASS Summit, SQLRally, SQLBits, and SQLSaturday events around the world. Niko loves sharing information and knowledge and has authored over 130 blog posts on Columnstore Indexes, and regularly contributes to the open-sourced CISL library focused on Columnstore Indexes.

  • Data Science: Jupyter Notebooks

    Vaco Office

    Jupyter Notebooks are one of the most popular tools in data science because they facilitate reproducible research. Jupyter Notebooks are freely available and support many programming languages. In this session, I will introduce Jupyter Notebooks via a tutorial on how to create production quality visualizations in R using the ggplot. At the end of the session, you will not only know how to create a polished Jupyter Notebook, but you will also learn how to create amazing visualizations using R! Ryan Wade: I am a data analytic professional with over 20 years of experience. My education and work experience enable me to understand things from a technical, analytical, and business viewpoint. I can intuitively present complex data using sound analytical and visualization methods. I have an advanced analytical and technical skill-set. My technical background includes the following tools and programming languages: Power BI, T-SQL, Excel, DAX, R, M, Python, VBA, Office 365, SSIS, SSAS Tabular & SSRS. I also have an elementary understanding of Azure ML and Octave. I am a data analytics enthusiast, and I have spoken at R meetups, Python meetups, SQLSaturdays, and PASS Summit about various topics dealing with data analytics.

    6
  • Always Encrypted

    TEKSystems

    Data breaches abound! SQL Server has several options to encrypt the data, Transparent Data Encryption, Cell Level Encryption and SQL Server 2016 Always Encrypted. Let’s review each option to hide the encrypt the data and learn about my experiences implementing Always Encrypted. Tom Norman, Database Architect / DBA, KPA. In 1998, Tom changed his career focus to begin working with SQL Server. He has worked in all aspects of SQL Server including Administration, Database Development, BI and Reporting Services. He has worked in the Finance and Compliance industry. His experience has included International deployments. Tom is the leader of the PASS Virtualization chapter and the past President of the Denver SQL Server User Group.

    1