Skip to content

16th Swiss Big Data User Group Meeting

Photo of Christian Gügi
Hosted By
Christian G.
16th Swiss Big Data User Group Meeting

Details

Agenda

  1. Information Discovery: Exploring Data-Driven Decision-Making for Improving the Control of CERN’s Accelerator Complex by Antonio Romero, IT Database group for the CERN openlab Data Analytics Project.

CERN’s particle accelerator infrastructure is comprehensively heterogeneous. A number of critical subsystems, which represent cutting-edge technology in several engineering fields, need to be considered: cryogenics, power converters, magnet protection, etc. The historical monitoring and control data derived from these systems has persisted mainly using Oracle database technologies, but also other sorts of data formats such as JSOM, XML and plain text files. All of these must be integrated and combined in order to provide a full picture of the overall status of the accelerator complex. Therefore, a key challenge is to facilitate easy access to, flexible interaction with, and dynamic visualization of large volumes of heterogeneous data from different sources and domains.

In this presentation Antonio shares his experience with Data Discovery. This will feature practical examples relating to:

• Future possibilities for improving the control and monitoring of CERN’s accelerator complex

• Optimization results for accelerator operations

• Demo of the implemented solution

  1. Memory, Big Data, NoSQL and virtualization by Alex Bordei (@alexandrubordei), Product Manager at bigstep (http://bigstep.com/)

In-memory processing has started to become the norm in large scale data handling. This is a close to the metal analysis of highly important but often neglected aspects of memory access times and how it impacts big data and NoSQL technologies.

We cover aspects such as the TLB, the Transparent Huge Pages, the QPI Link, Hyperthreading and the impact of virtualization on high-memory footprint applications. We present benchmarks of various technologies ranging from Cloudera’s Impala to Couchbase and how they are impacted by the underlying hardware.

The key takeaway is a better understanding of how to size a cluster, how to choose a cloud provider and an instance type for big data and NoSQL workloads and why not every core or GB of RAM is created equal.

  1. Networking apero sponsored by Oracle

Sponsors

Make sure to check out our great sponsor who helped with this event.

http://photos4.meetupstatic.com/photos/event/2/9/7/4/600_435610612.jpeg

Photo of Swiss Big Data User Group group
Swiss Big Data User Group
See more events
ETH Zurich, Building CHN, Room F46
Universitätstrasse 16 · Zürich