UC Berkeley research labs have been at the forefront of the design and planning for much of today's cloud computing industry. Meanwhile, the University's own teams run cloud-based production systems, including those that support the teaching of data science using platforms like JupyterHub (also developed at UC Berkeley). This meetup aims to bring together students, researchers, faculty, IT staff and the wider local community to learn, share and discuss our work in designing and operating services built on cloud computing architectures. This is a welcoming, inclusive community, and its meetups are open to everyone who is curious about the topic -- no need to have any technical knowledge or experience to join!

We will feature speakers and engaging discussions around current issues in cloud computing, often with an enterprise focus. Presenters are invited to share their experience in a ~15 minute story framed around 3 questions:

  1. WHY - Start with Why & the business value/value to mission. Why did you need cloud computing? Describe the big vision, business problem and/or academic challenge (not technical!) that you were trying to solve. (For example, teaching thousands of Berkeley students data science and making it come alive, or enabling citizen scientists around the world to crowdsource bird migration data to ascertain patterns...)
  2. HOW - How did you end up using cloud computing to solve the challenge? How did you architect, design and build it? How do the pieces fit together and how does it work?
  3. WHAT - What was the result? What did you learn? Share your war stories, lessons learned, insights, gotchas, etc.

This month features Patrick Huck, a Senior Computer Systems Engineer at Lawrence Berkeley National Laboratory, who will discuss how his team is transitioning the Materials Project (MP, https://materialsproject.org) to fully operate in the cloud.

The Materials Project, a public initiative supported by funding from the US Department of Energy (DOE), computes properties of all known materials to remove guesswork from materials design. Discovery of novel materials can be targeted to the most promising compounds by screening MP’s computational data sets and applying machine learning algorithms to them. Over 300,000 registered users from industry and academia make heavy use of MP’s popular online gateway and API. Transitioning MP to fully operate in the cloud has therefore become a cornerstone of MP’s longevity.

In this talk, we will present how MP runs a microservices-based network architecture on the AWS cloud and uses Datadog’s cloud monitoring services for end-to-end visibility into its elastic cloud resources. As MP moves its data products into the AWS Open Data Sponsorship Program, it is also building the foundation for gradually transitioning its data processing pipelines and the required computing to the cloud. The new cloud architecture allows MP to meet modern requirements for high availability, rapid solutions, increased security and scalability. The small core team implements infrastructure-as-code from the ground up to efficiently use its limited cloud computing and human resources.

About the presenter
Patrick holds M.Sc. and B.Sc. degrees (Dipl. Phys.) in Physics from the Technical University of Munich (Germany). He obtained his Ph.D. (Dr. phil. nat.) in high-energy nuclear physics from Goethe University in Frankfurt, based on his work as a member of the large international STAR collaboration at the Relativistic Heavy Ion Collider at Brookhaven National Laboratory. Since 2014, he's been a core member of the Materials Project (https://materialsproject.org) at Lawrence Berkeley National Laboratory and is now the project's Senior Computer Systems Engineer, in charge of its Cloud & Scientific Data Architecture.

Everyone is welcome - this is a really inclusive group! Even when the talks seem very technical, you don't have to be an expert or already know about cloud technologies; we get together so people can learn from each other.

