• Scale By the Bay 2020 begins this Thursday!

    Online event

    Folks -- the first-ever online SBTB is this week!


    The online ticket is a already a low $125, and we give you 20% off that with SFGRAPHQL20:


    Some highlights:

    Martin Odersky opens SBTB on 11/12 with

    Countdown to 3!

    Matei Zaharia and Anima Anandkumar keynote

    Li Haoyi, Getting Things Done in the Scala REPL

    Julien Truffault, Monocle 3: a peek into the future

    Adam Warski, Project Loom? Better Futures? What’s next for JVM concurrent programming

    Prof. Bayer, the co-creator of B-trees, presents C-chain: the Integration of 5G and real time Blockchain

    Shameera Rathnayaka of Spotify, Materialize Typeclasses with Magnolia

    Justin Heyes-Jones, YoppWorks, Applicative: The Origin Story

    Steve Cosenza, Twitter, Rebuilding Twitter’s public API

    Greg Kesler, Intuit, Query Planning in GraphQL

    Lei Gao, Workday Goku Flow: A Self-Service Data Pipeline Builder

    Prashant Sharma, IBM, Apache Spark meets FIPS standard

    Dean Wampler, Domino Data Labs, Ray: A System for High-performance, Distributed Machine Learning Applications

    Dirk Slama, VP Co-Innovation, Bosch, AIoT: Why now? And How To?

    Antje Barth, AWS, Put Your Machine Learning on Autopilot

    We have three debate panels we are (in)famous for:

    Will AI Kill Programming?

    Were Microservices a Huge Mistake?

    Programming Languages in the Era of the Cloud

    See the full program at https://www.scale.bythebay.io/schedule, and register!

  • Reactive Summit + SBTB 2020 CFP Open through July 31

    Online event

    Scale By the Bay is now following Reactive Summit, the conference of the Reactive Foundation, a Linux Foundation project focused on cloud-native applications.

    We're happy to report that Martin Odersky and Matei Zaharia, the creators of Scala and Spark, will keynote SBTB 2020, among other awesome keynote speakers.

    The general CFP is open through June 30.

    There's still time to submit a talk: https://scale.bythebay.io!

    Looking forward to more great speakers to join us in November.

  • A Tour of Caliban: FP GraphQL in Scala & Understanding Scala's Type System

    -------How to Join our online/Virtual event ------
    - Before the meetup visit https://www.sfbayareatech.xyz/ and enter your details to join our slack community. Once you submit your email address and name you will receive an invite to join so check your email.
    - Once you have joined our slack community please join the #sfscala channel
    - This is an SFScala event please RSVP to this event on the SFScala Meetup page here: https://www.meetup.com/SF-Scala/events/269884306/
    - Location part of this event page will have a link to Zoom to join the event. We will update this page with the link closer to the event. You will only be able to see the link if you have RSVP to this event.
    - Any questions please contact me (Salar Rahmanian) via our slack community or meetup.
    ------PLEASE FOLLOW ABOVE Instructions before event---------

    Talk1 Title: A Tour of Caliban: Functional GraphQL in Scala

    Caliban is a library for creating GraphQL backends in Scala. It was designed with the goal of reducing boilerplate to a minimum while exposing a purely functional interface. In this talk, we’ll discover how to create a simple GraphQL API from the ground up, then we’ll dig into advanced features such as query optimization and middlewares. Finally, we will take a look at the recently released GraphQL client support.

    Pierre Ricadat is a French engineer based in Seoul, South Korea. He’s been working on distributed systems architecture and development for the last 10 years. A few years ago he discovered the power of functional programming and started contributing to various projects within the ZIO ecosystem. He created Caliban in September 2019.

    Talk2 Title: Understanding Scala's Type System

    Scala has many types of types. This talk will take you on a tour of Scala’s type system, show you how it is evolving in Scala 3, and help you understand how everything fits together.

    When I teach Scala, I find that many Scala programmers aren’t familiar with the many kinds of types that Scala’s type system encompasses. In this talk I would like to try and cover all the types of types in Scala, such as nominal, structural, singleton, refinement, higher-kinded, parameterized, bounded, abstract, path-dependent, sub-, super-, union, intersectionand touch on variance to boot.

    Speaker Bio:
    Bill Venners is president of Artima, Inc., provider of Scala consulting, training, books, and tools. He leads the open source projects for the ScalaTest testing library and the Scalactic library for functional, object-oriented programming. He is coauthor with Martin Odersky and Lex Spoon of the book, Programming in Scala. And he is a community representative on the Scala Center's Advisory Board.


    6pm Welcome to SFScala by Alexy Khrabrov & Salar Rahmanian
    6.15pm A Tour of Caliban: Functional GraphQL in Scala by Pierre Ricadat
    7pm Understanding Scala's Type System by Bill Venners
    8pm Wrap up and continue conversations on Slack

    Note: Bill Venners is running a workshop that week, a hands-on Effective Scala training course April 21-24 over Zoom, details here:


    Scala is an unopinionated language: It offers a wide array of choices, from familiar object-oriented features to sophisticated functional programming tools. While these choices offer great freedom, they can be bewildering. This workshop will help you learn the most effective ways to apply Scala to real-world programming problems.

    Contact Salar Rahmanian via Slack for a discount code for this workshop for SFScala Members.

  • [External Registration][Conference] Scale By the Bay 2019, November 13-15

    This year Scale By the Bay (https://scale.bythebay.io) runs for only two days. But we packed an incredible 70 sessions in these two days! We start with a hot breakfast and excellent coffee. Coffee never ends -- continuous uninterruptible supply of great coffee is a hallmark of every conference By the Bay. Each morning there is a keynote where we all gather as a community, and a panel closing each conference day where we all get together again before the happy hour -- also every day.

    The heart of the conference are its iconic four tracks: Thoughtful Software Engineering, Service Architectures, End-to-end Data Pipelines up to ML/AI, which we historically call Functional, Reactive, and Data. That's three right? The fourth is the hallway track -- and we're legendary for it!

    The core theme this year is Distributed Systems. Joe Beda, Principal Engineer at VMware and the co-creator of Kubernetes, keynotes one day, and Heather Miller, Professor at CMU and former Director of the Scala Center, keynotes the other. We have multiple talks considering cloud deployments on Kubernetes in concert with other systems, such as Kafka, Spark, and Flink. We cover important new directions with Unison, and inherent issues such as Change Data Capture from Disney Streaming. We will learn about the new GIS features for Google BigQuery from their author, about the Databricks Data Lake approach, and infrastructure as code at Target.

    Our "reactive" track started as reactive microservice architectures but came to encompass all kinds of systems, as well as data manipulation techniques. We'll see how Lyft is enabling real-time queries with Apache Kafka, Flink and Druid. We'll hear about the lessons learned developing and running Netty from its creator. We see how Serverless is developed at Google.

    Machine Learning and AI are only as scalable as the data pipeline feeding them. Moreover, you need to ensure your data is typesafe and your predictions are based on the data whose integrity or even privacy is provable. This year, we have three talks on Swift for TensorFlow, including from the original Google team developing it, as well as Coinbase and Quarkworks. We hear from Sony Entertainment on near real-time, low latency predictions, and many many other leaders.

    And we'll uphold the rigorous and thoughtful software engineering that is underpinning of every system scalable in time and tech space -- a system that can deliver but also grow with companies and their people. We'll hear from Comcast and Netflix on human-centric software engineering and ML organizations. We'll hear about community-first Open-Source approaches. We'll see how F# invigorates .Net ecosystem with functional approach, including JavaScript apps, and how Scala with React is doing the same for the full-stack development on the JVM. We'll hear about Rust, Haskell, Scala, Java, Python, F# and other ecosystems used for quality development and production deployment. We'll see how the sausage is made at JetBrains to power our IDEs. We'll dig deeper into GraalVM with Oracle and Twitter, as well as Scala Native.

    We pioneered GraphQL at Scale By the Bay three years ago when almost nobody heard about it. Furthermore, our focus was not on the frontend alone but on middleware usage of GraphQL. This year Nick Schrock, a co-creator of GraphQL, joins us.

    The day before the conference, we run a bespoke, all-day, hands-on training that we build specifically for SBTB. This year, it's Portable Serverless Workshop with Ryan Knight and James Ward. James is now at GCP and has a driver seat going to the serverless future. You'll go home with a complete serverless backend under your belt!

    All in all, we'll have a lot of fun, pack a year of learning in just two or free days, and again experience the magic that makes Scale By the Bay a legend! Reserve your Early Bird seat soon at https://scale.bythebay.io.


    Note: ML Model Versioning, Deployment, and Monitoring are core themes of the https://scale.bythebay.io 2019, 11/14-15, Oakland. Reserve your seat today using the code MEETSFGRAPHQL15 for 15% off all passes, including the complete Serverless workshop!

    Joint meetup -- please RSVP at http://bay.area.ai!


    Models are the new code. While machine learning models are increasingly being used to make critical product and business decisions, the process of developing and deploying ML models remain ad-hoc. In the “wild-west” of data science and ML tools, versioning, management, and deployment of models are massive hurdles in making ML efforts successful. As creators of ModelDB, an open-source model management solution developed at MIT CSAIL, we have helped manage and deploy a host of models ranging from cutting-edge deep learning models to traditional ML models in finance. In each of these applications, we have found that the key to enabling production ML is an often-overlooked but critical step: model versioning. Without a means to uniquely identify, reproduce, or rollback a model, production ML pipelines remain brittle and unreliable. In this talk, we draw upon our experience with ModelDB and Verta to present best practices and tools for model versioning and how having a robust versioning solution (akin to Git for code) can streamlining DS/ML, enable rapid deployment, and ensure high quality of deployed ML models.

    Speakers: Manasi Vartak, CEO, Verta.ai, Conrado Miranda, CTO, Verta.ai

    Manasi Vartak is the founder and CEO of Verta.ai (www.verta.ai), an MIT-spinoff building software to enable high-velocity machine learning. Manasi previously worked on deep learning for content recommendation as part of the feed-ranking team at Twitter and dynamic ad-targeting at Google.

    Conrado Miranda is the CTO at Verta.AI. Conrado has a PhD in Machine Learning and a focus on building platforms for AI. He was the tech lead for the Deep Learning platform at Twitter’s Cortex, where he designed and led the implementation of TensorFlow for model development and PySpark for data analysis and engineering. He also led efforts on NVIDIA’s self-driving car initiative, including the Machine Learning platform, large scale inference for the Drive stack, and build and CI for Deep Learning models.

    (2) Model Monitoring in Production

    Machine Learning models continuously discover new data patterns in production they have never seen during training and testing iterations.

    The best offline experiment can lose in production. The most accurate model is not always tolerant to a minor data drift or adversarial input. Neither prodops, data science or engineering teams are skilled to detect, monitor and debug model degradation behaviour.

    Real mission critical AI systems require advanced monitoring and model observability ecosystem which enables continuous and reliable delivery of machine learning models into production. Common production incidents include:
    - Data anomalies
    - Data drifts, new data, wrong features
    - Vulnerability issues, adversarial attacks
    - Concept drifts, new concepts, expected model degradation
    - Domain drift
    - Biased Training set

    In this demo based talk we discuss algorithms for monitoring text and image use cases as well as for classical tabular datasets.
    Demo part will cover the full cycle of machine learning model in production:
    Model training and deployment with Kubeflow pipelines
    Production traffic simulation
    Model monitoring metrics configuration
    Data drift detection
    Drift exploration and monitoring metadata mining
    New training dataset generation from production feature store
    Model retraining and redeployment

    Stepan Pushkarev is a CTO of Hydrosphere.io - Model Management platform and co-founder of Provectus - an AI Solutions provider and consultancy, a parent company of Hydrosphere.io.

  • Integrating React with Scala at Domino Data Lab

    Domino Data Lab

    Do you hate JavaScript? Have you ever had a runtime error in your frontend? Is your UI written in Scala but you can't hire any frontend developers to work on it or backend developers that want to? Come hear about how Domino Data Lab faced these exact problems. Through trial and error we developed a way to inject React code into Scala Play, which maintains typesafety and takes advantage of JavaScript libraries to make frontend building faster.

    About the speaker:

    Niole Nelson builds Domino Data Lab's frontend. She likes to focus on making it harder to introduce runtime bugs and speed up development. While she isn't doing that, she is usually weight lifting or rock climbing. Her newest hobby is watching league of legends streamers, though she doesn't actually play the game.

  • Scale By the Bay 2019 CFP Open until May 31

    Needs a location

    Friends — the month of May is when the Scale By the Bay (SBTB) CFP always runs, for the conference in November.

    The CFP is now open at


    There are three tracks, as usual:

    — Functional Programming
    — Service Architectures
    — Data Pipelines, including ML/AI

    The theme for this year is the emergence of new distributed systems and their applications, including Edge, IoT, DLT, and AI on the Edge. Helena Edelson lead a team at Apple enabling ML/AI with Spark, Joe Beda started Google Compute Engine and Kubernetes, and Heather Miller lead Scala Center at EPFL and now advances distributed and edge systems at CMU.

    We have two talk lengths, 20 minutes and 40 minutes. There are 5-10 minute breaks between some, but not all, talk slots, and excellent coffee is served all day long so every break is a coffee break. Please check each time length you can work with. We often ask 40 min talks to shrink to 20 min as we try to accommodate all the best talks — and our acceptabnce rate is going down to 1:3 with years.

    We also serve hot breakfast and great lunch and amazing happy hours follow the main program in between all days. The hallway track is legendary, facilitated by the high ratio of speakers — 100+ out of the 600 attendees.

    We are committed to community above all and are working with underrepresented groups to send speakers. Please share this CFP with your diversity advocates, community managers, and encourage female engineers, African-American developers, and others to submit talks. If you could send such speakers on behalf of your company, it will help the community a lot. We’re also proactively reaching out to meetups, our core constituents, to help our established diversity program. We also work with companies like Stripe on diversity scholarships — let us know if you’d like to partner on this.

    Submit your best talks at https://scale.bythebay.io by May 31!

  • (1) Migrating Medium to GraphQL (2) Thunder, Samsara GraphQL IoT Data Platform

    Join us at Samsara for two great talks about GraphQL in production, spanning both hardware and software stacks!

    (1) 2 Fast 2 Furious: Migrating Medium's Architecture Without Slowing Down

    We’re shifting gears to leverage new technologies created since we built Medium 5 years ago, but we need to incrementally gain benefits from the new system along the way and we can’t afford to let it hinder feature development. By taking advantage of GraphQL’s flexibility and our existing infrastructure, we’re able to make widespread yet gradual architectural changes! Come see how Medium is changing lanes without slowing down. Anyone thinking about moving to GraphQL (or thinking about migrating an exisiting architecture in general) can benefit from this talk, but especially anyone who is building their own GraphQL server or needs practical advice on how to successfully migrate a legacy system to GraphQL without “stopping the world,” getting defunded partway through, or building a system no one uses. Abstract Migrating an entire system to new tools and frameworks isn’t an easy task. And doing that while not impacting feature development? That’s even harder. We’ll walk through how Medium is migrating off of our existing system, without hindering product development, and while also incrementally gaining the benefits of a new system along the way. We’ll go over the design of our new architecture, our phased migration approach, and how the layered structure of our GraphQL server (written in Scala with Sangria) was integral to the success of both.

    - Goals of the migration
    - Design of the new system
    - Phased approach
    - Phase 1: developer experience
    - IDLs (protobuf) + GraphQL
    - Phase 2: services + gRPC
    - GraphQL server layers
    - Fetchers
    - Repos
    - Schema (derivation)
    - Putting it all together

    Sasha Solomon is a Senior Software Engineer helping build the next generation API on Twitter's GraphQL team. She was previously the Tech Lead of the Platforms team at Medium working on GraphQL as well. She has spoken at GraphQL Conf, GraphQL Asia, Scale by the Bay (2019 CFP runs May 1-31 at http://scale.bythebay.io), and other meetups on various topics related to GraphQL and infrastructure. In her spare time, she enjoys tabletop RPGs, karaoke, and most recently, building mechanical keyboards.

    (2) GraphQL for the Samsara Sensor Data Plaform

    Samsara is a sensor data platform for operations, supporting a diverse set of features - everything from tracking vehicles in a fleet to monitoring quality control on a production line. In this talk, we’ll explore the origins and evolution of our GraphQL systems over the last three years and how they’ve helped us ship great products quickly and safely.

    Stephen is an infrastructure engineer at Samsara where he spends time thinking about making developer tools easy to understand, maintain, and use. He has worked on building both frontend and backend infrastructure to empower product teams and is a core contributor to Samsara’s open-source GraphQL framework, Thunder.

  • Distributed GraphQL for Enterprise Systems

    Credit Karma

    This is a joint event with SF Scala:


    (1) How we distributed our GraphQL across services to simplify workflows and increase developer productivity.

    Bio: Barry Dutton is a Senior Software developer at Credit Karma who has been instrumental in helping the Credit Karma team incorporate GraphQL into its microservices architecture. Before working at Credit Karma, he spent nearly a decade building core products and services at major tech companies, and ships code that reaches tens of millions of users.

    (2) Building a high-performance realtime GraphQL engine on Postgres at Hasura

    In this talk I'll talk about our unique approach to building a GraphQL server, by transpiling/compiling GraphQL to SQL instead of using resolvers. This approach allows low-latency, scalable querying with an extremely low-footprint, which becomes especially valuable for large GraphQL queries that fetch non-trivial amounts of data. These are the topics I'll cover:
    -- Compiling GraphQL to SQL
    -- Adding application user authorization primitives
    -- Implementing scalable, realtime with GraphQL subscriptions
    -- Benchmarks to show latency, concurrent connections

    Bio: Tanmai Gopal is the co-founder of hasura.io. He is a StackOverflow powered fullstack, polyglot developer whose areas of interest and work span react, GraphQL, nodejs, python, haskell, docker, postgres, kubernetes. He is passionate about making it easy to build things and is the instructor of India's largest MOOC imad.tech with over 250,000 students.

  • Scale By the Bay 2018


    Dear Friends — we are proud to announce the program of Scale By the Bay 2018, our sixth year of the flagship, and by now iconic, independent developer conference By the Bay. (Tl;dr: get your spot at http://scale.bythebay.io while supplies last, and especially when Early Bird is in effect until August 31.)

    We have great GraphQL talks at scale.bythebay.io, held at Twitter in November! Check out

    https://sched.co/FmES Transpiling GraphQL instead of writing customized server code -- Twitter Engineering

    https://sched.co/FmEh 2 Fast 2 Furious: migrating Medium's architecture without slowing down -- Medium Engineering

    https://sched.co/FmFC Edit Speaker Tools Graph-First Services Using GraphQL -- Twitter

    https://sched.co/FmF9 Edit Speaker Tools Orchestrating Microservices with GraphQL -- Ituit

    These talks link with many others showing full-stack architectures, microservices, legacy updates, and data pipelines. The conference is the best full stack in a box developer experience that goes all the way from devops to AI as done by engineers, who need results.

    The conference follows the established three-day, three track structure, hosted for the third year in a row by Twitter HQ in its wonderful modern building, with all of its spacious tracks, community spaces, cozy booths, and the commons area where so many connections are made during the hallway track.

    This year, Martin Odersky, the creator of Scala, opens the main conference on November 15. Neha Narkhede, the co-creator of Kafka and cofounder of Confluent, is keynoting the day 2.

    The three tracks are
    — Functional and Thoughtful Programming
    — Reactive Microservices and Streaming Architectures
    — End-to-end Data Pipelines all the way up to Machine Learning and AI

    The 100 sessions include technology leaders such as Twitter, IBM, Microsoft, Salesforce, Fauna, DataStax, Databricks, Confluent, Credit Karma, Sumo Logic, GoPro, Buoyant, Workday, Zignal Labs, and many more. We cover your tools with JetBrains, your shopping with Best Buy and Target, your vacations with HomeAway, your listening with Spotify, your viewing with Netflix, your reading with Medium, and your banking with JP Morgan Chase. The list goes on and on and on — we have the most of the advanced stacks and approaches employed by the best that Silicon Valley offers to the world at scale, shared as best practices, with code, yours to learn, take home, and build upon.

    Our speakers span the whole spectrum from the first-time presenters with leading companies to veterans of SBTB going all the way back to 2013, evolving their craft before our eyes. You can follow their progress by watching their previous talks on http://functional.tv and the photos of the past conferences at

    The three panels, closing each day, are:

    — Thoughtful Software Engineering
    — Data Engineering for AI, and
    — Cloud, Edge, and Silver Lining.

    Each day begins with a hot breakfast, that begins an uninterruptible supply of Philz coffee through the whole day, and lunch is provided. On the first two days, the closing panels are followed by our signature happy hours, with great drinks, food, and conversation. The hallway tracks are legendary.

    SBTB is famous for its bespoke, all-day, build-yourself-a-company training. This year, we double it. Cliff Click, the legend of software engineering, is teaching a full day Advanced Software Engineering workshop on 11/13, followed by Ryan Knight, now of Fauna, leading cloud-native data pipelines on 11/14. The workshops are limited by 80 participants each.

    As last year, we’ll plan an unconference track for those who want to share their ideas in an intimate setting for joint brainstorming.

    We are always sold out by the time the conference begins in November — so reserve your seat early at http://scale.bythebay.io!