• Insight and Creativity in Distributed Programming + Database In-Memory examples

    BootUp Ventures: Startup Ecosystem Co-Working & Office Suites

    Join us in Menlo Park on August 20 for two great talks plus dinner, beverages and stellar raffle prizes! This free event is sponsored GridGain Systems. (This is a special joint-event with the Bay Area In-Memory Computing Meetup.) (Everyone who registers for the raffle will receive a FREE ticket to the In-Memory Computing Summit, Nov. 13-14! See below for details) Speakers: > Nikita Ivanov, Founder & CTO, GridGain Systems > Andy Rivenes [@TheInMemoryGuy], Product Manager at Oracle for Database In-Memory >> Talk 1: Nikita. "Ignite Pearls - Insight and Creativity in Distributed Programming." Abstract: This presentation is full of small case studies, real examples, and interesting exercises for learning about how to do a modern distributed programming with Apache Ignite. (Apache Ignite is an open-source distributed database, caching and processing platform designed to store and compute on large volumes of data across a cluster of nodes.) https://ignite.apache.org/whatisignite.html >> Talk 2: Andy. "Oracle Database In-Memory by Example" Abstract: Starting with Oracle Database 12c a new In-Memory column store is available and is compatible with all existing applications. But how does it work and what steps are required to get started? This session will show step by step examples to demonstrate how Oracle's in-memory dual format enables both super-fast analytics and online transaction processing. It will also show how Database In-Memory integrates with Oracle Database's existing features. By attending this session, you will arm yourself with the necessary knowledge to not only get started with Database In-Memory but to dramatically improve the performance of your analytic workload. *** Raffle prizes > 1st Prize: Xbox One S 1TB Console > 2nd Prize: SoundPEATS True Wireless Earbuds > 3rd Prize: Color Gameboy/Nintendo console with 168 classic games >>> EVERYONE who registers for the raffle will receive a FREE ticket to the In-Memory Computing Summit, Nov. 13-14 at the Hyatt San Francisco Airport conference center <<< Be sure to RSVP for the meetup. Afterward, click the link below (or scan the QR code in the photos section) to register for the raffle! http://bit.ly/Aug20GridGainRaffle

    2
  • Talks from open-source Alluxio & Ignite + how to accelerate analytic queries!

    BootUp Ventures: Startup Ecosystem Co-Working & Office Suites

    Join us June 24 in Menlo Park for our next meetup! We'll have 3 valuable talks, a delicious BBQ dinner and amazing summertime-themed raffle prizes! This free event is sponsored by GridGain Systems and Oracle. > Talk 1: "Best Practices for Native Persistence and Data Recovery." With Valentin (Val) Kulichenko, lead architect at GridGain Systems. > Talk 2: "Feeding data to the Kubernetes beast: bringing data locality to your containerized big data workloads." With Bin Fan, founding engineer of Alluxio, Inc. and PMC member of Alluxio open source project. > Talk 3: Talk details coming soon! With Doug Hood, Oracle TimesTen Scaleout evangelist. Raffle prizes. * 1st prize: REMALI 4K Sports Action Camera! * 2nd prize: Fitbit Inspire HR Heart Rate & Fitness Tracker! * 3rd prize: Polaroid Originals OneStep 2 VF - Summer Blue! [Register for the raffle here: http://bit.ly/June24Meetup ] Talk details >> Talk 1 (Val): As an in-memory computing platform, GridGain and Apache Ignite support native persistence that stores data and indexes transparently on non-volatile memory, SSD or disk. When persistence is enabled, memory becomes a cache for the most frequently used data and indexes. Native persistence is ACID-compliant, durable and enables immediate availability on a restart of each node. Data is never lost; GridGain supports full and incremental snapshots along with continuous archiving, and provides Point-in-Time recovery to an individual transaction. Val will share insights into the underlying architecture and best practices for implementing native persistence in production. His talk will cover: - An architectural overview of native persistence, and centralized backup and recovery - Tips and tricks for for configuring and managing persistence - Best practices for checkpointing, using the Write-Ahead Log (WAL) and restoring from a failure - Performance tuning recommendations to balance durability and performance, including how to create snapshots under load >> Talk 2 (Bin): The latest advances in container orchestration by Kubernetes bring cost savings and flexibility to compute workloads in public/hybrid cloud / multi cloud environment. Such architectures typically lead to physically separated compute and storage services, where S3, Azure Data Lake or Google Cloud Storage are commonly used to provide data persistence. Open source Alluxio approaches this problem in a new way. It helps elastic compute workloads realize the true benefits of the cloud, while bringing data locality and data accessibility to workloads orchestrated by Kubernetes. Alluxio can orchestrate data locality from any persistent storage including object store such as Ceph and cloud storage such as AWS S3 or GCS and make it accessible to compute running in Kubernetes pods. As a stateless data access layer (as opposed to a long-running data storage daemon service), Alluxio runs as a native service making data-intensive compute workloads Kubernetes friendly. Bin will explain how this new approach of bringing data locality to data-intensive compute workloads in Kubernetes environments. He'll also cover the Alluxio, architecture, deployment with Kubernetes and real-world production use cases. >> Talk 3 (Doug): Details coming soon!

  • LARGE (a billion+ objects) DB search; SQL Caching; in-memory troubleshooting!

    BootUp Ventures: Startup Ecosystem Co-Working & Office Suites

    Join us May 23 in Menlo Park! This free event is sponsored by GridGain Systems and Oracle. Hot food, cold beverages (3 great talks) and cool raffle prizes (see below)! The meetup will run from 6-8 p.m. at BootUp, 68 Willow Road · Menlo Park, CA. RSVP now to reserve your spot because space will be limited! The talks > Talk 1: "Searching large databases with a billion objects or more!" with George Williams [Director of Data Science and Chief Evangelist, Embedded AI at GSI Technology]. > Talk 2: "Low Latency SQL Caching Without Custom Code" with Doug Hood [Oracle TimesTen Scaleout evangelist] > Talk 3: "Troubleshooting Apache Ignite (and best practices)" with Stan Lukyanov, [software engineer at GridGain Systems] Be sure to register for our raffle, too! Do that in advance here: http://bit.ly/May23raffle Talk details >> Talk 1 (George): What do face recognition, visual e-commerce, and drug discovery have in common? These days they involve searching large databases with a billion objects or more. Unlike traditional database search, exact match to a query is not the goal. Instead similarity search is employed to retrieve the closest items. One way to scale similarity search is to use specialized hardware and I’ll talk about GSI Technology’s custom chip that accelerates large-scale similarity search. GSI Technology designs, develops and markets a broad range of high performance memory products for networking, military, medical, automotive and other applications. >> Talk 2 (Doug): Learn how an In-Memory Database can be used as a read/write cache to dramatically increase the performance of SQL statements. See how this can be achieved using either SQL metadata or drag and drop. See how these techniques are being used by airlines, payment processors and telecom companies. >> Talk 3 (Stan): Summary: Whether you are getting started with Apache Ignite or have already deployed, this session is for you. Stan will explain how to set up deployments to make them easier to monitor, manage and keep up and running properly. He'll also hare best practice examples on how to: * Configure Ignite and GridGain for deployment, management and monitoring * Leverage log files during troubleshooting * Use monitoring interfaces and tools such as JMX, Visor and Web Console * Identify and fix top errors for newly installed and existing deployments ******** Be sure to register for our raffle, too! Do that in advance here: http://bit.ly/May23raffle > 1st prize: Xbox One S 1TB Console >> 2nd prize: Portal from Facebook. Smart, Hands-Free Video Calling with Alexa Built-in >>> 3rd prize (mystery prize!)

  • 2 Talks! Analyzing streaming data + Columnar Formatted Data for Analytics

    BootUp Ventures: Startup Ecosystem Co-Working & Office Suites

    Join us April 30 in Menlo Park for a joint-event with the Bay Area In-Memory Computing Meetup! Great food, drinks, raffle prizes -- and three stellar talks! Our speakers: > Pat Patterson [Director of Evangelism at StreamSets] > Andy Rivenes [@TheInMemoryGuy: Product Manager at Oracle for Database In-Memory] AGENDA: * 5:45 p.m. -- Dinner, drinks & networking * 6:10 p.m. -- Talk 1 (Pat): "Ingesting Streaming Data for Analysis" * 7:05 p.m. -- Talk 2 (Andy): “Oracle Database In-Memory – Columnar Formatted Data for Analytics” * 7: 50 p.m. -- Raffle drawings and closing remarks {Register here for the raffle: http://bit.ly/April30IMCmeetup } >>1st Prize: Lenovo Chromebook! >>2nd Prize: SwissGear Wenger Ibex Laptop Backpack! >>3rd Prize: Vintage-style "Suitcase Record Player" with 3-Speed Turntable! * 8:00 p.m. Finis! **** TALK DETAILS >> Talk 1 (Pat): This session, aimed at data architects, data engineers and developers, will explore how we can use the open source StreamSets Data Collector to build robust data pipelines. Attendees will learn how to collect data from cloud platforms such as Amazon and Salesforce, devices, relational databases and other sources, continuously stream it to Ignite, and then use features such as Ignite's continuous queries to perform streaming analysis. Pat will start by covering the basics of reading files from disk, move on to relational databases, then look at more challenging sources such as APIs and message queues. You will learn how to: * Build data pipelines to ingest a wide variety of data into Apache Ignite * Anticipate and manage data drift to ensure that data keeps flowing * Perform simple and complex ad-hoc queries in Ignite via SQL * Write applications using Ignite to run continuous queries, combining data from multiple sources >> Talk 2 (Andy): Analytic queries typically scan large amounts of data using aggregations to find patterns or trends in the data. In a traditional row-based database this can be slow because each row must be examined to access the columns in a query. Columnar formatted data does not have this problem because just the columns in the query need to be accessed. In addition, columnar formatted data tends to compress well and work well with vectorized processing like Single Instruction Multiple Data (SIMD). Oracle Database In-Memory can transform existing row-format database objects into an in-memory columnar format. These columnar formatted objects can be queried at orders of magnitude faster speed than the equivalent row format. This session will explore how this columnar format provides such a dramatic performance improvement for analytic queries, and how it works with the rest of Oracle Database so that no application changes are required. *** See you April 30! Please RSVP because space will be limited!

  • Distributed databases, Node.js & how to scale trivial workloads

    BootUp Ventures: Startup Ecosystem Co-Working & Office Suites

    Join us March 26 in Menlo Park for three great talks, authentic Jamaican food, drinks and some amazing raffle prizes (PlayStation 4 & more). This free event is sponsored by GridGain Systems. Venue: Bootup Ventures, 68 Willow Road · Menlo Park, CA Speakers: * Kevin Xu, general manager of global strategy and operations at PingCAP * Denis Magda, vice president of product management at GridGain Systems and Apache Ignite vice president * Doug Hood, Oracle TimesTen Scaleout evangelist AGENDA: * 6 p.m. – Authentic Jamaican and cold drinks * 6:10 p.m. -- Talk 1 (Kevin): "Deploying TiDB a distributed open-source MySQL-Compatible HTAP Database with K8s." * 6:45 p.m. -- Talk 2 (Denis): "Apache Ignite for Node.js Developers." * 7: 25 -- Talk 3 (Doug): "How to scale non trivial applications" * 8 p.m. Raffle drawings (register here: http://bit.ly/March26Meetup ) > 1st Prize: PlayStation 4 Slim 1TB Console! > 2rd Prize: Retro CASIO calculator watch! * 8:05 p.m. Finis! >> Talk 1 (Kevin): PingCAP's TiDB is a cloud-native MySQL-compatible distributed database. Its key-value storage layer, TiKV, is a CNCF member project. Kevin's talk will introduce the architecture of TiDB and TiKV (a CNCF project), key use cases, a user story with Mobike (one of the largest bikesharing platforms in the world), and a demo of deploying TiDB + TiKV as a stateful application using Kubernetes in a Operator pattern on GKE. (https://www.pingcap.com/en/) >> Talk 2 (Denis): Node.js is a very popular and powerful JavaScript runtime environment. It is lightweight and efficient and benefits from an extensive package ecosystem. Apache Ignite is a memory-centric distributed database, caching, and processing platform for transactional, analytical, and streaming workloads delivering in-memory speeds at scale. The benefits of Ignite are now available to Node.js developers with the addition of the Node.js Thin Client for Ignite. (https://ignite.apache.org/) In this talk, using examples, Denis will cover the specifics of how to use Node.js with Ignite, including: * Instantiating an Ignite Client * Creating an Ignite Client Configuration * Connecting a Client to an Ignite Cluster Node * Obtaining an Ignite Storage Instance * Configuring an Ignite Storage Instance * Performing Key-Value Queries * Performing SQL, SQL Fields and Scan Queries At the end of his talk, attendees will have a solid understanding of how to use Node.js with Ignite and can then use this knowledge to build scalable Node.js applications with Ignite. >> Talk 3 (Doug): Scaling trivial workloads is easy (eg 38 million YCSB B ops/sec or 1.4 Billion SQL queries per seconds), but what if your workload is not trivial? This talk will cover a customer use case for how to scale with: - Five table joins where the data is distributed [hashed] over many machines - ACID Transactions where the unit of work has seven updates and seven queries - The database must be highly available and 'just work' - A performance comparison with other well know distributed databases (http://bit.ly/OracleTimesTenScaleout) * * * We expect space to be limited for this event so please RSVP to reserve your spot and help us plan for food and drinks. And don't forget to register for the raffle! Do that in advance here: http://bit.ly/March26Meetup