Getting started with fault tolerance

Until now, we've been creating components that "work", and getting them to work has been quite easy. We haven't taken into account the various types of failure. In production, a lot of things can go wrong. With the right(!) combination of stupidity, bad luck, chicken blood and lightning strikes, we can get deadlocked or livelocked, our nodes could go down, and the universe might explode. An exhaustive discussion of fault tolerance could span years if not decades, but in this session we'll look at some things we can do to deal with failures without bringing our applications to their knees (or worse).

This session is likely to involve more diagrams and discussion than usual, but there will probably be some coding too. Remember your laptops :)

Some basic skeleton code is likely to be sent out before the session. There are only so many times we can start from scratch, and having the boilerplate ready should save some time. Watch this space!

Please remember to register on http://skillsmatter.com/event/open-source-dot-net/getting-started-with-fault-tollerance as well. It helps with administration considerably.

  • Rob

    http://zguide.zeromq.org/page:all

    The ZeroMQ guide, mentioned in the talk by Andy and Ashic.

    August 27, 2013

  • Andy P.

    comment from Nathan Marz in a presentation called "Lessons Learned Building Storm"

    http://www.infoq.com/presentations/storm-lessons
    42:50 Q: “What do you think about the ZeroMQ library?”

    A: pause… It saved a lot of time initially. Then we ran into a lot of problems with it. A lot of limitations. So we’re actually getting rid of ZeroMQ in Storm … We’re replacing ZeroMQ with Netty. A lot of problems I had with ZeroMQ were, like, a lot of weird stuff with the community … Lots of regression in the code. Nothing above 2.1.7 works … It just a weird, very unstable piece of software. It could have been a really awesome library. But it just has all these weird problems with it.

    August 26, 2013

    • Ashic M.

      Ah yes...it's unfortunate. ZeroMQ went through a period of turmoil, and that's partly why 3.x came about. Breaking the client API was hotly debated, and I believe projects like Crossroads were started because some members did not agree with the way forward. As of now, there's a firm direction in place: client APIs will have additive changes only, and no existing API will ever be broken.

      August 27, 2013

  • Ashic M.

    Cross registration page with our awesome hosts Skillsmatter is up at http://skillsmatter.com/event/open-source-dot-net/getting-started-with-fault-tollerance . Please take a minute to register. It helps with administration considerably.

    August 18, 2013

  • Andy P.

    Do we still need to cross-register at Skillsmatter?

    August 14, 2013

    • Ashic M.

      Yes...I expect the SM registration page to be up at some time today. I'll announce it when available. It really helps with administration, room allocation, recording, etc.

      August 14, 2013

8 went
