HPC Nightmares & HPC Best Practices


Details
Remember that time you hit Enter instead of Backslash and deleted your entire virtual infrastructure? We are looking for volunteer speakers. If you manage or use HPC resources (even modest ones) please add your voice.
This open round table discussion will offer a chance to hear stories from the datacenter as well as some of the best ways to get real work done in HPC.
Should you participate?
At some level, anything that gets the job done is a best practice. We're all guilty of implementing a hack to get things working. Perhaps if we join forces, we can all reduce the number of hacks and increase the quality of our HPC.
Not all HPC is thousands of cores. Real work gets done on single systems with one GPU. Students, users and administrators at all levels can teach us something new.
Stories we want to hear:
• When things went wrong and couldn't be fixed
• When things went wrong and you managed to save the day
• What toolchain you use to manage your infrastructure/software/users/etc
• A problem that has been frustrating you and no one has stepped up to fix it
I have a story!
Please send a message to Eliot Eshelman (either via Meetup or eliote@microway.com ). I will publish teasers as stories come in.

HPC Nightmares & HPC Best Practices