I'm a Senior Software Engineer at Netflix on the Cloud Operations & Reliability Engineering (CORE) team.
Once upon a time, I studied software developers to understand what problems they face and to evaluate whether proposed technologies actually make their lives easier. Nowadays, I study operational surprises (some people call these incidents) in order to improve how Netflix engineers software.
You might be interested in some of the resources I've collected on resilience engineering:
I received a Ph.D. in computer science from the University of Maryland (2006), an M.S. in electrical engineering from Boston University (2002), and a B.Eng. in computer engineering from McGill University (1999). I was an Assistant Professor in the Computer Science & Engineering department at the University of Nebraska-Lincoln from 2006 to 2008, a Computer Scientist in the Adaptive Parallel EXecution (APEX) Computing Group at USC/ISI from 2008 to 2011, a Lead Architect at Nimbis Services from 2012 to 2013, and a Lead Software Engineer at SendGrid Labs from 2014 to 2015.