Uber의 결제 시스템을 만들기면소 얻은 대규모 분산 시스템 운영 노하우에 대해: 모니터링 방법, 인시던트 관리 프로세스, 장애의 사후 분석 및 개선, SLO, SLA 등을 정리함.
Fast forward to 2018. Netflix has grown to 125M global members enjoying 140M+ hours of viewing per day. We’ve invested significantly in improving the development and operations story for our engineering teams. Along the way we’ve experimented with many approaches to building and operating our services. We’d like to share one approach, including its pros and cons, that is relatively common within Netflix. We hope that sharing our experiences inspires others to debate the alternatives and learn from our journey.