October 31 - November 1 - Co-Located Events
October 28-30 - Conference
Lyon Convention Centre - Lyon, France
More information for Open Source Summit + Embedded Linux Conference Europe 2019
Back To Schedule
Monday, October 28 • 16:20 - 16:55
The Observatorium: Combining Machine Learning and Observability to Improve Incident Response - Alex Kass, DigitalOcean

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
At DigitalOcean, a global hosting company predicated on providing building blocks for developers, the proliferation of microservices necessary to support a worldwide cloud creates a unique-yet-universal conundrum - while the internal code is decidedly custom to DO, the incidents that arise are common to many companies.

In the Observability group, open source tools like Prometheus, Kafka, and Spark play critical roles feeding data into a central application called The Observatorium, whose primary goal is to reduce MTTD/R by curating information intelligently. Combining distributed platform data engineering and predictive machine learning, all through open source tools, the team surfaces signals essential to first responders to help improve detection times and reduce service downtime.

In this talk, the speaker will describe in detail the architecture of The Observatorium, and how its creative amalgamation of OSS tools has measurably improved the company’s overall reliability.

avatar for Alex Kass

Alex Kass

Engineering Manager, DigitalOcean
Alex Kass has worked at companies ranging from large financial institutions to early-stage startups, regularly building successful analytical models and systems of varying size. At DigitalOcean, a fast-growing global cloud hosting provider, he has at his disposal sufficient software... Read More →

Monday October 28, 2019 16:20 - 16:55 CET
Bellecour 2
  Cloud Infrastructure & Automation
  • Session Slides Included Yes