A hands‑on workshop built around a live fintech application running in Docker: a Flask API, PostgreSQL, Prometheus, and Grafana. You will act as a DevOps/SRE engineer, managing failures and investigating incidents from metrics and dashboards rather than from user complaints.
Format
We work via GitHub and GitHub Codespaces, so all you need is a browser and a GitHub account. The infrastructure repository can be run either in Codespaces or locally with Docker.
What the workshop is about
We briefly cover monitoring basics (key metrics, SLOs, observability) and then move on to practice: how incidents show up on graphs and how to interpret them. By the end, you will have a basic understanding of “real‑world” monitoring, along with draft runbooks and post‑mortems for typical incidents.
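As a small taste of the SLO topic mentioned above, here is a minimal sketch of error-budget arithmetic. The function name, the 99.9% target, and the request counts are illustrative assumptions, not values taken from the workshop repository.

```python
def error_budget_remaining(slo_target: float,
                           total_requests: int,
                           failed_requests: int) -> float:
    """Return the unspent fraction of the error budget.

    1.0 means nothing has been spent, 0.0 means the budget is exhausted,
    and a negative value means the SLO has been violated.
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        # No traffic yet, so no budget has been consumed.
        return 1.0
    # Number of failures the SLO tolerates over this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Hypothetical numbers: 99.9% availability target,
    # 1,000,000 requests in the window, 250 of them failed.
    remaining = error_budget_remaining(0.999, 1_000_000, 250)
    print(f"Error budget remaining: {remaining:.1%}")
```

With these assumed numbers the SLO tolerates about 1,000 failed requests, so 250 failures consume roughly a quarter of the budget.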
The workshop is aimed at junior DevOps/SRE, QA, and backend engineers who want to safely experience realistic incidents and production‑like monitoring in a training environment.