Grafana and Site Reliability Engineer
Marks Sattin · Glasgow
Job description
About the role
We are seeking an experienced Site Reliability Engineer with deep expertise in Grafana and AWS observability. You will own the design, implementation, and maintenance of monitoring, alerting, and dashboard solutions across a large AWS environment, turning raw metrics, logs, and traces into actionable insights for engineering teams and stakeholders.
Key responsibilities
- Design, build and maintain production‑grade Grafana dashboards for application and infrastructure health, performance, and SLO/SLI reporting.
- Define golden‑signal metrics, integrate logs and traces, and create unified observability views across services.
- Establish and track SLIs/SLOs, manage error budgets, fine‑tune alerting policies, and support incident response and post‑mortem reviews.
- Integrate Grafana with AWS CloudWatch and other data sources to monitor EC2, ECS/EKS, Lambda, RDS, DynamoDB, S3, API Gateway, and more.
- Collaborate with platform, data, and application teams on instrumentation best practices and provide training on observability.
Required profile
- 6+ years of experience in SRE, observability, or cloud reliability roles on AWS.
- Hands‑on expertise with Grafana (dashboards, alerts, RBAC, multiple data sources).
- Strong understanding of SLA, SLO, SLI, error budgets, and golden‑signal monitoring.
- Proven track record troubleshooting production systems using observability tooling.
Required skills
- Grafana
- AWS (EC2, ECS, EKS, Lambda, RDS, DynamoDB, S3, API Gateway, CloudWatch)
- Prometheus
- Loki
- OpenTelemetry
- OpenSearch
- Terraform or CloudFormation
- Snowflake (nice to have)
- Databricks (nice to have)
Questions fréquentes
Why are you reporting this job?
Apply in 30 seconds
Enter your email to apply. An account will be created automatically.
By continuing, you accept our terms of use.
Already have an account? Login
Published 3 days ago
Expires 1 month from now
14 views · 0 applications
Boost your chances
Upload your CV — we will match you with relevant openings.
Analyzing your CV...
Marks Sattin
Glasgow
Related job offers
-
Junior Software Developer – Digital Projects
UK Regulators' Network Glasgow -
IT Automation Architect
WSP in the UK & Ireland Glasgow -
Post‑Sales Customer Success Lead (Cyber Security)
Sapphire Glasgow -
Engagement Manager – AI Implementations
tether London -
Ingénieur PLM – Gestion et évolution du système PLM
VINCI Energies New Plymouth