Senior Site Reliability Engineer

Remote (Europe) | Remote | pl | via direct

// Job Type

Full Time

// Salary

PLN 29,000 - 34,500/month

// Posted

3 months ago

// Seniority

senior

// Work Mode

remote

// Experience

7+ years

About the Role

Site Reliability Engineers (SREs) are essential to PandaDoc's success, ensuring customers receive a reliable service with minimal downtime. The SRE team achieves this by:

- Owning the incident management processes and tools
- Managing the observability stack and alerting systems to enable timely investigation and mitigation
- Actively contributing to service codebases to proactively prevent incidents and resolve performance bottlenecks

In essence, SREs are the cornerstone of production service resiliency, driving efforts in observability, incident management, capacity planning, and reliable operations.

In this role, you will:

- Own and influence the incident management process end-to-end
- Maintain and evolve the on-prem observability stack
- Keep production applications running smoothly by participating in the on-call rotation
- Develop automations and tools to support platform reliability
- Contribute to production services with performance and resiliency in mind
- Collaborate with product engineers to foster SRE principles within the R&D organization
- Be a mentor for the SRE team or product engineers

About you:

- Solid programming experience, namely Python (Django and AsyncIO) and/or Java (Spring Boot)
- Experience in maintaining an observability tools suite (specifically LGTM: Loki, Grafana, Tempo, Mimir)
- Experience in development and maintenance of Python services in production
- Strong experience with AWS and Kubernetes
- Solid proficiency in working with relational databases (PostgreSQL) and messaging systems (e.g. RabbitMQ, NATS, Kafka)
- Experience as an on-call SRE
- You enjoy hands-on troubleshooting of distributed systems in production environments
- You act like an owner and strive to do work you're proud of
- You enjoy communication and knowledge sharing on all things reliability
- Proficiency in English, both written and spoken

Tech Stack

Python, Django, Java, Spring Boot, Kubernetes, AWS, PostgreSQL, Kafka, Grafana, Loki, observability
