
Site Reliability Engineer
EPAM Systems (Poland) sp. z o.o.
B2B
Hexjobs Insights
Role: Site Reliability Engineer. Responsibilities include implementing SRE practices, designing cloud solutions, troubleshooting, and ensuring system reliability. Requirements: 3+ years of experience in SRE and knowledge of cloud platforms and DevOps tools.
Keywords
Site Reliability Engineering
Cloud Solutions
AWS
Azure
GCP
Python
CI/CD
Kubernetes
Monitoring Tools
Benefits
- Flexible schedule and opportunity to work remotely within Poland
- Outstanding career roadmap
- Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
- Benefits package (health insurance, multisport, shopping vouchers)
- Participation in the Employee Stock Purchase Plan
Your responsibilities
- Collaborate with development, security, quality, and operation teams to implement SRE practices and ensure system reliability
- Define and support the required levels of reliability, availability, and performance for services and applications
- Design and deliver Cloud-based solutions tailored to client needs
- Troubleshoot and mitigate infrastructure and application issues, and support their resolution in a timely manner
- Implement monitoring for infrastructure and application reliability
- Communicate technical concepts clearly to both engineering teams and management stakeholders
Our requirements
- Bachelor’s degree in Computer Science, Engineering, or a related field
- 3+ years of hands-on experience in Site Reliability Engineering or related roles
- Proven experience with at least one cloud platform (AWS, GCP, or Azure)
- Experience implementing SRE practices such as SLOs/SLIs, error budgets, postmortems, toil reduction, capacity planning, and incident management
- Python or another scripting/programming language
- Strong background in monitoring tools
- Proficiency in CI/CD tools, infrastructure as code, and configuration management
- Solid knowledge of container and orchestration technologies (Docker, Kubernetes)
- English language proficiency at an Upper-Intermediate level (B2) or higher
Optional
- Expertise in deploying and managing LLMs, including techniques such as retrieval-augmented generation (RAG)
- Certification in Kubernetes, AWS/GCP/Azure, or similar technologies
- Proven experience in DevOps
- Knowledge of managing and optimizing AI/ML models in production environments, including basic deployment, monitoring, and maintenance
What we offer
- Engineering community of industry professionals
- Friendly team and enjoyable working environment
- Flexible schedule and opportunity to work remotely within Poland
- Chance to work abroad for up to 60 days annually
- Business-driven relocation opportunities
- Outstanding career roadmap
- Leadership development, career advising, soft skills, and well-being programs
- Certification (GCP, Azure, AWS)
- Unlimited access to LinkedIn Learning, Get Abstract, Cloud Guru
- English language classes
- Stable income (Employment Contract or B2B)
- Participation in the Employee Stock Purchase Plan
- Benefits package (health insurance, multisport, shopping vouchers)
- Strategically located offices featuring entertainment and relaxation zones, table tennis and football, free snacks, fantastic coffee, and more
- Referral bonuses
- Corporate, social and well-being events
| Published | 4 days ago |
| Expires | in 26 days |
| Contract type | B2B |