Principal Site Reliability / DevOps Engineer - Exascale Cloud Service
Oracle
- București
- Permanent
- Full-time
- Build new monitoring/administration solutions including architecture, provisioning, configuration, deployment, and patching of network components
- Respond to production deficiencies by continuously implementing automation, self-healing, and real-time monitoring for production systems
- Perform periodic on-call duties
- Solve complex and difficult problems and build automation to prevent problem recurrence
- Participate in cloud service capacity planning and demand forecasting, software performance analysis, and system tuning
- Partner with distributed teams in prototyping new solutions
- Stay informed of new technologies
- Innovate
- BS or MS degree in Computer Science, or equivalent experience
- Proficiency in scripting languages (for example, Shell, Perl, and Python) and programming languages (for example, C, C++, Java, or Python)
- Strong experience with continuous integration and continuous deployment (CI/CD) using tools such as Git/Bitbucket, TeamCity, Artifactory, Jira, Phabricator, and Octopus, or equivalent
- Strong knowledge of development environments and tooling (Git; Atlassian tools such as Jira, Confluence, and Bitbucket)
- Good knowledge of containerization using Docker and Kubernetes
- Experience with configuration management tools
- Experience with monitoring tools
- Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
- Systematic problem-solving approach, combined with a strong sense of ownership and drive
- Possess a passion for technical leadership and mentoring
- Possess strong verbal and written communication skills
- Oracle is a United States Affirmative Action Employer