GEP Site Reliability /DevOps Engineer in Clark, New Jersey
GEP is a diverse, creative team of people passionate about procurement. We invest ourselves entirely in our client’s success, creating strong collaborative relationships that deliver extraordinary value year after year. Our clients include market global leaders with far-flung international operations, Fortune 500 and Global 2000 enterprises, leading government and public institutions.
We deliver practical, effective services and software that enable procurement leaders to maximise their impact on business operations, strategy and financial performance. That’s just some of the things that we do in our quest to build a beautiful company, enjoy the journey and make a difference. GEP is a place where individuality is prized, and talent respected. We’re focused on what is real and effective. GEP is where good ideas and great people are recognized, results matter, and ability and hard work drive achievements. We’re a learning organization, actively looking for people to help shape, grow and continually improve us.
Are you one of us?
GEP is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, ethnicity, color, national origin, religion, sex, protected veteran status, disability status, or any other characteristics protected by federal, state or local law. We are committed to hiring and valuing a global diverse work team. GEP is proud to be an EEO/AA employer M/F/D/V.
For more information please visit us on GEP.com or check us out on LinkedIn.com.
Create, manage, and maintain cloud infrastructure
Implement monitoring for applications and infrastructure
Automate developer processes
Ensure organization compliance requirements are met
Ensure security of platform
Plan, deploy, and maintain critical business applications in prod/non-prod cloud environments.
Engineer suitable release management procedures and provide production support.
Drive improvements to processes and design enhancements to automation to continuously improve production environments.
Build tools to reduce occurrences of errors and improve development experience
Perform root cause analysis for incidents and outages
Design procedures for system troubleshooting and maintenance
Maintain and contribute to our knowledge base and documentation.
3+ years of proven experience in support or development globally distributed cloud SaaS services
Experience with Continuous Integration and Continuous Delivery concepts, including Infrastructure as code utilizing tools like Azure Arm templates ,Git, Terraform, CloudFormation, Jenkins, Ansible, Chef, Puppet, spinnaker, etc.
Hands-on with Containerization infrastructure like Kubernetes, Mesos, Docker, and cloud PaaS services preferred AWS/GCP/On-prem.
Ability to use a wide variety of open-source technologies and cloud services to automate microservices environments.
Experience with public cloud technologies (AWS, GCP, Azure, etc.) is a must.
Experience with monitoring tools PagerDuty, New relic, site24x7.
SQL/NoSQL (MariaDB, MySQL, MongoDB, Cassandra) knowledge is a plus.
Sound written and oral communication skill.
Ability to work independently with minimal supervision.
Should be able to interact with multiple teams and available for on-call rotations.
Requisition ID: 2021-19193
External Company URL: https://careers.gep.com/
Street: 100 Walnut Avenue
What You Will Do (Text Only): - Create, manage, and maintain cloud infrastructure - Implement monitoring for applications and infrastructure - Automate developer processes - Ensure organization compliance requirements are met - Ensure security of platform - Plan, deploy, and maintain critical business applications in prod/non-prod cloud environments. - Engineer suitable release management procedures and provide production support. - Drive improvements to processes and design enhancements to automation to continuously improve production environments. - Build tools to reduce occurrences of errors and improve development experience - Perform root cause analysis for incidents and outages - Design procedures for system troubleshooting and maintenance - Maintain and contribute to our knowledge base and documentation.