Site Reliability Engineer - Infrastructure
Sydney, Australia IT development
Job description
- Location: Sydney, NSW, Australia
- Area of Interest: Engineer - Software
- Job Type: Professional
- Technology Interest: Cloud and Data Center
- Job Id: 1260152
As SREs at Meraki, we are responsible for building the reliable and scalable cloud infrastructure that supports millions of Meraki devices across the world. Meraki's customer base has grown by a factor of 2-3 every year, and we serve more than 8 billion HTTP requests per day across eight data centres. Our customers depend on our products to run their critical infrastructure of network switches, security appliances, wireless APs and security cameras. We're passionate about using automation to raise the bar.
In this role you will join the Infrastructure SRE team that is based out of our offices in Sydney and London. The Infrastructure Site Reliability Engineering team's mission is to make Infrastructure as a Service (IaaS) a reality at Cisco Meraki and to ensure it meets the needs of the various platforms supported within Software Engineering. This typically includes areas such as operating system, compute (virtualized or traditional bare metal servers), storage, security, networking, network & infrastructure support services, and the vendor management of our physical sites.
You will be responsible for the design, development and operational aspects of the global infrastructure which supports our private cloud. We believe in automating manual tasks with the right tools. You will design, build and run automated systems written in Ruby. You will work closely with our existing vendors to coordinate all hands-on work. We embrace the *nix way, automate away tedious tasks and strive to build infrastructure as code whenever possible.
Example projects of a Site Reliability Engineer (Infrastructure):
- Design and deploy a new IaaS architecture to provide a private cloud to internal stakeholders by leveraging tools like OpenStack.
- Design and deploy tooling and frameworks to facilitate the transition to a hybrid-cloud world.
- Build an automated service lifecycle platform to manage the full lifecycle of all infrastructure (server, storage, network and site).
- Develop comprehensive monitoring tools that provide visibility into the performance and reliability of our infrastructure.
- Build automated testing infrastructure to accelerate the velocity at which we can deploy changes.
You are an ideal candidate if you:
- Have 2+ years of work experience in software development, particularly in cloud systems, networking, distributed systems, databases, and data processing frameworks.
- Script or code in one or two languages such as Ruby, Scala, Python or Bash. You are comfortable digging into other people's source code in search of the root cause of a problem and you automate all the things.
- Have previous experience designing and deploying cloud management platforms: OpenStack, CloudStack, etc.
- Have experience on a pager rotation where you responded to escalations quickly to minimize customer downtime. This role requires being part of a workday on-call rotation.
- Believe in the Unix way. You build large systems out of small components that each do one job and do it well. We run Debian.
Bonus points for:
- Experience with SRE/DevOps/infrastructure tasks
- Experience with private cloud management platforms (OpenStack)
- Interesting personal projects or contributions to open-source projects
- A BS/MS/PhD in Computer Science, Computer Engineering, or a STEM field
Cisco is an Affirmative Action and Equal Opportunity Employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, national origin, genetic information, age, disability, veteran status, or any other legally protected basis.