• Cloud Linux Operations Engineer

    Location(s) UK-Greater London-Hayes
    Req #
    System Administration / Engineering
  • Overview & Responsibilities

    Excited about OpenStack and automation at scale? Do you want to work as an infrastructure engineer on the biggest OpenStack public Cloud in the world? This is the role for you...


    A Linux Operations Engineer is a key element in the day to day maintenance of Rackspace Cloud Servers product line and the largest OpenStack deployments in the world.

    As a IOPS Engineer you are expected to apply thorough problem-solving techniques to proactively identify the source of infrastructure problems and own their resolution while also identifying opportunities for automation and process improvements. 


    As a member of the IOPS Team, the IOPS Engineer works with the customer facing teams to ensure maximum stability and performance of our Cloud systems by responding to and troubleshooting issues with our infrastructure and customer-facing cloud server and Cloud portfolio of products.  Main responsibilities include keeping all Cloud and Cloud Servers systems online, ensuring the uptime and stability of the infrastructure while also serving as an escalation point for the customer-facing technical support teams.




    • Work with cutting edge hardware and software at scale in the cloud computing space.
    • Provide Fanatical Support to our customers through your innovative solutions.
    • Install, configure, update and troubleshoot services such as MySQL/NoSQL, SaltStack, Scale and many others. 
    • Collaborate in a “DevOps” environment where you will work closely with software developers, QA and more (no silos tolerated).
    • Constantly search for new ways to help us better monitor our infrastructure, deploy new capacity faster, release code more safely, and make us more resilient. 
    • Write automation tools/scripts to help keep the operations team stay nimble and focused on the challenges ahead. 
    • Perform periodic on-call duties as part of a global team.


    • Passionate contributor, works as a member of the team, produces high quality work.
    • Leverages established processes to get work done efficiently and assists in the creation of better solutions.
    • Assists team by proactively addressing problems and inefficiencies in our work and systems.
    • Makes connections between seemingly unrelated concepts and ideas, aware when to diverge from standard practices. 
    • Serves as an expert for some products or projects. 
    • Contributes to central store of knowledge.



    • Intermediate to advanced knowledge in OSes and distributions, patching, monitoring, backups, RAID, hardware, virtualization, networking, firewalls, load balancing, storage, security, high availability, root cause analysis, systems optimization, APIs, Shell Scripting (python, bash), SSL, Web servers, auth/directory services, caching services, SSH, HTTP, DNS, FTP, SNMP, SMTP and troubleshooting.
    • Basic to Intermediate knowledge of: Cluster management, configuration and source code management, database administration, packet analysis, stack trace analysis, design and implementation methodologies, production cycle including best practices.
    • Leads projects or portions of projects for the business  
    • Participates in interdepartmental meetings. 
    • Works with leadership on project planning and resolution of systemic problems.
    • Mentor and escalation for junior staff
    • Detail oriented in documenting information. 



    • Bachelor’s degree in a technology related field required.  
    • At the manager’s discretion, additional relevant experience may substitute for the degree requirement. 
    • RHCE or equivalent certification or professional experience.
    • Years of experience typically required: 4-6 years.