15 Maintenance Reliability jobs in Bahrain
Industrial Equipment Maintenance Engineer
Posted today
Job Viewed
Job Description
Responsibilities:
- Perform routine inspections and preventative maintenance on industrial machinery.
- Diagnose and repair mechanical and electrical faults in complex equipment.
- Utilize remote monitoring tools for equipment health assessment.
- Develop and implement effective maintenance plans and schedules.
- Manage spare parts inventory and ensure availability.
- Analyze equipment performance data to identify areas for improvement.
- Collaborate with production and engineering teams on equipment upgrades and new installations.
- Read and interpret technical drawings, schematics, and manuals.
- Troubleshoot and resolve urgent equipment breakdowns.
- Provide technical guidance and support to junior technicians.
- Bachelor's degree in Mechanical Engineering, Electrical Engineering, or a related field.
- Minimum of 5 years of experience in industrial maintenance and engineering.
- Proficiency in diagnosing and repairing heavy machinery, automation systems, and PLCs.
- Experience with hydraulics, pneumatics, and electrical control systems.
- Skilled in reading and understanding technical documentation.
- Familiarity with remote diagnostic tools and software.
- Strong analytical and problem-solving capabilities.
- Excellent communication and teamwork skills.
- Ability to manage time effectively in a hybrid work environment.
- Experience in a manufacturing or production setting is preferred.
Site Reliability Engineer
Posted 4 days ago
Job Viewed
Job Description
Overview
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution. The company is founder-led, profitable, and growing. We are hiring a Site Reliability Engineer. Our goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what’s possible with automation by embracing a model-driven approach, whether on-premise or on public clouds. We run hundreds of private cloud, Kubernetes clusters, and applications for customers across both physical and public cloud estates. We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to ultimately achieve high-quality standards in our open source portfolio. The role is a globally remote position.
To succeed in this role, you need to have a strong background in Linux, Python, networking, and knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies like OpenStack, Kubernetes, security standards, open source products like Kubeflow, Kafka, OpenSearch, databases, and many others. Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.
The roleWe deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices. To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers. As a member of the team, you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.
What We Are Looking For In You- Degree in software engineering or computer science
- Python software development experience
- Operational experience in Linux environments
- Experience with Kubernetes deployment or operations
- Excellent interpersonal skills, curiosity, flexibility, and accountability
- Ability to travel internationally twice a year, for company events up to two weeks long
- Familiarity with OpenStack deployment or operations
- Familiarity with public cloud deployment or operations
- Familiarity with private cloud management
We consider geographical location, experience, and performance in shaping compensation worldwide. We adjust compensation every 6 months to ensure we recognize outstanding performance, and in addition to base pay, we offer annual bonuses. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Every 6 months compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programs
- Opportunity to travel to new locations to meet your colleagues
- Priority Pass and travel upgrades for long-haul company events
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.
Equal opportunityCanonical is an equal opportunity employer. We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.
#J-18808-LjbffrSite Reliability Engineer
Posted 6 days ago
Job Viewed
Job Description
Overview
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world's leading public cloud and silicon providers, and industry leaders in many sectors. The company is a pioneer of globally distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person, in interesting locations around the world, to align on strategy and execution.
The company is founder-led, profitable, and growing.
We are hiring a Site Reliability Engineer . Our goal is to perfect enterprise infrastructure DevOps practices, raising the bar on what's possible with automation by embracing a model-driven approach, whether on-premise or on public clouds.
We run hundreds of private cloud, Kubernetes clusters, and applications for customers across both physical and public cloud estates. We identify and address incidents, monitor and observe applications, anticipate potential issues, and enable product refinement to ultimately achieve high-quality standards in our open source portfolio.
To succeed in this role, you need to have a strong background in Linux, Python, networking, and knowledge of how clouds work. Your work will encompass the entire stack, from bare-metal networking and kernel up to Kubernetes and open source applications. You can expect to be trained in our core technologies like OpenStack, Kubernetes, security standards, open source products like Kubeflow, Kafka, OpenSearch, databases, and many others.
Automation for us is a software engineering problem that we approach with a scientific mindset to bring operations at scale, driven by metrics and code.
Location: Globally remote role
The roleWe deploy and run OpenStack, Kubernetes, storage solutions, and open source applications, applying DevOps practices.
To become a member of our team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from bare metal to containers, and you need the ability to work in operations with mission-critical services for global brand-name customers.
As a member of the team, you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure.
What we are looking for in you
- Degree in software engineering or computer science
- Python software development experience
- Operational experience in Linux environments
- Experience with Kubernetes deployment or operations
- Excellent interpersonal skills, curiosity, flexibility, and accountability
- Ability to travel internationally twice a year, for company events up to two weeks long
Bonus skills
- Familiarity with OpenStack deployment or operations
- Familiarity with public cloud deployment or operations
- Familiarity with private cloud management
What we offer colleagues
We consider geographical location, experience, and performance in shaping compensation worldwide. We adjust compensation every 6 months to ensure we recognize outstanding performance, and in addition to base pay, we offer annual bonuses. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Every 6 months compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programs
- Opportunity to travel to new locations to meet your colleagues
- Priority Pass and travel upgrades for long-haul company events
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT, and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.
Canonical is an equal opportunity employer
We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.
#J-18808-LjbffrSenior Site Reliability Engineer
Posted 1 day ago
Job Viewed
Job Description
Senior Site Reliability / Gitops Engineer
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our customers include leading public cloud and silicon providers, and industry leaders across sectors. The company is founded on global distributed collaboration, with 1200+ colleagues in 75+ countries and very few office-based roles. Teams meet two to four times yearly in person in interesting locations around the world to align on strategy and execution. The company is founder led, profitable and growing.
We are hiring a Senior Site Reliability Engineer
Next-gen operations at scale, with pure Python infra-as-code, from bare metal to containers and applications. Our goal is to perfect enterprise infrastructure devops. We run hundreds of private cloud, Kubernetes, and application clusters for customers across physical and public cloud estate, and we are raising the bar on automation by embracing a universal operator pattern and model-driven operations.
To succeed in this role you need to believe in automation as a pure software engineering problem, not a hack-it-till-it-works-for-me problem. You need to be interested in the scientific approach to operations at scale, driven by metrics and code, and you need to be able to learn the entire stack, from bare metal networking and kernel up to serverless and open source applications.
Location: Globally remote role
The role entails
Our cloud operations engineers bring Python software-engineering skills and rigour to the operations domain. We practise devsecops from bare metal to application. We architect and run OpenStack, Kubernetes and software defined storage, and we enable devsecops for applications running on that infrastructure too.
To become a member of this team, you need to be a software engineer fluent in Python, you need a genuine interest in the full open source infrastructure stack from metal to containers, and you need the ability to work in a high pressure operations environment with mission-critical services for global brand name customers.
As a member of the team you will gain experience in a broad range of cloud technologies. We evolve our offerings as the state of the art improves, so you get to stay current with the latest capabilities in open source infrastructure. We drive upgrades to keep our customers on the latest, best solutions.
What We Are Looking For In You
- Degree in Software Engineering or Computer Science
- Experience with Linux and familiarity with Linux networking and storage
- Python software development expertise
- Operational experience
- Excellent interpersonal skills, curiosity, flexibility, and accountability
- Ability to travel internationally twice a year, for company events up to two weeks long
Nice-to-have skills
- Experience with OpenStack or Kubernetes deployment or operations
- Familiarity with public or private cloud management
What we offer colleagues
We consider geographical location, experience, and performance in shaping compensation worldwide. We revisit compensation annually (and more often for graduates and associates) to ensure we recognise outstanding performance. In addition to base pay, we offer a performance-driven annual bonus or commission. We provide all team members with additional benefits, which reflect our values and ideals. We balance our programs to meet local needs and ensure fairness globally.
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programme
- Opportunity to travel to new locations to meet colleagues
- Priority Pass, and travel upgrades for long haul company events
About Canonical
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence — in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.
Canonical is an equal opportunity employer
We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background create a better work environment and better products. Whatever your identity, we will give your application fair consideration.
Job Id: hSJJDsHgC2zcU3W/4bXowB8gGMyZ/PX5ipOq1g1ITUR2DPLG+dps5c8f6wK6hKpPCdbtbyUgMQ==
#J-18808-LjbffrSenior Site Reliability Engineer
Posted 3 days ago
Job Viewed
Job Description
Senior Site Reliability Engineer
Canonical is hiring a Senior Site Reliability Engineer. Location: Globally remote role. We run hundreds of private cloud, Kubernetes, and application clusters for customers across physical and public cloud estate, and we are raising the bar on automation by embracing a universal operator pattern and model-driven operations. To succeed in this role you need to believe in automation as a pure software engineering problem, be interested in the scientific approach to operations at scale, driven by metrics and code, and be able to learn the entire stack, from bare metal networking and kernel up to serverless and open source applications.
Responsibilities- Architect and run OpenStack, Kubernetes and software defined storage, and enable devsecops for applications running on that infrastructure.
- Bring Python software-engineering skills and rigour to the operations domain; practice devsecops from bare metal to application.
- Confidently operate in a high pressure operations environment with mission-critical services for global brand name customers.
- Evolve offerings with the state of the art in open source infrastructure and stay current with capabilities.
- Degree in Software Engineering or Computer Science
- Experience with Linux and familiarity with Linux networking and storage
- Python software development expertise
- Operational experience
- Excellent interpersonal skills, curiosity, flexibility, and accountability
- Ability to travel internationally twice a year, for company events up to two weeks long
- Experience with OpenStack or Kubernetes deployment or operations
- Familiarity with public or private cloud management
- Distributed work environment with twice-yearly team sprints in person
- Personal learning and development budget of USD 2,000 per year
- Annual compensation review
- Recognition rewards
- Annual holiday leave
- Maternity and paternity leave
- Employee Assistance Programme
- Opportunity to travel to new locations to meet colleagues
- Priority Pass, and travel upgrades for long haul company events
Canonical is a pioneering tech firm at the forefront of the global move to open source. As the company that publishes Ubuntu, one of the most important open source projects and the platform for AI, IoT and the cloud, we are changing the world of software. We recruit on a global basis and set a very high standard for people joining the company. We expect excellence - in order to succeed, we need to be the best at what we do. Most colleagues at Canonical have worked from home since its inception in 2004. Working here is a step into the future, and will challenge you to think differently, work smarter, learn new skills, and raise your game.
Canonical is an equal opportunity employer. We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background create a better work environment and better products. Whatever your identity, we will give your application fair consideration.
#J-18808-LjbffrRemote Site Reliability Engineer
Posted today
Job Viewed
Job Description
Our client, a cutting-edge technology firm, is looking for a highly skilled Remote Site Reliability Engineer (SRE) to join their globally distributed team. This is a fully remote position, offering the opportunity to work from anywhere with a stable internet connection. The SRE will be instrumental in ensuring the availability, performance, scalability, and reliability of our client's mission-critical systems and infrastructure. This role involves a blend of software engineering and systems administration, focusing on automating operations, improving system resilience, and reducing manual toil. Key responsibilities include designing and implementing infrastructure as code, developing monitoring and alerting systems, managing cloud environments (AWS, Azure, or GCP), and participating in on-call rotations to respond to incidents. You will collaborate closely with development teams to ensure that services are designed for reliability and operability from the outset. The ideal candidate possesses deep expertise in cloud technologies, containerization (Docker, Kubernetes), CI/CD pipelines, and scripting/programming languages (e.g., Python, Go). Strong troubleshooting skills, a proactive approach to identifying and mitigating risks, and a passion for building robust, scalable systems are essential. We are seeking individuals who are committed to continuous improvement, possess excellent analytical and problem-solving abilities, and can communicate technical concepts effectively within a remote team environment. This is an exceptional opportunity to work on challenging problems with a talented team, shaping the future of reliable and scalable cloud infrastructure.
Key Responsibilities:
- Design, build, and maintain scalable and reliable cloud infrastructure.
- Implement and manage infrastructure as code using tools like Terraform or Ansible.
- Develop and maintain robust monitoring, alerting, and logging systems.
- Automate operational tasks and reduce manual toil.
- Troubleshoot and resolve incidents affecting system availability and performance.
- Collaborate with development teams to ensure operability and reliability of services.
- Participate in on-call rotation to provide 24/7 system support.
- Continuously improve system performance and efficiency.
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- Proven experience in Site Reliability Engineering, DevOps, or Systems Engineering.
- Strong experience with cloud platforms (AWS, Azure, GCP).
- Expertise in containerization technologies (Docker, Kubernetes).
- Proficiency in scripting and programming languages (e.g., Python, Go, Bash).
- Experience with CI/CD tools and practices.
- Solid understanding of networking concepts and protocols.
- Excellent problem-solving and troubleshooting skills.
- Strong communication and collaboration skills for remote work.
Senior Site Reliability Engineer
Posted today
Job Viewed
Job Description
Responsibilities:
- Design, implement, and manage highly available and scalable production systems.
- Develop and maintain infrastructure as code using tools like Terraform or Ansible.
- Automate operational tasks, deployment pipelines, and incident response.
- Monitor system performance, identify bottlenecks, and implement solutions.
- Troubleshoot and resolve complex technical issues in production environments.
- Collaborate with software development teams to ensure system reliability and performance.
- Implement and manage CI/CD pipelines.
- Participate in on-call rotations to provide 24/7 system support.
- Develop and enforce reliability best practices and standards.
- Contribute to capacity planning and performance tuning.
- Bachelor's degree in Computer Science, Engineering, or a related field.
- Minimum of 5 years of experience in Site Reliability Engineering, DevOps, or Systems Administration.
- Proficiency in at least one major cloud platform (AWS, Azure, GCP).
- Strong experience with containerization technologies (Docker, Kubernetes).
- Expertise in scripting languages such as Python, Go, or Bash.
- Solid understanding of networking concepts (TCP/IP, DNS, HTTP).
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack).
- Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI).
- Excellent problem-solving and troubleshooting skills.
- Ability to work effectively in a hybrid work environment.
Be The First To Know
About the latest Maintenance reliability Jobs in Bahrain !
Senior Site Reliability Engineer
Posted 1 day ago
Job Viewed
Job Description
Key Responsibilities:
- Design, implement, and maintain robust and scalable infrastructure and services.
- Develop and deploy automation tools and scripts to streamline operational tasks, such as deployment, monitoring, and incident response.
- Monitor system health and performance, identifying and resolving performance bottlenecks and issues proactively.
- Lead incident response efforts, including on-call rotations, troubleshooting, and post-mortem analysis to prevent recurrence.
- Collaborate with software development teams to ensure the reliability and operability of new features and services.
- Implement and manage CI/CD pipelines for efficient and reliable software delivery.
- Optimize system performance and resource utilization.
- Develop and maintain comprehensive documentation for systems, processes, and runbooks.
- Contribute to capacity planning and disaster recovery strategies.
- Mentor junior engineers and share knowledge across teams.
- Evaluate and recommend new technologies and tools to enhance reliability and efficiency.
- Ensure adherence to security best practices and compliance requirements.
- Participate in architectural reviews to ensure systems are designed for reliability and scalability from the outset.
- Bachelor's degree in Computer Science, Engineering, or a related technical field; Master's degree is a plus.
- Minimum of 7 years of experience in Site Reliability Engineering, DevOps, or a related field.
- Proven experience with cloud platforms such as AWS, Azure, or GCP.
- Strong proficiency in at least one programming language (e.g., Python, Go, Java, Ruby).
- Extensive experience with containerization technologies like Docker and Kubernetes.
- Deep understanding of infrastructure-as-code tools (e.g., Terraform, Ansible, Chef, Puppet).
- Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack, Datadog).
- Solid understanding of networking concepts (TCP/IP, DNS, HTTP, load balancing).
- Experience with relational and NoSQL databases.
- Strong problem-solving and analytical skills.
- Excellent communication and collaboration abilities.
- Experience in an on-call rotation and handling production incidents effectively.
Senior Site Reliability Engineer (SRE)
Posted today
Job Viewed
Job Description
- Designing, building, and maintaining scalable and reliable infrastructure using infrastructure-as-code principles (e.g., Terraform, Ansible).
- Developing automation tools and scripts to streamline deployment, monitoring, and operational tasks.
- Implementing and managing robust monitoring, alerting, and logging solutions (e.g., Prometheus, Grafana, ELK stack).
- Proactively identifying and resolving performance bottlenecks and reliability issues across the stack.
- Participating in on-call rotations to respond to and mitigate production incidents.
- Conducting post-mortems for incidents, identifying root causes, and implementing preventive measures.
- Collaborating with development teams to improve system design for reliability and operability.
- Defining and tracking key Service Level Objectives (SLOs) and Service Level Indicators (SLIs).
- Implementing chaos engineering practices to test system resilience.
- Contributing to capacity planning and performance tuning efforts.
- Ensuring the security and compliance of the infrastructure.
- Mentoring junior SREs and promoting SRE best practices within the organization.
Qualifications:
- Bachelor's degree in Computer Science, Engineering, or a related field; equivalent practical experience will be considered.
- 5+ years of experience in SRE, DevOps, or Systems Engineering roles.
- Strong proficiency in at least one scripting/programming language such as Python, Go, or Bash.
- Hands-on experience with cloud platforms (AWS, Azure, or GCP) and containerization technologies (Docker, Kubernetes).
- Deep understanding of Linux operating systems and networking fundamentals (TCP/IP, DNS, HTTP).
- Experience with CI/CD pipelines and tools (e.g., Jenkins, GitLab CI, CircleCI).
- Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, Datadog, Splunk).
- Experience with configuration management tools (e.g., Ansible, Chef, Puppet).
- Strong understanding of distributed systems and microservices architectures.
- Excellent troubleshooting and problem-solving skills.
- Effective communication and collaboration abilities, especially in a remote setting.
- Experience with database administration or management is a plus.
Senior Site Reliability Engineer (Remote)
Posted 1 day ago
Job Viewed
Job Description
Key responsibilities include:
- Designing, implementing, and managing scalable and highly available production environments.
- Developing and maintaining automation tools and scripts for deployment, configuration, and monitoring.
- Implementing and managing monitoring, logging, and alerting systems to proactively identify and resolve issues.
- Participating in on-call rotations to respond to and resolve production incidents.
- Conducting root cause analysis (RCA) for system outages and implementing preventative measures.
- Collaborating with development teams to improve application reliability and performance.
- Defining and tracking key service level objectives (SLOs) and service level indicators (SLIs).
- Managing and optimizing cloud infrastructure resources for cost-effectiveness and performance.
- Developing and advocating for SRE best practices across the engineering organization.
- Mentoring junior engineers and contributing to a culture of continuous learning and improvement.
- Automating operational tasks and reducing toil.
- Participating in capacity planning and performance testing.