Monitoring Engineer
Contents
What Your Day Will Look Like
You will play a significant role in building and managing a complete stack of monitoring solutions and processes to power our NOC and SOC services.
You will
▪ Implement and maintain monitoring solutions to proactively detect issues across our infrastructure, networks, and security services. You will respond to alerts promptly and effectively to minimize downtime and disruptions.
▪ Investigate and diagnose incidents reported by monitoring systems or users, working with support teams to resolve technical issues and restore service operations swiftly.
▪ Analyze monitoring data to identify performance bottlenecks and areas for improvement. You will also implement optimizations to enhance system reliability and efficiency.
▪ Monitor resource utilization trends and forecast capacity requirements. Provide recommendations for scaling resources to support business growth and ensure optimal performance.
▪ Maintain accurate documentation of monitoring configurations, incident response procedures, and performance metrics and generate regular reports for stakeholders to communicate system health and performance.
▪ Work closely with cross-functional teams, including operations and security, to align monitoring strategies with business objectives.
What Your Day Will Not Look Like
▪ Performing routine tasks unrelated to monitoring and system management.
▪ Ignoring alerts or incidents without timely investigation and resolution.
▪ Working in isolation without collaborating with other teams or sharing insights.
Your Profile
“It doesn’t matter where you come from, where you went to school or what degrees you have. If you’ve done exceptional work, join us and work on the future of technology-driven platform companies”.
▪ Technical Expertise: Proven experience implementing and managing monitoring tools. Strong understanding of IT infrastructure components, including servers, networks, operating systems and middleware services.
▪ Troubleshooting Skills: Ability to analyze complex technical issues, perform root cause analysis, and implement effective solutions under pressure.
▪ Scripting and Automation: Familiarity with scripting languages (e.g., Python, PowerShell) to automate monitoring and alerting tasks to streamline operational workflows.
▪ Communication Skills: Clear and concise communication abilities to articulate technical concepts to technical and non-technical stakeholders. Experience in writing technical documentation and reports.
▪ Problem-Solving Attitude: Proactively identify and address potential issues before they impact service availability or performance.
Bonus Points If You Have
▪ Experience with Monitoring tools such as (but not limited to) Prometheus, Grafana, Splunk, DataDog, or similar.
What We Offer You
▪ Competitive salary package, including performance-based bonuses and stock options.
▪ Opportunities for professional growth and career development.
▪ A collaborative and innovative work environment with access to the latest tools and technologies.
▪ The chance to significantly impact the direction and success of our products and services.
About Nebul
Nebul offers a Sovereign-Hybrid cloud solution that combines dedicated private cloud based on European values on Privacy, Security and Compliance with your on-premises infrastructure or the convenience, scale and reach of the global Hyperscalers to forge it into a genuinely European-Native and sovereign Hybrid Cloud platform. Nebul Cloud is powered by NVIDIA technology, enabling AI development and operations on the industry-leading platform. From Generative AI to Digital Twins and Scientific simulations, Nebul empowers you to harness the power of AI with confidence and trust.
Nebul is an equal opportunity employer
We are proud to foster a workplace free from discrimination. Diversity of experience, perspectives, and background creates a better work environment and better products. Whatever your identity, we will give your application fair consideration.