Job Summary
As a Domain Software Operation & Maintenance (O&M) Engineer, you will play a critical role in ensuring the reliability, availability, and performance of Univers SaaS products. You will be responsible for maintaining and supporting domain-specific software applications in production environments, working closely with development, DevOps, and security teams. Your mission will be to troubleshoot issues, optimize operations, drive automation, and contribute to system stability and product improvement.
Key Responsibilities
- Monitor, manage, and optimize the daily operation of domain software systems to ensure high availability and reliability.
- Troubleshoot and resolve production incidents related to domain applications.
- Support the deployment and go-live process of in-house developed applications.
- Oversee software deployment and application maintenance governance.
- Serve as a key liaison among product, project, and customer teams for resolving product-related issues.
- Provide feedback to product teams for continuous improvement and better feature enablement.
- Build and maintain technical documentation, operational runbooks, and troubleshooting guides.
- Become a subject matter expert (SME) in both IT systems and the relevant domain (e.g., Energy Management, Micro-Grid).
- Coordinate with cross-functional teams to ensure alignment of software changes, upgrades, and fixes.
- Maintain a high level of professionalism when interacting with internal teams and external stakeholders.
Required Qualifications
- Bachelor’s degree in Computer Science, Computer Engineering, or a related field.
- 2–5 years of experience in application maintenance or production support roles.
- Strong hands-on experience with:
  - Linux system administration and command-line troubleshooting
  - Docker & Kubernetes container orchestration
  - Relational and non-relational databases (MySQL, Redis, TSDB, etc.)
- Familiarity with web debugging tools and log analysis.
- Experience with cloud or virtualization environments such as Microsoft Azure or VMware is a plus.
- Domain knowledge in Renewables, Energy Management, or Micro-Grid systems is advantageous.
- Strong troubleshooting, analytical, and creative problem-solving skills.
- Proactive mindset with the ability to take ownership and operate independently in a fast-paced environment.
- Excellent communication skills with the ability to work effectively across teams.
- Positive, team-oriented attitude with a willingness to adapt and learn new technologies.
- Chinese language proficiency (speaking and reading) is a strong plus.
Nice to Have
- Experience in automation and scripting (e.g., Bash, Python)
- Familiarity with ITIL processes or service management frameworks
- Experience with monitoring tools (e.g., Prometheus, Grafana, ELK Stack)