Site Reliability Engineer

TRIBUS
Full time
3 weeks ago

This is a rare opportunity to join a fast-growing firm at the forefront of the digital asset ecosystem, working on cutting-edge infrastructure and tooling with a global, remote-first team.

Key Responsibilities

  • Maintain and scale highly available, low-latency trading infrastructure deployed across multiple regions.
  • Design, build, and improve monitoring, alerting, and incident response systems.
  • Develop automation for deployments, testing, and system health checks.
  • Work closely with developers and traders to ensure production reliability and performance.
  • Support both CeFi and DeFi infrastructure, with a focus on security, scalability, and fault tolerance.

What We’re Looking For

  • 3 - 7+ years' experience in Site Reliability, DevOps, or Infrastructure Engineering roles.
  • Deep Linux systems knowledge and strong scripting skills (Python, Bash, etc.).
  • Experience with containerisation (Docker) and orchestration tools (Kubernetes preferred).
  • Familiarity with observability stacks (e.g. Prometheus, Grafana, ELK).
  • Prior experience in trading, low-latency, or crypto systems is a strong plus.
  • Comfortable working independently in a fully remote environment.
Apply
Other Job Recommendations:

Site Reliability Engineer, Machine Learning Systems - Singapore

ByteDance
Singapore
The ByteDance Large Model Team is committed to developing the most advanced AI large model technology in the industry, becoming a...
6 days ago

Site Reliability Engineer - Game

ByteDance
Singapore
With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao,...
6 days ago

Site Reliability Engineer - Applied Machine Learning Engine (Singapore)

ByteDance
Singapore
ByteDance will be prioritizing applicants who have a current right to work in Singapore, and do not require ByteDance's...
2 weeks ago

Site Reliability Engineer (Traffic) - Infrastructure Engineering

ByteDance
Singapore
The team manages the end to end lifecycle of server fleet, providing cloud solutions and various infrastructure services ensuring...
2 weeks ago

Site Reliability Engineer, Traffic Platform - Traffic SRE

ByteDance
Singapore
The Traffic Infrastructure Global Engineering (TIGE)-Traffic Platform team at ByteDance builds and operates multi-cloud based...
2 weeks ago