SkyLab
Why this role is different
Most engineering jobs ask you to maintain legacy systems or add features to crowded codebases. This isn't that.
You'll be one of the founding engineers shaping two products that sit at the intersection of AI, enterprise software, and infrastructure — at a moment when those three things are being rewritten from scratch. The code you write in your first six months will still be running in production five years from now. The architectural decisions you make will shape how thousands of Korean small businesses experience AI, and how GPU compute is delivered across Southeast Asia.
We don't believe in 200-person engineering orgs where you ship one feature per quarter. We believe in small teams of strong engineers who own entire systems end-to-end, ship fast, and use AI tools to operate like teams 5x their size.
The two products you'll build
CoSAP — AI-Powered Business Intelligence for Korean SMEs
Korean small and medium businesses run on Douzone and ECount — ERP systems that hold decades of accounting and operational data, but expose it through forms and reports built for accountants, not founders. CoSAP changes that. We let business owners ask questions in plain Korean — "왜 이번 달 마진이 떨어졌어?" (why did our margin drop this month?) — and get answers backed by their actual financial data.
Under the hood: a multi-tenant data layer connecting Douzone and ECount, retrieval over financial and operational data using PostgreSQL with pgvector and Qdrant, event streaming with Kafka, workflow orchestration with Kestra, and self-hosted LLM serving with vLLM. The Korean SME market is 7+ million businesses. Almost none of them have access to real business intelligence today.
Fusionflow — GPU & AI Infrastructure Orchestration
GPUs are the most expensive, most contested compute resource on earth right now. We operate Kubernetes-based GPU clusters across multiple data centers and turn them into reliable, multi-tenant compute that customers can actually use. Fusionflow is the orchestration layer: scheduling, isolation, observability, and operations.
You'll work on real distributed systems problems — GPU scheduling under contention, node health and failure recovery, network performance tuning across InfiniBand and RoCE, tenant isolation, and the operational tooling needed to run hundreds of GPUs reliably. We currently operate a 19-node K3s cluster with 160+ GPUs and are scaling significantly through 2026.
What you'll actually do
Concretely — not buzzwords, but the kind of tickets you'll close in your first year:
Compensation & equity
Growth & learning
In the AI era, raw coding skill is no longer the bottleneck. Almost anyone can produce working code with Claude Code or Cursor. What separates strong engineers from average ones now is judgment, taste, and verification discipline. Here's what we actually screen for, in priority order:
1. Verification instinct — the ability to spot when AI output is wrong
2. Strong fundamentals in distributed systems
3. Problem decomposition over prompt engineering
4. Production ownership and operational maturity
5. Code reading > code writing
6. AI tool fluency with healthy skepticism
7. Clear written communication in English
The stack you'll work with
Nice-to-haves (not required)
English
Speaking: Intermediate - Reading: Intermediate - Writing: Intermediate
Korean
Speaking: Intermediate - Reading: Intermediate - Writing: Intermediate
We believe that every node has a role to play, and that by connecting the dots we become something bigger and more useful. We believe that we can play a role by designing, building, maintaining, and operating a network for our customers.
We believe in being green and efficient. We believe that building a smart and efficient communication network can help protect our only home, Earth.
Please contact us to find out more and let’s work together to build a GREEN network.