Share this opportunity
Related Job
- CTV Kinh Doanh (Soundbox), Zalopaythành phố hồ chí minh
- Head of Internal Audit, Business Operationsthành phố hồ chí minh
- IT Risk & Compliance Specialist, VNGGamesthành phố hồ chí minh
Job Search
Senior System Engineer, GreenNode
OfficialTechSystem26-PRD-3629
thành phố hồ chí minh
View this job in
English
Job description
Overall Company:
GreenNode is the Leading AI Cloud Infrastructure and Solutions Provider in Southeast Asia, a member of VNG Group, and an official NVIDIA Cloud Partner.
With over 20 years of experience building and operating large-scale cloud infrastructure - starting from our own internal “customer zero” - VNG, GreenNode possesses deep expertise in security, infrastructure optimization, and cloud transformation. GreenNode delivers a streamlined AI Cloud ecosystem focused on core products designed for large-scale, user-intensive applications and AI workloads. Our infrastructure is deployed across multi-availability zones and multi-region environments in Vietnam and Thailand, ensuring high performance, availability, stability, and flexible scalability for mission-critical workloads.
With a strong understanding of the technology needs of digital-native enterprises - especially mid-tier Banks, FinTech companies, and Retail businesses, GreenNode partners closely with customers throughout their transformation journey, supporting sustainable growth and global expansion.
Job Summary
Senior System Engineer responsible for operating, troubleshooting, and optimizing large-scale OpenStack-based cloud systems with strong focus on networking, SDN data plane, kernel interaction, container/runtime behavior, automation, and performance debugging.
Responsibilities:
GreenNode is the Leading AI Cloud Infrastructure and Solutions Provider in Southeast Asia, a member of VNG Group, and an official NVIDIA Cloud Partner.
With over 20 years of experience building and operating large-scale cloud infrastructure - starting from our own internal “customer zero” - VNG, GreenNode possesses deep expertise in security, infrastructure optimization, and cloud transformation. GreenNode delivers a streamlined AI Cloud ecosystem focused on core products designed for large-scale, user-intensive applications and AI workloads. Our infrastructure is deployed across multi-availability zones and multi-region environments in Vietnam and Thailand, ensuring high performance, availability, stability, and flexible scalability for mission-critical workloads.
With a strong understanding of the technology needs of digital-native enterprises - especially mid-tier Banks, FinTech companies, and Retail businesses, GreenNode partners closely with customers throughout their transformation journey, supporting sustainable growth and global expansion.
Job Summary
Senior System Engineer responsible for operating, troubleshooting, and optimizing large-scale OpenStack-based cloud systems with strong focus on networking, SDN data plane, kernel interaction, container/runtime behavior, automation, and performance debugging.
Responsibilities:
- Operate and troubleshoot OpenStack components (Neutron, Nova, LB) or similar cloud platforms with focus on tenant networking, routing, NAT, security groups, and production incident handling.
- Analyze end-to-end packet flow and debug connectivity issues, packet drops, latency spikes, and unstable behaviors using tcpdump, iproute2, flow inspection, logs, and system traces.
- Work with SDN or virtual networking technologies such as OVS, OVN, Tungsten Fabric/Contrail, VMware NSX or equivalent and understand overlay concepts such as VXLAN, MPLS, and EVPN.
- Investigate performance bottlenecks including PPS limits, CPU saturation, NIC offload behavior, MTU issues, RSS, NUMA/CPU pinning, kernel/network stack behavior, and feature compatibility across OS, kernel, driver, and platform versions.
- Debug system-level issues involving Linux kernel, Docker/container runtime behavior, cgroup v1/v2 differences, kernel modules, driver interactions, and feature mismatches across distributions or kernel versions.
- Build or use automation to collect logs, inspect system settings, validate runtime state, compare configuration across nodes, and support standardization at scale using tools such as Ansible in combination with shell or Python scripts.
- Handle production incidents, perform root cause analysis, and collaborate with monitoring/logging systems to identify systemic issues and prevent recurrence.
Requirement
- Strong Linux system troubleshooting skills with understanding of kernel interaction, networking stack, process/resource control, and container runtime behavior such as Docker or containerd.
- Solid networking fundamentals including TCP/IP, routing, NAT, and L2/L3 behavior in virtualized and overlay environments.
- Hands-on experience with virtual networking, SDN, or cloud networking platforms such as OpenStack, Kubernetes networking, VMware, or similar systems.
- Ability to debug issues using packet-level and system-level tools rather than relying only on configuration, UI, or vendor documentation.
- Experience with automation/configuration management tools such as Ansible for gathering logs, checking system parameters, validating configuration consistency, and executing operational changes safely across multiple nodes.
- Coding mindset with ability to read, understand, review, and troubleshoot code or logic in Python, Go, Shell, or C/C++ and investigate root causes beyond standard runbooks.
- Nice-to-have: OpenStack Neutron, Tungsten Fabric/Contrail, EVPN/MPLS, VPN/IPSec, kernel tuning, Docker/containerd internals, high PPS system experience.
Registration was successful!
We've received your profile and we do appreciate your interest in our job opportunities. We will screen your application and contact you for further steps if you are short-listed. Otherwise, the application with no response received within 2 weeks is considered unsuitable application, and we will keep your resume in our database and may consider for appropriate future openings. Again, thank you for considering VNG as a potential employer.
