At Oracle Cloud Infrastructure (OCI), we are building the future of cloud computing—designed for enterprises, engineered for performance, and optimized for AI at scale. We are a fast-paced, mission-driven team within one of the world’s largest cloud platforms.
The Generative AI Service team within OCI is focused on developing infrastructure and tools to operationalize Large Language Models (LLMs) and agentic AI systems. Our goal is to empower developers and enterprises to deploy intelligent applications and agents that integrate seamlessly with cloud services.
Role SummaryAs a Principal Software Engineer (IC4), you will contribute to the design and implementation of scalable, distributed systems that serve LLMs and support agent-based workflows. You will work in a collaborative environment with applied scientists, ML engineers, and software teams to deliver performant and reliable AI infrastructure. This is a high-impact engineering role with opportunities to grow technical expertise in large-scale systems and advanced AI technologies.
Minimum Qualifications- BS in Computer Science or related technical field.
- 6+ years of experience in backend software development with cloud infrastructure.
- Strong proficiency in at least one language such as Go, Java, Python, or C++.
- Experience building and maintaining distributed services in a production environment.
- Familiarity with Kubernetes, container orchestration, and CI/CD practices.
- Solid understanding of computer science fundamentals such as algorithms, operating systems, and networking.
Preferred Qualifications- MS in Computer Science.
- Experience working with LLM serving frameworks like vLLM, DeepSpeed, or FasterTransformer.
- Exposure to agent-based AI systems or tool-based inference workflows.
- Knowledge of cloud-native observability tools and scalable service design.
- Interest in compiler or systems-level software design is a plus.
Why Join Us- Build mission-critical AI infrastructure with real-world impact.
- Work closely with a collaborative and experienced global team.
- Expand your knowledge in AI, cloud computing, and distributed systems.
- Contribute to one of Oracle’s most innovative and fast-growing initiatives.
- Contribute to the development and optimization of distributed systems for model inference and agent execution.
- Implement features and enhancements in LLM service infrastructure using modern cloud technologies.
- Collaborate with cross-functional teams to support scalable and secure deployment pipelines.
- Assist in diagnosing and resolving production issues, improving observability and reliability.
- Write maintainable, well-tested code and contribute to documentation and design discussions.
Disclaimer:Career Level - IC4
As a world leader in cloud solutions, Oracle uses tomorrow’s technology to tackle today’s challenges. We’ve partnered with industry-leaders in almost every sector—and continue to thrive after 40+ years of change by operating with integrity.
We know that true innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing an inclusive workforce that promotes opportunities for all.
Oracle careers open the door to global opportunities where work-life balance flourishes. We offer competitive benefits based on parity and consistency and support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling +1 888 404 2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
Want to receive frequent updates by email? Subscribe to our automatic job service!
Company:
OracleEmployee Type:
Full timeLocation:
United StatesSalary:
$ 96800 - $ 223400