Project Description:
Oracle Corporation's 'SaaS Engineering' team is setting up a new team to advance service reliability using teams of autonomous AI agents. The initiative aims to build a robust system that uses advanced ML/AI tools to analyze system logs, predict failures, and autonomously resolve issues before they impact cloud services. The project combines anomaly detection and autonomous AI agents to improve service resiliency. This newly formed team will be crucial in driving innovation and ensuring the reliability of Oracle's cloud services, contributing directly to service uptime and customer satisfaction.
We are looking for key technical leads with expertise across several critical domains. These include leaders in machine learning, who will drive model development, training, and optimization; generative AI and LLMs, who will focus on fine-tuning large-scale models and advancing language-understanding capabilities; data engineering, who will ensure robust data pipelines and high-quality data management; MLOps, who will streamline model deployment, monitoring, and automation; and AI software engineering, who will integrate AI models into scalable applications. Together, these roles will form the foundation for building high-performance, production-grade AI systems.
Career Level - IC4