Job Description
We are not just building software for today; we are architecting the intelligent infrastructure for 2026 and beyond. Nexus Horizon is seeking a visionary Senior Generative AI Engineer to lead the development of next-generation Large Language Models and autonomous agents. If you are passionate about pushing the boundaries of Artificial Intelligence and want to define the future of human-computer interaction, this is your opportunity to lead the charge.
Why Nexus Horizon?
Join a team of world-class engineers and researchers dedicated to solving the hardest problems in AI safety, scalability, and efficiency. We offer a competitive compensation package, equity opportunities, and the flexibility to work from our state-of-the-art facility in San Francisco.
Key Responsibilities:
- Architect and deploy scalable Generative AI models (LLMs) focused on enterprise-grade performance and safety.
- Lead the research and implementation of cutting-edge techniques such as Retrieval-Augmented Generation (RAG) and fine-tuning strategies.
- Collaborate with cross-functional teams to integrate AI capabilities into consumer and B2B products.
- Optimize model inference for speed and cost-efficiency in production environments.
- Define technical roadmaps for AI infrastructure, ensuring alignment with the 2026 product vision.
- Mentor junior engineers and data scientists, fostering a culture of innovation and excellence.
Qualifications:
- Bachelor’s degree in Computer Science, Mathematics, or a related field; Master’s or PhD preferred.
- 5+ years of experience in Machine Learning, Deep Learning, or Natural Language Processing.
- Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, or JAX).
- Strong understanding of LLM architectures, transformers, and prompt engineering.
- Experience with cloud platforms (AWS, GCP, or Azure) and MLOps pipelines.
- Proven track record of shipping production-level AI applications.
- Excellent problem-solving skills and ability to thrive in a fast-paced, high-stakes environment.
Skills: Python, PyTorch, TensorFlow, LLMs, GPT-4, RAG, MLOps, AWS, Docker, Kubernetes, SQL, Agile Methodologies