Home Job Details
N
Information Technology 🏒 Full Time ⭐️ Verified

Senior AI Architect | GenAI & LLM Specialist

Nexus Future Labs
San Francisco
Estimated Salary
USD 180.000 – USD 250.000
New
Live Update
2 Juli 2026
Deadline
2 Jul 2027

Job Description

Are you ready to define the technological landscape of 2026? Nexus Future Labs is seeking a visionary Senior AI Architect to spearhead our next generation of Generative Intelligence systems. We are building the foundational infrastructure for the autonomous web and are looking for a leader who can bridge the gap between theoretical AI research and scalable, production-grade software engineering.

In this pivotal role, you will architect the core of our proprietary Large Language Model (LLM) ecosystem, ensuring our models are not only state-of-the-art but also ethical, efficient, and safe. You will work in a dynamic environment where your code will directly impact millions of users globally. If you are passionate about the future of AI and want to be at the forefront of the 2026 technological revolution, we want to hear from you.

Responsibilities

  • Design and implement advanced Retrieval-Augmented Generation (RAG) pipelines and vector databases to enhance model accuracy.
  • Lead the end-to-end development of LLM fine-tuning workflows using Hugging Face and custom PyTorch implementations.
  • Optimize model inference latency and throughput for high-scale distributed systems.
  • Establish and enforce best practices for AI safety, bias mitigation, and responsible deployment.
  • Collaborate with cross-functional teams (product, security, design) to translate complex AI capabilities into user-friendly features.
  • Conduct research on emerging AI architectures, including Mixture of Experts (MoE) and state-space models.

Qualifications

  • Master’s or PhD in Computer Science, Machine Learning, or a related quantitative field.
  • 7+ years of professional software engineering experience with at least 3 years specifically focused on AI/ML infrastructure.
  • Deep expertise in Python, PyTorch, TensorFlow, or JAX.
  • Proven track record of deploying large-scale ML models to production environments (AWS, GCP, or Azure).
  • Experience with prompt engineering, LLM orchestration frameworks (LangChain, LlamaIndex), and vector search.
  • Strong understanding of deep learning architectures, transformers, and neural network optimization.

Required Skills

Python PyTorch TensorFlow LLMs Generative AI Machine Learning AWS Cloud Architecture NLP RAG Hugging Face

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All