Job Description
Architect the Future of Intelligence
We are on a mission to define the technological landscape of 2026 and beyond. At Apex Future Systems, we are building the next generation of autonomous AI agents and generative models. We are seeking a visionary Senior AI Architect to lead our research and engineering teams in creating scalable, high-performance systems that redefine human-computer interaction.
Why This Role?
You will have the unique opportunity to shape the roadmap for our flagship product, working directly with C-level executives to drive innovation. We offer a competitive compensation package, equity opportunities, and the chance to work in a culture that prizes audacity and technical excellence.
Key Responsibilities
- System Architecture: Design and implement cutting-edge neural network architectures for large-scale generative AI and autonomous decision-making systems.
- Model Optimization: Lead initiatives to optimize model inference latency and reduce token costs, ensuring our solutions are production-ready and cost-efficient.
- Research Integration: Bridge the gap between theoretical research and practical application, deploying state-of-the-art algorithms into our core platform.
- Talent Leadership: Mentor a high-performing team of data scientists and engineers, fostering a culture of continuous learning and innovation.
- Technical Strategy: Collaborate with product and engineering teams to define technical roadmaps and ensure alignment with business objectives.
- Scalability: Architect cloud-native infrastructure capable of handling millions of concurrent interactions and petabytes of data.
Qualifications
- Education: Master’s or Ph.D. in Computer Science, Machine Learning, or a related field from a top-tier institution.
- Experience: 7+ years of professional experience in AI/ML engineering, with at least 3 years in a leadership or architect role.
- Technical Skills: Deep expertise in Python, PyTorch, TensorFlow, and distributed computing frameworks (Kubernetes, Docker).
- Specialization: Proven track record of working with Large Language Models (LLMs), Transformers, and reinforcement learning.
- Problem Solving: Exceptional ability to solve complex, ambiguous problems and navigate the trade-offs between model accuracy and computational efficiency.
- Communication: Excellent written and verbal communication skills, capable of translating complex technical concepts for diverse audiences.
What We Offer
• Competitive base salary and equity package
• Comprehensive health, dental, and vision insurance
• Unlimited PTO and flexible remote work options
• Access to the latest hardware for AI research
• A dynamic, fast-paced environment where your work matters
Responsibilities
- Design and implement cutting-edge neural network architectures for large-scale generative AI and autonomous decision-making systems.
- Lead initiatives to optimize model inference latency and reduce token costs, ensuring our solutions are production-ready and cost-efficient.
- Bridge the gap between theoretical research and practical application, deploying state-of-the-art algorithms into our core platform.
- Mentor a high-performing team of data scientists and engineers, fostering a culture of continuous learning and innovation.
- Collaborate with product and engineering teams to define technical roadmaps and ensure alignment with business objectives.
- Architect cloud-native infrastructure capable of handling millions of concurrent interactions and petabytes of data.
Qualifications
- Master’s or Ph.D. in Computer Science, Machine Learning, or a related field from a top-tier institution.
- 7+ years of professional experience in AI/ML engineering, with at least 3 years in a leadership or architect role.
- Deep expertise in Python, PyTorch, TensorFlow, and distributed computing frameworks (Kubernetes, Docker).
- Proven track record of working with Large Language Models (LLMs), Transformers, and reinforcement learning.
- Exceptional ability to solve complex, ambiguous problems and navigate the trade-offs between model accuracy and computational efficiency.
- Excellent written and verbal communication skills, capable of translating complex technical concepts for diverse audiences.