Perplexity is a free AI search engine designed to revolutionize the way you discover information. Ask any question, and it searches the internet to give you an accessible, conversational, and verifiable answer.
Senior iOS Software Engineer
Perplexity AI is looking for a Senior iOS Engineer to join our small team revolutionizing the way people search and interact with the internet. You will be responsible for building new experiences and improving the performance of Perplexity’s iOS app.
The ideal candidate should have strong programming skills, an interest in search and large language models, and a passion for delivering a great UX backed by a quality UI.
Responsibilities
- You will develop a native iOS app for Perplexity’s evolving product
- You will define the processes around mobile development, including planning, testing, releasing, and monitoring
- You will work with the core team to design and implement novel mobile experiences
Qualifications
- 5+ years industry experience
- Strong Swift fundamentals and a proven track record of working with a modern iOS stack built with Swift, SwiftUI (iOS16+) and UIKit
- Enjoys the craft of building nice UI, creating a good UX, and writing reusable and testable code
- Understands low-level intricacies, and knows how to profile and measure app performance and speed
- Comfortable working with a small, fast-moving team, and must be willing to dive in and take ownership
- A passion for iOS development, and enjoys experimenting with what’s new with each iteration of iOS and iPadOS
Nice to have
- Experience with WebSockets, HTTP
- Experience with concurrency and multithreading (GCD, critical sections, race conditions)
Our prominent investor base includes IVP, NEA, Jeff Bezos, NVIDIA, Databricks, Bessemer Venture Partners, Elad Gil, Nat Friedman, Naval Ravikant, Tobi Lutke, and many other visionary individuals.
AI Inference Engineer
We are looking for an AI Inference to join our growing team. Our current stack is Python, C++, TensorRT-LLM, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.
Responsibilities
- Develop APIs for AI inference that will be used by both internal and external customers
- Benchmark and address bottlenecks throughout our inference stack
- Improve the reliability and observability of our systems and respond to system outages
- Explore novel research and implement LLM inference optimizations
Qualifications
- Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
- Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization, etc.)
- Experience with deploying reliable, distributed, real-time model serving at scale
- (Optional) Understanding of GPU architectures or experience with GPU kernel programming using CUDA
The cash compensation range for this role is $190,000 – $240,000.
Backend Software Engineer
Perplexity is looking for a backend engineer to join our small team revolutionizing the way people search and interact with the internet. You will be responsible for leading design, implementation, and scaling of systems that power web and mobile products. Our backend stack is Python, Postgresql, DynamoDB, Redis, and Kubernetes, built alongside dedicated inhouse AI and search interfaces. The ideal candidate should have experience building systems, infrastructure, scaling, testing, and maintenance.
Responsibilities
- Maintaining and optimizing key-value and relational databases.
- Building interfaces between backend and consumer web/mobile clients.
- Scaling and load balancing web server backends with rapidly changing needs.
Qualifications
- Strong experience with Python.
- Strong experience with databases and caching.
- Experience with infrastructure/kubernetes.
- Extensible system design skills.
- 4+ years of engineering experience.
Our cash compensation range for this role is $180,000 – $230,000.
AI Inference Engineer
We are looking for an AI Inference to join our growing team. Our current stack is Python, C++, TensorRT-LLM, Kubernetes. You will have the opportunity to work on large-scale deployment of machine learning models for real-time inference.
Responsibilities
- Develop APIs for AI inference that will be used by both internal and external customers
- Benchmark and address bottlenecks throughout our inference stack
- Improve the reliability and observability of our systems and respond to system outages
- Explore novel research and implement LLM inference optimizations
Qualifications
- Experience with ML systems and deep learning frameworks (e.g. PyTorch, TensorFlow, ONNX)
- Familiarity with common LLM architectures and inference optimization techniques (e.g. continuous batching, quantization, etc.)
- Experience with deploying reliable, distributed, real-time model serving at scale
- (Optional) Understanding of GPU architectures or experience with GPU kernel programming using CUDA
The cash compensation range for this role is $190,000 – $240,000.