Senior Engineering Manager - Accelerated Compute Memory Systems
About Pryon:
We’re a team of AI, technology, and language experts whose DNA lives in Alexa, Siri, Watson, and virtually every human language technology product on the market. Now we’re building an industry-leading knowledge management and Retrieval-Augmented Generation (RAG) platform. Our proprietary, cutting-edge natural language processing capabilities transform unstructured data into meaningful experiences that increase productivity with unmatched accuracy and speed.
Pryon is building one of the industry's most ambitious AI infrastructure platforms: a petabyte-scale ingestion and inference system powering mission-critical government and enterprise deployments. We need an Engineering Manager with deep HPC expertise—someone who can teach, not be taught. You’ll lead the technical team building our ingestion, retrieval, and inference layers, ensuring scalability, reliability, and compliance.
In This Role You Will:
- Build and lead a team delivering the ingestion, retrieval, and inference layers that will power mission-critical deployments for commercial and federal entities with millions of public users.
- Architect and deliver horizontally scalable, fault-tolerant systems capable of handling billions of documents and burst loads of 30K+ concurrent users.
- Guide implementation of multimodal ingestion pipelines (e.g., PDF, HTML, DOCX, JSON, XML, PPTX, TIFF).
- Oversee design and optimization of LLM-driven data ingestion and retrieval workflows.
- Own optimization and tuning of high-throughput, low-latency production environments via async orchestration frameworks.
- Establish performance benchmarking, compliance frameworks, and automated testing for scale.
- Balance technical leadership with people leadership, guiding architecture decisions while scaling and mentoring a high-performing team.
- Collaborate cross-functionally with Product, Executive Leadership, and Customer Success.
What You'll Need to Be Successful:
- 10+ years in software engineering, 5+ years in management roles with large-scale AI/ML systems and infrastructure.
- Expert-level proficiency in Python and Golang, with 5+ years building production distributed systems.
- Experience with orchestration frameworks (Kubernetes, Ray, Dask).
- Proficiency with vector databases (Pinecone, Weaviate, Qdrant, or similar).
- Experience with message queuing systems (Kafka, Pulsar, RabbitMQ).
- In-depth knowledge and hands-on experience building scalable distributed architectures and high-performance compute systems.
- Proven experience in multimodal ingestion pipelines within RAG platforms.
- Direct experience in designing, fine-tuning, and optimizing LLMs for ingestion and retrieval workloads.
- Previous success managing engineering teams delivering production-grade, HPC-scale RAG systems.
- Deep understanding of infra domains: compute, storage, networking, observability, security, disaster recovery, and cost management.
- Familiarity with HPC cluster management software such as Slurm.
- Familiarity with cloud platforms (AWS, Azure, GCP) and/or on-prem datacenter operations.
Benefits for Full Time Employees:
- Remote-first organization
- 100% company-paid Health/Dental/Vision benefits for you and your dependents
- Life Insurance, Short-term and Long-term Disability
- 401k
- Unlimited PTO
We are interested in every qualified candidate who is authorized to work in the United States. However, we are not able to sponsor or take over sponsorship of employment visas at this time.
Pryon will not consider race, religion, sex, sexual preference, or national origin in ways that violate the Nation's civil rights laws.