
About QuantHealth
QuantHealth is a growing AI startup in the clinical trial space, leveraging AI, biomedical data, knowledge graphs, and real-world patient data to simulate and optimize clinical trials for pharmaceutical companies.
Our platform helps customers simulate clinical trials, reduce development risk and cost, shorten timelines, and improve the probability of clinical trial success.
About the role
Our clinical trial SaaS platform leverages AWS infrastructure spanning multiple transactional databases, data warehouses, and key services like Vercel and Databricks to deliver predictive analytics for enterprise healthcare clients. We have a distributed, single-tenant architecture that demands robust cloud infrastructure management and optimization. As an infrastructure engineer, you'll own and refine the cloud systems that power our clinical trial innovation platform.
Key Responsibilities:
Develop, maintain, optimize and harden our single-tenant cloud infrastructure
Implement a secure, high-performance network topology that connects frontend services, databases, and ML processing clusters
Design and implement disaster recovery strategies, including backup automation, fail-over procedures, and restore drills
Coordinate organization-wide SRE practices, including cross-component tracing, incident management, alerting, and reliability metrics
Work closely with engineering teams to understand their infrastructure requirements and enable them to achieve continuous and stable deployments
Administer key systems (AWS, Databricks, DBs, etc.) including access controls, security hardening, monitoring, and compliance management
Establish and manage an infrastructure request ticketing system with self-service capabilities enabling engineers to request changes, provision resources, and receive guidance
Requirements:
At least 3 years of experience in each of the following: managing core services on AWS and/or Azure, cloud networking, IaC tools (Terraform, Cloud Formation, etc.), Relational DB Management, CI/CD pipelines
At least 2 year of experience in each of the following: Kubernetes, working with single-tenant architectures, infrastructure operations process, Linux and bash scripting, SRE and observability platforms (NewRelic, Data Dog, etc.)
Excellent written and verbal communication skills
Ability to work independently and as part of a team
Advantage:
Experience with Azure
Experience developing single-tenant solutions for large enterprise clients
Experience with Databricks: metastore management, access control, asset bundles and the Databricks terraform provider, etc.
Security-first mindset, history working on or closely with security teams
Working knowledge of PySpark, Spark, AWS ECR and the like
Experience with container orchestration platforms (EKS, ECS, Podman, etc.)
Experience with Vercel