Data Scientist - Innovation - PhD (Irving, TX)
Confirmed live in the last 24 hours
Caris
Job Description
At Caris, we understand that cancer is an ugly word—a word no one wants to hear, but one that connects us all. That’s why we’re not just transforming cancer care—we’re changing lives.
We introduced precision medicine to the world and built an industry around the idea that every patient deserves answers as unique as their DNA. Backed by cutting-edge molecular science and AI, we ask ourselves every day: “What would I do if this patient were my mom?” That question drives everything we do.
But our mission doesn’t stop with cancer. We're pushing the frontiers of medicine and leading a revolution in healthcare—driven by innovation, compassion, and purpose.
Join us in our mission to improve the human condition across multiple diseases. If you're passionate about meaningful work and want to be part of something bigger than yourself, Caris is where your impact begins.
Position Summary
Caris Life Sciences is seeking a creative, driven, and analytically strong Data Scientist to join the Innovation Team. This role will support the development and application of machine learning and statistical methods using next-generation sequencing (NGS) and related clinical and molecular data. The Data Scientist will contribute to assay development, biomarker research, and analytic pipeline execution under the technical guidance of senior and principal data scientists. The successful candidate will demonstrate sound analytical judgment, scientific curiosity, and the ability to collaborate effectively within cross-functional research teams. Opportunities may exist to contribute to publications and scientific presentations as part of Caris’ ongoing innovation efforts.
Job Responsibilities
Processing, manipulating, and analyzing large diverse datasets generated from NGS to develop biomarkers for cancer diagnosis, prognosis, and treatment.
Developing algorithms to generate novel features and biomarkers from molecular sequencing data.
Implementing, refining, and testing analytical workflows that achieve strategic goals in molecular profiling and R&D objectives.
Utilizing state-of-the-art statistical, machine learning, deep learning, and survival analysis methods to analyze and interpret data and drive insights.
Building predictive models with both structured and unstructured datasets.
Designing and executing agentic workflows to accelerate research iteration, model development, and pipeline automation.
Creating rigorous evaluation frameworks and tracking experiments systematically using tools such as MLflow.
Required Qualifications
PhD in Data Science, Bioinformatics, Computational Biology, Genomics, Statistics, Computer Science, Engineering, or a related field, with demonstrated exposure to cancer biology or translational research.
2+ years relevant experience (or PhD + 0-3 years), depending on scope and demonstrated impact.
Strong Python; comfortable in Linux; proficient with git and collaborative workflows.
Proficiency with PyTorch and modern deep learning architectures (transformers, attention mechanisms) with experience applying ML/DL to biological or clinical data.
Working knowledge of generative AI tools and large language models for scientific research.
Familiarity with molecular sequencing data (WGS, WES, and/or RNA-seq) and common QC/processing concepts.
Familiarity with CNV calling algorithms and related analysis tools in a sequencing context.
Interest in algorithm development for feature extraction and denoising.
Preferred Qualifications
Experience with cfDNA biology or liquid biopsy analysis.
Experience designing and operating agentic AI workflows for scientific research, data analysis, or pipeline automation.
Experience with foundation models, LLM-based tooling, or AI-assisted scientific workflows in a research or production setting.
Experience with DNA methylation analysis or epigenetic signal processing.
Proficiency in AWS (EC2, S3, HealthOmics); Docker/containers.
Knowledge of survival analysis and event data.
Knowledge of wet lab sequencing processes.
Track record of peer-reviewed publications in relevant fields.
Physical Demands
Working at a computer for the majority of the day.
Training
All job-specific, safety, and compliance training are assigned based on the job functions associated with this employee.
Other
This position is based in Irving, TX (on-site).
Relocation assistance may be available for qualified candidates.
Conditions of Employment: Individual must successfully complete pre-employment process, which includes criminal background check, drug screening, credit check ( applicable for certain positions) and reference verification.
This job description reflects management’s assignment of essential functions. Nothing in this job description restricts management’s right to assign or reassign duties and responsibilities to this job at any time.
Caris Life Sciences is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, religion, color, national origin, gender, gender identity, sexual orientation, age, status as a protected veteran, among other things, or status as a qualified individual with disability.
Similar Jobs
Apple
Machine Learning - Data Scientist Lead
Salesforce
Lead Decision Scientist
CACI International
Data Scientist (AI/ML)
Salesforce
Staff Product Data Scientist - Slack
Thermo Fisher
Data Scientist, Ai Infrastructure and MCP (onsite)
Walmart