Data Scientist – Computational Biology (New Grad) – Regeneron
Regeneron (NASDAQ: REGN) is one of the world's most innovative biotechnology companies — creator of blockbuster medicines Dupixent (atopic dermatitis), Eylea (macular degeneration), Kevzara (rheumatoid arthritis), and REGEN-COV (COVID-19 antibody). Regeneron Genetics Center (RGC) is the world's largest human genetics program with genetic data on 3+ million research participants, making Regeneron a powerhouse of human genomics data science. Regeneron uses big genomic datasets to identify and validate drug targets with human genetic evidence — dramatically increasing the probability of clinical trial success. We are hiring New Grad Data Scientists in Tarrytown, NY to work on computational biology at this world-leading genetics program.
Responsibilities
- Analyze whole-exome and whole-genome sequencing data from Regeneron Genetics Center's 3M+ participant database to identify genetic variants associated with disease
- Build genome-wide association study (GWAS) analysis pipelines using Python and R — identifying drug targets with human genetic validation
- Develop polygenic risk score (PRS) models integrating thousands of genetic variants to predict disease risk and treatment response
- Apply ML methods (random forests, gradient boosting, neural networks) to predict protein structure-function relationships and antibody binding affinities
- Build single-cell RNA-seq analysis pipelines characterizing cell-type-specific gene expression patterns in disease tissue
- Collaborate with Regeneron's Antibody Discovery and drug hunting teams to translate genomic findings into therapeutic hypotheses
Requirements
- Bachelor's or Master's degree in Computational Biology, Bioinformatics, Genetics, or Data Science
- Python and R proficiency for genomics data analysis (bioconductor, biopython, plink)
- Understanding of human genetics, GWAS methodology, and genomic data types (VCF, BAM, FASTQ)
- Familiarity with cloud-based genomics platforms (AWS, Google Genomics, Terra)
- Statistical knowledge: association testing, multiple hypothesis correction (Bonferroni, FDR), and linkage disequilibrium
Benefits
- Competitive salary with Regeneron RSU equity and annual bonus
- Work on one of the world's largest human genetics programs driving drug discovery
- Comprehensive medical, dental, and vision benefits
- 401(k) with Regeneron matching
- Tarrytown, NY campus — beautiful Hudson Valley location with excellent NY Metro access