Accomplished Senior Bioinformatics Engineer at Generate:Biomedicines, specializing in designing cutting-edge NGS assays and creating scalable Python tools. Demonstrated expertise in enhancing data integrity and optimizing workflows through advanced machine learning techniques and effective collaboration. Acknowledged for implementing best practices and conducting impactful training sessions for multidisciplinary teams.
Chroma: Contributor to generative protein design model (Nature, 2024)
Performed high-throughput pooled expression and solubility profiling of >300 de novo Chroma-designed proteins using a split-GFP fluorescence assay coupled with Oxford Nanopore long-read sequencing. Designed and executed the full wet- and dry-lab pipeline: DNA digestion, AMPure cleanup, barcode ligation (EXP-NBD104), and MinION sequencing using Bonito (basecalling) and Minimap2 (alignment). Developed a custom demultiplexing and quantification pipeline using SeqKit, samtools, and pysam to compute enrichment scores across FACS-sorted fluorescence bins. Implemented normalization strategies and bin-weighted scoring to resolve protein-level solubility phenotypes in a pooled format, enabling structure-function insights at scale. Supported three sequencing runs totaling >30M reads across unconditional and semantically conditioned designs.
I'm a Senior Bioinformatics Engineer with 8+ years crafting production-grade NGS and AI/LLM-driven pipelines, bridging bench and data science to accelerate protein/DNA design and assay development.