I'm a full-stack software engineer with several years of experience in the Python and JavaScript ecosystems. I recently co-created Chunkwise⁠⁠, an open-source platform for evaluating document chunking strategies and deploying validated strategies for production RAG systems via automated ETL pipelines. I have hands-on experience with prompt engineering and LLM evals.

Professional Experience
Software Engineer & Co-Creator, Chunkwise⁠
10/2025 – Present
  • Architected a modular Python backend with 3 containerized FastAPI services on AWS ECS Fargate, isolating compute-heavy chunking and evaluation services from an orchestration layer via type-safe Pydantic contracts.
  • Adapted Chroma's token-level IR evaluation framework (precision, recall, IoU) to support cloud-based, corpus-specific benchmarking via reusable LLM-generated ground truth, enabling quantitative measurement of chunking impact on retrieval quality for different document types. (OpenAI APIs, embedding models)
  • Scaled the ETL pipeline to parallel document processing across AWS Batch jobs while managing the external API rate limits to prevent throttling and streaming real-time progress to the UI via Server-Sent Events.
  • Developed a RESTful orchestration API managing multi-tier data persistence across Amazon S3, RDS for PostgreSQL, and pgvector-enabled RDS, separating development state from production outputs.
  • Implemented a document normalization layer handling multi-byte Unicode edge cases, preventing replacement-character corruption during token-based chunking and ensuring accurate substring matching in token chunker evaluation.
  • Built a web interface for standardized experimentation with 8 chunkers across 2 libraries, featuring real-time chunk visualization, side-by-side metrics comparison, and one-click deployment of validated strategies. (LangChain, Chonkie, TypeScript, React)
  • Developed AWS infrastructure-as-code provisioning the full stack (VPC, ECS, RDS/​ S3, ALB, Batch, Secrets Manager), wrapped in an interactive CLI that enables rapid environment deployment and teardown for iteration and testing. (CDK, Python)
  • Authored a technical case study ⁠(chunkwise.github.io⁠) and built a professional website. (TypeScript, Astro)
  • Software Engineer, Open-Source Projects
    06/2023 – 10/2025
  • Request Bin: A developer tool for capturing and inspecting HTTP requests and webhooks via dynamically generated endpoints, with real-time data streaming via WebSockets. (JavaScript, React, Express.js, PostgreSQL, MongoDB, AWS, NGINX)
  • NYT Obituaries Explorer: A RAG system enabling semantic search across 380 New York Times obituaries, with LLM observability. (OpenAI APIs, pgvector, Arize, Python)
  • My $tocks: A full-stack stock portfolio application simulating trades via an external market API. (Python, Flask, PostgreSQL)
  • Instructional Assessment Fellow, Columbia Center for Teaching and Learning
    09/2022 – 05/2023 | New York, NY

    • Designed and executed an assessment of a training program using qualitative and quantitative data from interviews and surveys.

    • Delivered actionable insights for program redesign, increasing the completion rate by 82%.

    Skills
    Languages & Frameworks: Python, JavaScript, TypeScript, Node.js, FastAPI, Flask, Express.js, React, PostgreSQL, MongoDB, Pydantic, RESTful APIs, distributed systems, AI Engineering: RAG, vector databases, embeddings, prompt design, context engineering, tool calling, OpenAI APIs, LangChain, Chonkie, Cloud: AWS (ECS Fargate, EC2, Lambda, Batch, RDS, S3, Secrets Manager, API Gateway, IAM, CDK), DigitalOcean, Render, Infrastructure-as-Code, Tools & Practices: Git/GitHub, Docker, unit & integration test (Pytest, Jest, Vitest), WebSockets, NGINX, Linux, AI-assisted coding (Claude, Copilot, Cursor), OOP
    Education
    Columbia University, Ph.D. Candidate (ABD), M.A. & M.Phil. in Germanic Languages and Comparative Literature
    2018 – 2023 | New York, NY
    Macalester College, B.A. in German Studies & International Studies
    2010 – 2014 | Saint Paul, MN