01. About Me

I build reliable data platforms and analytics that translate complex business rules into trustworthy, self-serve insights.

Recent work

  • Owned an enterprise Power BI/SSAS semantic model (300+ tables) and standardized KPI logic across product lines.
  • Added an NLP Q&A layer for Copilot with curated synonyms, metadata, and guardrails for faithfulness.
  • Hardened SQL views/CTEs that power finance and clinical dashboards, keeping DAX query times under 2s.

Earlier, I deployed a federated data lake/warehouse that cut a critical billing process from ~20 days to hours across ~350 hospitals and labs.

02. Experience

USC Institute for Creative Technologies · Research Assistant – HCI

May 2025 – Present

Research assistant on the Human-AI Trust Calibration project led by PI Gale Lucas.

  • Co-authored annotation codebooks, performed thematic coding and qualitative analysis, and documented methods plus inter-rater reliability (Cohen's κ) to support calibrated human–agent teaming.
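For readers unfamiliar with Cohen's κ: it compares the agreement two coders actually reach with the agreement expected by chance from their label frequencies. Below is a minimal sketch of that calculation. The function name and the example labels are hypothetical and made up for illustration; they are not project data or the project's actual tooling.

```python
from collections import Counter


def cohens_kappa(rater_a, rater_b):
    """Cohen's kappa for two raters labeling the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must label the same non-empty set of items")
    n = len(rater_a)
    # Observed agreement: fraction of items given the same label.
    p_o = sum(a == b for a, b in zip(rater_a, rater_b)) / n
    # Chance agreement from each rater's marginal label frequencies.
    freq_a, freq_b = Counter(rater_a), Counter(rater_b)
    p_e = sum(freq_a[label] * freq_b[label] for label in freq_a) / (n * n)
    if p_e == 1.0:
        return 1.0  # both raters used one identical label for every item
    return (p_o - p_e) / (1 - p_e)


# Hypothetical codes from two coders (illustrative only).
coder_1 = ["trust", "trust", "distrust", "neutral", "trust", "distrust"]
coder_2 = ["trust", "neutral", "distrust", "neutral", "trust", "trust"]
kappa = cohens_kappa(coder_1, coder_2)
print(f"kappa = {kappa:.3f}")
```

Here the coders agree on 4 of 6 items, and chance agreement is 13/36, so κ works out to 11/23.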

Keck Medicine of USC / University of Southern California · Data Engineer Intern

Mar 2025 – Present

Own enterprise BI/tabular semantic model; ship NLP Q&A over KPIs; tune DAX/SQL for performance and consistency.

  • Implemented an end-to-end NLP Q&A layer on a Power BI/SSAS semantic model by exposing KPI metadata, synonyms, and lineage to Copilot; improved exact-match/faithfulness on a 100-question benchmark.
  • Owned and standardized KPI definitions/documentation across 300+ tables using Power BI/SSAS, Tabular Editor, and DAX; authored hardened SQL Server views encoding business rules.
  • Improved query performance by eliminating CROSSJOIN and bi-directional filter hotspots using DAX Studio, VertiPaq Analyzer, and SQL Profiler/Query Store.
  • Designed reusable SQL CTEs and parameterized views to codify dimensions, hierarchies, and period-to-date logic for consistent reporting.

Diagnostics of America · Data Engineer & Analytics

Jun 2021 – Dec 2022

Built a federated data platform and decision-ready reporting across ~350 hospitals and labs.

  • Designed and deployed a federated data lake + warehouse across AWS (S3/Athena), Databricks, and BigQuery; automated a mission-critical billing process, cutting it from ~20 days to hours.
  • Developed reliable pipelines with Alteryx and SQL, adding incremental loads; referential-integrity (RI), outlier, and duplicate checks; and lineage/audit logs.
  • Delivered C-suite Power BI dashboards for service-line and patient-level costing; optimized DAX/SQL and established daily refresh SLAs.
  • Standardized data schemas and pipelines across entities to enable seamless integration and analytics.
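The RI/outlier/duplicate checks mentioned above can be illustrated with a small sketch. The original pipelines used Alteryx and SQL; this version uses plain Python only to show the idea. The table layout, field names, helper functions, and the 1.5×IQR outlier rule are all assumptions made for illustration.

```python
from collections import Counter
from statistics import quantiles


def duplicate_keys(rows, key):
    """Return key values that appear on more than one row."""
    counts = Counter(row[key] for row in rows)
    return sorted(k for k, c in counts.items() if c > 1)


def orphan_rows(rows, fk, valid_keys):
    """Referential-integrity check: rows whose foreign key has no parent."""
    return [row for row in rows if row[fk] not in valid_keys]


def iqr_outliers(rows, field, k=1.5):
    """Rows whose value falls outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = quantiles([row[field] for row in rows], n=4)
    low, high = q1 - k * (q3 - q1), q3 + k * (q3 - q1)
    return [row for row in rows if not low <= row[field] <= high]


# Hypothetical data (illustrative only).
facilities = {"H01", "H02"}
claims = [
    {"claim_id": 1, "facility_id": "H01", "amount": 120.0},
    {"claim_id": 2, "facility_id": "H02", "amount": 135.0},
    {"claim_id": 3, "facility_id": "H01", "amount": 128.0},
    {"claim_id": 3, "facility_id": "H01", "amount": 128.0},   # duplicate key
    {"claim_id": 4, "facility_id": "H99", "amount": 110.0},   # unknown facility
    {"claim_id": 5, "facility_id": "H02", "amount": 9500.0},  # extreme amount
    {"claim_id": 6, "facility_id": "H01", "amount": 140.0},
    {"claim_id": 7, "facility_id": "H02", "amount": 125.0},
]

dupes = duplicate_keys(claims, "claim_id")
orphans = orphan_rows(claims, "facility_id", facilities)
outliers = iqr_outliers(claims, "amount")
```

In a real pipeline, checks like these would typically run before a load and write their findings to an audit log rather than silently dropping rows.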

Link School of Business · Business Analyst

Dec 2019 – May 2022

Drove grant scoring, marketing ROI models, and monthly forecasting/insights.

  • Built a startup grant scoring model and executed market research in Excel (Power Query, Pivot Tables) to identify high-potential startups; improved grant allocation efficiency by ~30%.
  • Maximized ROAS by allocating budgets across search/social/display/email based on ROI models and CPA targets; produced forecast and pacing reports.
  • Created comprehensive performance reports and visualizations to drive strategy adjustments.

03. Some Things I’ve Built

Featured Project

ChatDB — Natural‑Language Database Query System

Streamlit + LangChain app that turns English into parameterized SQL (PostgreSQL) and MongoDB aggregations, with schema introspection, an optional local LLM (Ollama), and LangSmith tracing.

Python · Streamlit · LangChain · PostgreSQL · MongoDB · Ollama
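To illustrate why the generated SQL is parameterized, here is a minimal sketch in which the query template and its values travel separately, so a value can never change the structure of the query. It uses Python's built-in sqlite3 in place of PostgreSQL so that it runs anywhere. The table, the helper name, and the "generated" query are hypothetical and are not ChatDB's actual code.

```python
import sqlite3


def run_parameterized(conn, sql, params):
    """Execute SQL with bound parameters; values never enter the SQL string."""
    return conn.execute(sql, params).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, region TEXT, total REAL)")
conn.executemany(
    "INSERT INTO orders (region, total) VALUES (?, ?)",
    [("west", 120.0), ("east", 80.0), ("west", 45.5)],
)

# Hypothetical output of the English-to-SQL step for
# "total sales in the west region": a template plus separate values.
generated_sql = "SELECT SUM(total) FROM orders WHERE region = ?"
generated_params = ("west",)

rows = run_parameterized(conn, generated_sql, generated_params)

# A hostile value is compared as plain text, not executed as SQL,
# so it matches no region and the sum is NULL.
hostile = run_parameterized(conn, generated_sql, ("west' OR '1'='1",))
```

With a PostgreSQL driver such as psycopg the placeholder is `%s` rather than `?`, but the separation of template and values is the same.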

Featured Project

RAG Assistant for Finance KPIs

Semantic‑aware RAG for finance KPIs: exports 300+ DAX measures from Tabular Editor, embeds them in Chroma with sentence‑transformers, and tracks retrieval quality in evaluation dashboards.

Python · Chroma · sentence-transformers · DAX · Tabular Editor

Featured Project

NLP Q&A over BI Semantic Model (USC/Keck)

End‑to‑end Q&A layer on a Power BI/SSAS tabular model; exposes KPI metadata, synonyms, and lineage to Copilot, improving exact‑match and faithfulness on a 100‑question benchmark.

Power BI · SSAS · DAX · Copilot · SQL Server

Featured Project

Federated Healthcare Data Lake & Warehouse

Deployed a federated data lake/warehouse across AWS (S3/Athena), Databricks, and BigQuery; standardized schemas/pipelines for ~350 hospitals and labs; automated a billing process, cutting it from ~20 days to hours.

AWS S3 · Athena · Databricks · BigQuery · ETL/ELT

04. Skills

  • Python
  • SQL
  • DAX
  • NLP
  • LLMs
  • Data Modeling
  • Dimensional Modeling
  • ETL
  • ELT
  • Pandas
  • scikit-learn
  • PyTorch
  • Statistics
  • Hypothesis Testing
  • A/B Testing
  • Power BI
  • Tabular Editor
  • SSAS
  • PostgreSQL
  • MongoDB
  • BigQuery
  • Azure
  • AWS
  • GCP
  • Databricks
  • Spark
  • Airflow
  • DAX Studio
  • VertiPaq Analyzer
  • SQL Profiler
  • Query Store
  • LangChain
  • Chroma
  • sentence-transformers
  • Tableau
  • Excel
  • Power Query
  • Alteryx
  • Java
  • C#

05. Blog

Blog posts are coming soon.

06. Sneak Peek

Data Analytics Engineer

07. What’s Next?

I’m currently open to new opportunities. Whether you have a question or just want to say hi, I’ll try my best to get back to you!