Data Scientist / ML Engineer

Cole Campbell

I'm a new-grad data scientist and ML engineer with a deep applied portfolio in generative AI, deep learning, and statistical modeling. I build end-to-end systems — from data pipelines through model integration in production backends. I'm pursuing an M.S. in Data Science atop a 2026 B.S.

Headshot of Cole Campbell

Education

Arizona State University

B.S. in Data Science (Concentration in Computer Science), 2026

GPA: 3.52

Arizona State University

M.S. in Data Science, Analytics, and Engineering (Computational Mathematics & Data), 2027

Skills

Languages
PythonRDartC/C++JavaScriptSQLBashMATLAB
ML / AI
Deep learninggenerative modelsLLMs & prompt engineeringLoRA/QLoRA/PEFTtransfer learningneural style transferaudio/music generationmodel evaluation
Frameworks
PyTorchTensorFlowHugging Facescikit-learnNumPyPandasSciPyStatsModelsFastAPIFlutter/Riverpod
Data Science
Regression & statistical modelingclassificationtime seriesBayesianclusteringlongitudinal/panel analysisfeature engineeringA/B testingcausal inferencehypothesis testingvisualization
Data Engineering
ETL & data pipelinesREST API integrationSQLAlchemy + AlembicPostgreSQLFirebaseDockerCUDAcloud
Tools
GitLinuxJupyterVS CodeOpenCVlibrosaParaView/VTKpytestWeights & BiasesKaggle/NBA APIsREST/OpenAPI
Math
Linear algebraprobabilityoptimizationnumerical methodsstochastic processesstatistical theory

Projects

Multi-Architecture Generative Content Studio — Deep Learning Capstone

PyTorchTransformersDiffusionGANsCNNsRNNs

A unified generative system that produces story text, scene illustrations, character portraits, and background music from a single prompt.

  • Built a unified generative system producing story text, scene illustrations, character portraits, and background music from a single prompt.
  • Integrated GPT-2, Stable Diffusion, StyleGAN2/3, VGG16/19, and LSTM into one cohesive pipeline.
  • Fine-tuned GPT-2 for narrative generation, entity extraction, and structured scene descriptions; implemented Stable Diffusion with LoRA + ControlNet for composition-controlled scenes.

Tempo — AI Scheduling Assistant

FastAPIFlutterPostgreSQLLLM / NLPConstraint Optimization

A cross-platform AI scheduling app that generates, compares, and applies alternative day/week plans.

  • Built a cross-platform scheduling app (Flutter front end, FastAPI back end) that generates alternative day/week plans, compares them, and applies the one the user picks.
  • Designed a constraint-based scheduling engine that orders tasks, builds the daily timeline, resolves conflicts, and merges blocks into a coherent schedule.
  • Engineered a scenario pipeline supporting generation, diff-based comparison, trade-off analysis, and undo/redo state management.

PSID Panel Dataset Analysis — Longitudinal Economics Project

PythonStatistical ModelingLongitudinal Analysis

A longitudinal analysis of a 17,000-observation PSID panel studying income and wealth dynamics.

  • Analyzed a 17,000-observation PSID panel (1999–2023) to study income and wealth dynamics.
  • Built the full pipeline: cleaning, transformation, feature engineering, regression modeling, and visualization.
  • Found wealth roughly twice as unequal as income (wealth Gini ~0.85 vs income Gini ~0.45), with the median wealth-to-income ratio rising from 1.25 to 2.05 across the panel.
View project ↗

Scientific Visualization Pipeline (ParaView) — Self-Directed

ParaViewparaview.simpleVTKNumPy

A fully Python-scripted ParaView pipeline for 3D scientific data visualization.

  • Built a NumPy/VTK dataset (3D Gaussian concentration field + analytic ABC-flow velocity) and a fully Python-scripted ParaView pipeline (paraview.simple).
  • Generated volume renderings, Contour isosurfaces, slices, and Stream Tracer streamline tubes, with automated figure and orbit-animation export.
View project ↗

Experience

Handshake

AI Trainer — May 2026 – Present

  • Trained AI models based on their ability to reason and evaluate tasks across mathematics, codebases, and visualizations.